Gene Franean1_5088 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5088 
SymbolileS 
ID5673423 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6088793 
End bp6092038 
Gene Length3246 bp 
Protein Length1081 aa 
Translation table11 
GC content72% 
IMG OID641243939 
Productisoleucyl-tRNA synthetase 
Protein accessionYP_001509353 
Protein GI158316845 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0060] Isoleucyl-tRNA synthetase 
TIGRFAM ID[TIGR00392] isoleucyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00024165 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAGTACCC CCGACGCCCC GACGCCCGCG TTCCGCCCAC TGCCCTCGCG GGTCGACCTG 
CCGGCCTTCG AGCGCGAGAC GCTGGAGCGC TGGAAGGATT CCAAGGTCTT CCACCGCTCG
CTGCAGGCCG GCGCAGACCG GCCGCTGTGG GTGTTCTACG AAGGCCCGCC GACCGCCAAC
GGCAAGCCGG GCGCCCACCA CGTCGAGGCG CGGGTCTTCA AGGACCTCTT CCCCCGTTAC
AGGACGATGA AGGGCTACCA CGTGCCCCGC CGGGCGGGCT GGGACTGCCA CGGCCTGCCT
GTCGAGCTCG CCGTCGAGAA GGAACTCGGC TTCACCAGCA AGAGCGACAT TGAGGACTAC
GGCATCGCCG AGTTCAACGA GAGGTGCCGC GAGTCCGTCC TGCGCCACGT GGCCGACTTC
TCCGCGATGA CGGAGCGGAT GGGCTACTGG GTCGACCTCG ACGGCGCCTA CCGGACGATG
GACACCTCGT ACATCGAGAG CGTCTGGTGG TCCCTCAAGC AGATCTTCGA CAAGGGCCTG
CTCGTCGAGG ACTTCCGGGT CACCCCGTAC TGCCCCCGGG ACGAGACGCC GCTGTCCGAC
CACGAGGTCT CCCAGGGCTA CAGCGACACC GACGACCCGT CGGTCTACGT GCGGTTCCCG
GTCGTCGAGG GCCCGCGCGG GCTGGCCGAG CACGGCGCGC ACCTGCTGGT GTGGACGACC
ACCCCGTGGA CGCTGGTCTC CAACACCGCG GTCGCCGTCC ATCCCGAGGT CAGCTACGTG
CTGGCCCGGG CCGCCGGCGG TGAACTGCTC GTCGTCGCCG AGCCGCTGCT GACCGCGGCC
CTGGGCGAGG GCGCCGAGGT CGTCGAGCGC TTCACCGGCG CCGAGCTCGC CGGCACCCGC
TATGCCCGGC CGTTCGAGCT GATCGCGCTG GAGCGCTTCG AGGAGGCGGC GGGGAAGACC
GCCGTCGGCC GGGGCGCCGG GACATCCGCC CGCCCGCACT CGGTGGTCCT GGCGGACTAC
GTGACGACCA CGGACGGCAC CGGGCTCGTG CACCAGGCAC CCGCCTTCGG CGCCGAGGAC
CTCGCGGTCT GCCGCGCCAA CGGCCTCGGC GTCGTCAACC CGGTCGGGAC GGACGGGCGT
TTCCTGCCCG AGATCCCGCT GGTCGGCGGG CTGTTCTTCA AGGACGCCGA CGCGCCGCTG
TCCGCGGACC TCACCGCGCG CGGCCTGATG TGGCGGGCCG CGACCTACAC GCACAGCTAC
CCGCGCTGCT GGCGCTGCCA CACACCGCTG ATCTACTACC CGCTGCCGTC CTGGTACATC
CGCACGACCG CCGTCCGCGA GCGCCTGCTC GCCGAGAACG CGCGGACGGA CTGGCACCCT
GAGCGCATCC GCGAGGGCCG CTACGGCGAG TGGCTGCGCA ACAACGTCGA CTGGGCGCTG
TCGCGCAACC GGTACTGGGG CACCCCGCTG CCCGTCTGGC GGTGCGAGGC CGACCCGTCC
CACCTGACCT GCGTCGGCTC GCTGGCGGAG CTCTCCGAGC TGACCGGCAC GGACCATTCG
ACGCTCGACC CGCACCGCCC GTTCGTCGAC GACGTGACCC TGCCGTGCCC CACCTGTGGG
GAGACCGCCC GGCGGGTGCC TGAGGTCATC GATGTCTGGT ACGACAGCGG CGCGATGCCG
TTCGCCCAGT GGGGCGCCCC GCACCGCAAC GCCGACGAGT TCGCGAAGCA GTACCCGGCG
CAGTACATCT GCGAGGCGAT CGACCAGACC CGGGGCTGGT TCTACACGCT CATGGCCGTG
GGCACGCTGG TCTTCGACCG GTCGTCCTAC GAGACCGTGC TGTGCCTGGG CCTGCTGCTC
GACGCCGAGG GCCGCAAGAT GAGCAAGCAC GTCGGTAACG TGCTGGACCC GTTCGACCTG
TTCGACCGGC ACGGGGCGGA CGCCGTCCGG TGGCTGATGC TGGCCGGCGG CTCACCGTGG
TCGGACCGCC GGGTCAGCCA CGAGTCGATC GAGGACATCG TCCGCAAGAT CCTGCTCACC
TACTGGAACA CCGCGTCGTT CTTCGCGCTC TACGCGGGCG CGGCGGACTG GTGCCCGACC
GGCGCCGACG GGTCGGGCGA GGGCGCCGCG GCCCCGGCCG AACGCCCGGT GCTGGACAGG
TGGGCGCTCT CCGAGCTCGC CGACACCGTC GCCGAGGTGG ACGCCGCCCT CGAGGGCTTC
GACGCGCTGC GCGCCGGGCG CCGCATCGCC CGCTTCGTGG ACGACCTGTC GAACTGGTAC
GTCCGGCGTT CCCGCCGCCG CTTCTGGGCG GGCGACCCGA ACGCGCTGGC CACCCTCTAC
ACCTGCCTGG ACGGCCTGAC CCGGGTGATG TCGCCGTTCA CCCCGTTCCT GACCGACTGG
CTGTGGTCCC GGCTGTTCGC CGGCATCCTG CCCGGCGCCC CGGACTCGGT GCACCTGGCC
TCGTGGCCGC AGCTGCCGGC GGACCTCGTC GAGCCGGGCC TGGCCGAGCG GATGGACCTC
GTGCGCCGGA TCGTCGAGCT CGGCCGCGCG GCCCGGGCCG CCAGCGGGCT GCGCACCCGT
CAGCCGCTGC CGCGTGCGGT GGTCGGCTCC TCGGCGTTCG ACGACCTCTC GGCGGAGCTG
CGCGCGCAGA TCGCCGAGGA GCTCAACGTC GTCGCCGTCG AGGCCGCGAC CTCGGACCTC
GTCGACATCA GCGTGAAGCC GAACTTCCGG GCGCTGGGAC GGCGCTTCGG CAACCGCACG
AAGGCGGTCG CCGCCGCCGT CACCACCGCC GGGGCACCGG TGGACGGCCG GCTGACCGTC
ACCGTGGACG GCGAGCGCAT CGAGCTCGGC GGTGAGGATC TGATCGTCAC CGAGACACCG
CGCGAGGGCT GGTCGGTGAC GAGTGAGTCG GGCCTGTCGG TCGCCCTCGA CCTGACCGTC
ACCCCGGAAC TGGCCCGGAC CGGGCTCGCC CGCGACGTGG TGCGCGTCCT GCAGGACGCG
CGGCGGGCCG CGGGGCTGGA GATCACCGAC CGGGTCGAGC TGCGCTGGGT CGCCGCCAAG
GAGGAGACGG CGGCAGCGTT GCGTGAGCAC GCGGCGACCG TCGCCGACGA GATCCTGGCC
ACAGTGTTCC AGGAGGCACC GGTCGAGCAG ACGGCGGAGC CGGCATGGCA CCGGGGATCA
TCCGCGGAGC TCGGCCTGAC CTTCGCGCTG ACCAGGACGA CCCCCGCGCC GCCGACGGTT
GGCTGA
 
Protein sequence
MSTPDAPTPA FRPLPSRVDL PAFERETLER WKDSKVFHRS LQAGADRPLW VFYEGPPTAN 
GKPGAHHVEA RVFKDLFPRY RTMKGYHVPR RAGWDCHGLP VELAVEKELG FTSKSDIEDY
GIAEFNERCR ESVLRHVADF SAMTERMGYW VDLDGAYRTM DTSYIESVWW SLKQIFDKGL
LVEDFRVTPY CPRDETPLSD HEVSQGYSDT DDPSVYVRFP VVEGPRGLAE HGAHLLVWTT
TPWTLVSNTA VAVHPEVSYV LARAAGGELL VVAEPLLTAA LGEGAEVVER FTGAELAGTR
YARPFELIAL ERFEEAAGKT AVGRGAGTSA RPHSVVLADY VTTTDGTGLV HQAPAFGAED
LAVCRANGLG VVNPVGTDGR FLPEIPLVGG LFFKDADAPL SADLTARGLM WRAATYTHSY
PRCWRCHTPL IYYPLPSWYI RTTAVRERLL AENARTDWHP ERIREGRYGE WLRNNVDWAL
SRNRYWGTPL PVWRCEADPS HLTCVGSLAE LSELTGTDHS TLDPHRPFVD DVTLPCPTCG
ETARRVPEVI DVWYDSGAMP FAQWGAPHRN ADEFAKQYPA QYICEAIDQT RGWFYTLMAV
GTLVFDRSSY ETVLCLGLLL DAEGRKMSKH VGNVLDPFDL FDRHGADAVR WLMLAGGSPW
SDRRVSHESI EDIVRKILLT YWNTASFFAL YAGAADWCPT GADGSGEGAA APAERPVLDR
WALSELADTV AEVDAALEGF DALRAGRRIA RFVDDLSNWY VRRSRRRFWA GDPNALATLY
TCLDGLTRVM SPFTPFLTDW LWSRLFAGIL PGAPDSVHLA SWPQLPADLV EPGLAERMDL
VRRIVELGRA ARAASGLRTR QPLPRAVVGS SAFDDLSAEL RAQIAEELNV VAVEAATSDL
VDISVKPNFR ALGRRFGNRT KAVAAAVTTA GAPVDGRLTV TVDGERIELG GEDLIVTETP
REGWSVTSES GLSVALDLTV TPELARTGLA RDVVRVLQDA RRAAGLEITD RVELRWVAAK
EETAAALREH AATVADEILA TVFQEAPVEQ TAEPAWHRGS SAELGLTFAL TRTTPAPPTV
G