Gene Franean1_2121 details


Gene Information

Locus tag: Franean1_2121
Symbol:
ID: 5670521
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 2546208
End bp: 2549267
Gene Length: 3060 bp
Protein Length: 1019 aa
Translation table: 11
GC content: 75%
IMG OID: 641241042
Product: glycyl-tRNA synthetase, beta subunit
Protein accession: YP_001506463
Protein GI: 158313955
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0751] Glycyl-tRNA synthetase, beta subunit
        [COG0752] Glycyl-tRNA synthetase, alpha subunit
TIGRFAM ID: [TIGR00211] glycyl-tRNA synthetase, tetrameric type, beta subunit
            [TIGR00388] glycyl-tRNA synthetase, tetrameric type, alpha subunit
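
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the gene length includes one terminal stop codon; the record does not state either convention explicitly, and the function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 2546208
END_BP = 2549267
GENE_LENGTH_BP = 3060
PROTEIN_LENGTH_AA = 1019


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming the CDS ends with a single stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 3060
print(expected_protein_length(GENE_LENGTH_BP))  # 1019
```

Under these assumptions both results agree with the listed Gene Length (3060 bp) and Protein Length (1019 aa).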


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGACCA TGCAGGACGC CCTGCTGACG TTGACCCGCT ACTGGACCGA GCGCGGCTGC 
ATGATCGTGC AGCCGTTCAA CACCGAGGTC GGCGCCGGTA CGCACAACCC GGCGACCATC
CTGCGGGTGC TCGGCGGCGA GCCGTGGCGG GTGGCCTACG TCGAGCCCTC GGTGCGCCCG
GACGACTCCC GCTACGGCTT CAACCCGAAC CGGCTGCAGA CCCACACCCA GTTCCAGGTC
GTCCTCAAGC CCGACCCGGG CAACCCTCAG GAGCTCTACC TGGGCAGCCT GCAGGCCCTG
GGCATCGACA CCGCTGCGCA CGACGTCCGC TTCGTGGAGG ACAACTGGGC CTCGCCGGCG
CTCGGCGCCT GGGGGCTGGG CTGGGAGGTG TGGCTGGACG GGCAGGAGAT CACCCAGTTC
ACCTACTTCC AGCAGGCCGG CGGCCAGAGC CTGGACCCGG TGAGCGTCGA GATCACCTAC
GGGATCGAGC GGATCCTGAT GGTGCTGCAG AAGGTCCGGC ACTTCACCGA GATCGCCTAC
GCGCCGGGGG TCTCCTACGG CGAGGTGTTC GGCCAGGCCG AGTACGAGAT GTCGCGGTAC
TACCTCGACG AGGCCGACAT CGACACGGTG CGCCGCATGT ACGCCGACTG CGCCGCCGAG
GCGCGCCGCC TCATCGACGC CCGGCTGCCC GTGCCGGCGC ACATCTACGT GCTGAAGTGC
TCGCACGCGT TCAACATCCT GGACTCGCGC GGCGCGGTCT CCACCACCGA GCGGGCCACC
TCCTTCGCCC AGATGCGCGG CATGTCGCGC GAGGTCGCCC AGCTCTGGCG GGACAGGCGC
GACGAGCTCG GCGACCCGCT GGGGGAGGCG CCGCGCCTGG CCCCGGCCGC CGCCCGGCCG
GTCGACCCGG CCGCCGTGCC GGCGGGGCCG GCCACCCTGC TGTTCGAGAT CGGCACCGAG
GAGCTGCCCG CCGCCGAGGT GACCCGCCAG ACGGCCGCCG TCCGCGCGGC CCTCGTCGAG
CGGCTCGCGG CGACCCGGCT CACCTATGGC GAGCTGCGGG TGCAGGGCAC CCCGCGGCGC
ATCGTCGCGA TCGTCGACGA CGTCGCGCCG CGTGAGCCCG ACGTCGAGCG GGTCGTGCGC
GGCCCGCGCC GCTCCGCCGC CTACGACGCC CAGGGCGCGC CGACGAAGGC CGCGACGGGT
TTCGTGCGCG GGCAGGGCGG CGACGTGGCG GACCTGACGG TCGTCGAGCA CCGCGGGGTG
GAGCACGTCG CCCTGGTGCG CACCGACACC GGCCGTCCCG CGACGGAGGT GCTCGCCGAG
GTCCTCGGCG CCGTCCTGGG CGGGCTGCGC GCCGAGCGGA ACATGCGCTG GAACGACCCG
GAGCTGTCCT TCAGCCGTCC GGTGCGCTGG GTGGTGGCGC TGCTGGGCGG GGCGGTGGTC
CCGGTCGCGG CGTCGACGCT GGCCGCGGGC TCCACCACCC GCGGGCACCG GCGCGCCGGG
TCCCCGCCGA TCGAGGTGAC CGGCGCCGTC GGCTACCCGG AGCTGCTCGC CGAGAACTCC
GTCCTGCTCG ATCCGGCCGC GCGCCGCGAA CTGATCGTCG ACGAGGCCGC GAAGCTGGCG
GCCGAGCACG CGGGCACGAT CGACCTCGAC ACCGAGGCCG ACGTCCTCGA CGAGGTGACC
AACCTGGTCG AGTTCCCGCG TCCGGTGCTC GGCTCCTTCG AGCCGCGTTA CCTCGACCTG
CCTGCCGAGA TCCTCACCAC CGTCATGCAC AAGCACCAGC GCTACCTGGC GGTGCGGGAC
ACGGCCGGGG CGCTGCTGCC CTGCTTCGTC ACGGTCGCGG ACGGCGCGGT GGACGTGCCG
CTGGTCCGCG CGGGCAACGA GGCGGTGGTG CGGGCCCGGT TCGAGGACGC GGCGTTCTTC
TTCGACGCCG ACCTCAAGGT CCCGCTGGAG ACGTTCCGCG CGGGGCTGGC GAAGCTGACC
TTCGAGCAGC GGCTCGGGTC GGTGGCCGAC CGCGCGGAGC GCATCGCGCG GCTGGCCGGC
CTGTTCGCGG ACCGGCTCGC CACCTCCGAC GCGGGCGCGC TGTCCCCGGC GGAGCGGCGG
ACGCTGGACC GCGCCGGCGA GCTGGCCAAG TTCGACCTCG CCACCCAGAT GGTGATCGAG
CTGTCCAGCC TGGCCGGGAC GATGGCCCGC GAGTACGCCC GCCGCGCCGG CGAGCCGGAG
GCGGTCGCCG TCGCGCTCCA CGAGATGGAG CTGCCGCGCC GGGCCGGCGG GACGCTGCCG
GCCACGGTCC CCGGCGCGCT GCTCGCCCTG GCCGACCGGA TCGACCTGCT TGCCGGGCTG
TTCGCGCTCG ACGCGGCGCC CACCGGCAGC TCCGACCCGT TCGGTCTGCG CCGCGCCGCG
CTGGGCGTCA CCGCGATCCT GGCCGAGCGG CCGGAGCTGG CCGGGATCAC CGTGGCCGAC
ATGCTGGCCG AGGCCGCGAA GCTCGTCCCG GTGCCGGTTC CGGAGGCGGC CCTGGAGCAG
GCCGACGCGT TCGTGCGCGG CCGGTTCGCC CAGTACCTGC TCGACACCGG CGTCGACCAC
CGGCTGGTCG GCGCGCTGCG CCCGCTGACC GGCCGGCCCG GCCAGGCGAT GGTGACCCTG
AAGGCGCTGC GCCCGCTGGT GGAGACCGCG TCGTTCGCGC GGCTGGCCGC CGCCCTGGGG
CGGGTCCGCC GGATCGTCCC GGCCGACGTC GCGCCGGGCA TAGAGATCAG CGCCCTCGTC
GACCCCGCCG ACCACCGCCT CGCCCAGGCC GTCACCGAGC TGTCCCGCCA GCTGGCGGCG
CGCACGGCCG GCCGCCTGCC CGGGCTTGAC GACTTCGTGG CCGCCGCCGG GCAGCTCCCG
GACGCGGTCG ACGGCTTCTT CGACGCGGTA CTGGTGATGG CCGACGATCC CGCGCTGCGC
GCGGCCCGGC TCGGGCTGCT GGCGGAGATC CGCGACGTCG CCGCCGAGGT GCTGGACTGG
GACGCGCTCG GCCCGCTGCC GGCCGCGCCG GGCGGCGCCG ACGGCCCCAC CGGCACCTGA
 
Protein sequence
MLTMQDALLT LTRYWTERGC MIVQPFNTEV GAGTHNPATI LRVLGGEPWR VAYVEPSVRP 
DDSRYGFNPN RLQTHTQFQV VLKPDPGNPQ ELYLGSLQAL GIDTAAHDVR FVEDNWASPA
LGAWGLGWEV WLDGQEITQF TYFQQAGGQS LDPVSVEITY GIERILMVLQ KVRHFTEIAY
APGVSYGEVF GQAEYEMSRY YLDEADIDTV RRMYADCAAE ARRLIDARLP VPAHIYVLKC
SHAFNILDSR GAVSTTERAT SFAQMRGMSR EVAQLWRDRR DELGDPLGEA PRLAPAAARP
VDPAAVPAGP ATLLFEIGTE ELPAAEVTRQ TAAVRAALVE RLAATRLTYG ELRVQGTPRR
IVAIVDDVAP REPDVERVVR GPRRSAAYDA QGAPTKAATG FVRGQGGDVA DLTVVEHRGV
EHVALVRTDT GRPATEVLAE VLGAVLGGLR AERNMRWNDP ELSFSRPVRW VVALLGGAVV
PVAASTLAAG STTRGHRRAG SPPIEVTGAV GYPELLAENS VLLDPAARRE LIVDEAAKLA
AEHAGTIDLD TEADVLDEVT NLVEFPRPVL GSFEPRYLDL PAEILTTVMH KHQRYLAVRD
TAGALLPCFV TVADGAVDVP LVRAGNEAVV RARFEDAAFF FDADLKVPLE TFRAGLAKLT
FEQRLGSVAD RAERIARLAG LFADRLATSD AGALSPAERR TLDRAGELAK FDLATQMVIE
LSSLAGTMAR EYARRAGEPE AVAVALHEME LPRRAGGTLP ATVPGALLAL ADRIDLLAGL
FALDAAPTGS SDPFGLRRAA LGVTAILAER PELAGITVAD MLAEAAKLVP VPVPEAALEQ
ADAFVRGRFA QYLLDTGVDH RLVGALRPLT GRPGQAMVTL KALRPLVETA SFARLAAALG
RVRRIVPADV APGIEISALV DPADHRLAQA VTELSRQLAA RTAGRLPGLD DFVAAAGQLP
DAVDGFFDAV LVMADDPALR AARLGLLAEI RDVAAEVLDW DALGPLPAAP GGADGPTGT
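
The gene and protein sequences above are printed in blocks of ten characters, so the whitespace has to be removed before they can be compared. The sketch below is one way to do that, using only the Python standard library; the helper names `clean`, `gc_content`, and `translate` are hypothetical. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in the permitted start codons, which this sketch does not model). Only the first line of the gene sequence is used as input here.

```python
from itertools import product

# Standard codon assignments in NCBI base order T, C, A, G ("*" marks a stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence as displayed above.
first_line = "ATGCTGACCA TGCAGGACGC CCTGCTGACG TTGACCCGCT ACTGGACCGA GCGCGGCTGC"
print(translate(first_line))  # MLTMQDALLTLTRYWTERGC
```

The printed residues match the first twenty residues of the protein sequence. Applying `gc_content` to the full gene sequence would give a value to compare against the listed GC content of 75%.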