Gene Gdia_2149 details


Gene Information

Locus tag: Gdia_2149
Symbol: thrS
ID: 6975577
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 2379421
End bp: 2381346
Gene Length: 1926 bp
Protein Length: 641 aa
Translation table: 11
GC content: 69%
IMG OID: 643391678
Product: threonyl-tRNA synthetase
Protein accession: YP_002276522
Protein GI: 209544293
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0441] Threonyl-tRNA synthetase
TIGRFAM ID: [TIGR00418] threonyl-tRNA synthetase
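
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the last codon of the gene is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2379421
end_bp = 2381346

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# One codon per 3 bp; the final (stop) codon is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1926 641
```

Both results agree with the Gene Length and Protein Length fields of the record.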


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.320592
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 50
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTGCCA TCACCCTGCC TGACGGATCC GTTCGCCGCT TTGACGGTCC GGTGACGGGC 
ACTATGGTGG CGGAGTCCAT CGGCCCGGGC CTGGCCCGCG CCGCGCTGGC GATGGAGGTG
GACGGCGCGC TGGTCGACCT GTCGCGCGAA ATCGCCGACG ACGCGTCGGT CCGCTTCATC
ACCCGCAAGG ACGACGCGGC GCTGGAGATG ATCCGCCACG ACACCGCCCA TGTCCTGGCC
GAGGCCGTGC AGTCCCTGTG GCCCGGCACC CAGGTCACCA TCGGCCCGTC GATCGAGAAC
GGCTTCTATT ACGATTTCTA TCGCAACGAG CCCTTCACGC CCGAGGACTT CCCCGCGATC
GAGGCCCGCA TGCGCGAGAT CGTGGCCGCC AACGCCCGCT TCGAGCGTGA GGTCTGGCCG
CGCGACGAGG CGATCCGCTT CTTCGAGAAC CGGGGCGAGC GCTTCAAGGC CGAACTGATC
CGCGACCTGC CGGAAAGCGA GCCGATCTCG ATCTACCGCC AGGGCGAATG GCTGGACCTG
TGCCGTGGCC CGCATCTGCG CGGCACGGCG GACGTGGGCA GCGCCTTCAA GCTGATGAAG
GTGGCCGGCG CCTACTGGCG CGGCGACCAC CGCAACCCGA TGCTGACGCG CATCTACGGC
ACCGCGTGGC GCGACCAGAA GGAACTGGAC GCCCACCTGC ACCGGCTGGA GGAAGCGGAA
CGCCGCGACC ATCGCCGCAT CGGGCGCGAG ATGGACCTGT TCCATATCCA GGAAGAGGCC
GTCGGTTCGA TCTTCTGGCA CCCCAAGGGC TGGCGCCTCT ATACCGCGTT GCAGGATTAC
ATGCGCCGCG CCCAGACCCG GGGCGGCTAC CAGGAAGTCC GCACCCCGCA ACTGGTCGAC
CGCGCGCTGT GGGAGGCCTC GGGCCACTGG GACAAATATC GCGAGCACAT GTTCATCGCG
ACGGTCGAGG ACGAGGACAA GACCCTCGCG CTGAAGCCGA TGAACTGCCC GTGCCATGTC
CAGATCTTCC GCCACGGCCT GCGGTCCTAT CGCGAACTGC CGCTGCGCAT GGCGGAATTC
GGCGCCTGCC ATCGCTACGA GCCCTCGGGC GCGCTGCACG GCATCATGCG CGTGCGTTCG
TTCACCCAGG ATGACGCCCA CATCTTCTGC ACCGAGTCGC AGATCGCGGC CGAGACGGCG
CGCTTCGTGC GCATGCTGGC CGAAGTCTAT GCCGACCTGG GCTTCGAAAG CTTCCGGGTG
AAATTCGCCG ACCGGCCGGA ACAGCGCGCC GGCAGCGACG AGACCTGGGA CCGGGCCGAG
GGCGCGCTGA TCGAGGCCTG CCGCCTGGCC GGCGTCGAAT ACGAGTACAA CCCCGGCGAG
GGCGCGTTCT ACGGGCCGAA ACTGGAATTC GTGCTGCGCG ACGCCATCGG CCGCGACTGG
CAGTGCGGCA CCCTGCAGGT CGATTACGTG CTGCCCGAGC GGCTGGACGC GTCCTTCGTC
GGCGAGGACA GCGCCCGCCA CCGCCCGGTG ATGCTGCATC GCGCGATCCT GGGCTCGTTC
GAGCGCTTCC TGGGTATCCT GATCGAGCAG CATGCGGGCC GGTTCCCGCT GTGGCTGGCG
CCGGTGCAGG TGGTGGTGGC CTCGATCGTC ACCGACGCCG CGCCCTATGC CGAACAGGTG
GCCGAGACGC TGACGCAGGC GGGCCTGGTG GTCGAGACCG ACATCCGGAA CGAGAAGATC
AACGCCAAGG TGCGCGAGCA CAGCCTGGCC CGCGTGCCGG TGATCCTGGT CGTCGGCCGC
AAGGAGGCCG AGGACGGCAC CGTCGCGATC CGCCGCCTGG GCGGCGCGGC GCAGGAGGTC
ATGAGCCTGG CCGATGCCGC GACCGCGCTG GCGGCCGAGG CCCTGCCGCC CGACCTGCGC
CGGTAA
 
Protein sequence
MPAITLPDGS VRRFDGPVTG TMVAESIGPG LARAALAMEV DGALVDLSRE IADDASVRFI 
TRKDDAALEM IRHDTAHVLA EAVQSLWPGT QVTIGPSIEN GFYYDFYRNE PFTPEDFPAI
EARMREIVAA NARFEREVWP RDEAIRFFEN RGERFKAELI RDLPESEPIS IYRQGEWLDL
CRGPHLRGTA DVGSAFKLMK VAGAYWRGDH RNPMLTRIYG TAWRDQKELD AHLHRLEEAE
RRDHRRIGRE MDLFHIQEEA VGSIFWHPKG WRLYTALQDY MRRAQTRGGY QEVRTPQLVD
RALWEASGHW DKYREHMFIA TVEDEDKTLA LKPMNCPCHV QIFRHGLRSY RELPLRMAEF
GACHRYEPSG ALHGIMRVRS FTQDDAHIFC TESQIAAETA RFVRMLAEVY ADLGFESFRV
KFADRPEQRA GSDETWDRAE GALIEACRLA GVEYEYNPGE GAFYGPKLEF VLRDAIGRDW
QCGTLQVDYV LPERLDASFV GEDSARHRPV MLHRAILGSF ERFLGILIEQ HAGRFPLWLA
PVQVVVASIV TDAAPYAEQV AETLTQAGLV VETDIRNEKI NAKVREHSLA RVPVILVVGR
KEAEDGTVAI RRLGGAAQEV MSLADAATAL AAEALPPDLR R
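
The gene and protein sequences above can be cross-checked against each other. The following is a rough sketch, not the database's own method: it strips the spacing used in the listing, translates codons with the amino-acid assignments that NCBI translation table 11 shares with the standard code, and stops at the first stop codon. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical. Alternative start codons, which table 11 permits, are not handled because this gene begins with ATG.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# The amino-acid assignments are the same for tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used in the listing; upper-case."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence above.
first_line = "ATGCCTGCCA TCACCCTGCC TGACGGATCC GTTCGCCGCT TTGACGGTCC GGTGACGGGC"
print(translate(first_line))  # prints: MPAITLPDGSVRRFDGPVTG
```

The printed peptide matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the GC content field.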