Gene Athe_0801 details


Gene Information

Locus tag: Athe_0801
Symbol:
ID: 7407988
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 890949
End bp: 892892
Gene Length: 1944 bp
Protein Length: 647 aa
Translation table: 11
GC content: 37%
IMG OID: 643715179
Product: threonyl-tRNA synthetase
Protein accession: YP_002572689
Protein GI: 222528807
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0441] Threonyl-tRNA synthetase
TIGRFAM ID: [TIGR00418] threonyl-tRNA synthetase
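
The coordinates and lengths listed above are mutually consistent, which can be checked with a few lines of Python. This is a minimal sketch, not part of the original record: the function names are hypothetical, and it assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Expected protein length for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(890949, 892892)
print(length)                     # 1944
print(protein_length_aa(length))  # 647
```

Both values match the Gene Length and Protein Length fields of the record.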


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00121365
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACAAAA TAGCTGTGAC ACTTCCAGAC GGAAAAGTGT TGGAAGCAGA AAAAGGAATA 
TCTGGTATGG AGTTTATAAA GACAATCTCC ATGAGACTTT ACAAAGAAGC AGTTGCGTGT
AAGATAGATG GTGTTTTGAA GGATTTGTGG ACATCTCTTG AAAGAGATTG CAGCTTTGAG
GTTGTGACAT TTTCAAGTGA TGAGGGTAAA AAGGTTTATT GGCACACAAC GTCTCATATT
TTAGCTCAGG CTGTTAAAAG GCTCTTTGGC GATAAAGTAA AGCTTGGCAT AGGACCTGCA
ATTGACAATG GTTTTTATTA CGACTTTGAT ATTGAAGAGT CAATTACGAG GGAGATTTTA
GAGAAGATTG AAGAAGAGAT GCAAAAAATA ATAAAAGAAG ATTTAAAGAT TGAGAGATTT
GAACTTTCAA GAGAAGAAGC AATAAAACTT ATGCAACAAA GAGGAGAAAA CTACAAGGTT
GAGCTTATAA ATGATATTCC AGAGGGAGAG ATTATATCTT TTTATAGGCA AGGTGAATTT
GTTGACCTGT GCACAGGACC ACATCTTCCT TCAACAGGGA GAGTAAAAGC ATTCAAACTA
CTTTCTGTAG CAGGTGCGTA TTGGCGCGGA AATTCTAAAA ATAAGATGCT TCAAAGAATT
TATGGAATTT CTTTTGAAAA GAAATCTGAA CTTGATGAGT ATCTCAAAAA GCTTGAAGAG
GCAAAAAAAC GTGACCACAA CAGAATAGGA AGAGAGCTTG AGATTTTTAC AACATCCGAG
GTTGTAGGGC AAGGTCTTCC TCTTTTAATG CCAAAAGGCG CAAAAATCCT CCAGATTCTT
CAAAGGTTTG TTGAGGATGA AGAGGAAAAG AGAGGTTATC TTCTGACAAA GACTCCTTAC
ATGGCAAAAA GTGATTTGTA TAAAATCTCT GGGCACTGGG ACCATTACAG AGATGGCATG
TTTGTAATAG AGGAAGATGA AAATGAAGTT CTGGCTTTAA GACCAATGAC ATGTCCATTT
CAGTTTTTGA TATATAACTC AAAGCAGAGA AGCTACAGGG ATTTGCCAAT AAGATACAGT
GAGACATCAA CACTTTTCAG AAATGAAAGT TCAGGTGAGA TGCACGGACT TATAAGGGTT
AGGCAGTTTA CTCTTTCAGA TGCACACATA ATCTGCAGAC CCGACCAGGT TGAGGAAGAG
TTCAAAGGAG TTTTAGATTT GATTCAGTAT ATAATGAAAA TACTTGGTAT CGAGAATGAT
ATATGGTACA GATTTTCGCG CTGGGACCCG AACAACAAGG AAAAATATAT TGACAATCCT
GAGGCTTGGG AAAAGACAGA AAATGATATG AAAAATATCT TAGATAAGCT TGGAATAAAT
TATAAAGAGG CAAAAGGTGA GGCTGCTTTT TATGGACCAA AACTTGACAT TCAGTTCAAA
AACGTCTATG GTAAAGAGGA TACAATAATA ACAATACAGA TAGATTTTGC TCTGGCAGAA
CGGTTTGATA TGACGTATGT AGACAGAGAT GGGCAGAAGA AAAGACCGAT TATTATCCAC
CGCTCATCGA TAGGTTGTTA TGAAAGAACG CTTGCGATGC TTATTGAAAA GTACAACGGC
GCTTTTCCGC TGTGGCTGGC GCCTGTGCAG ATTAGAGTGA TTCCTGTTTC TGATAATTTC
AATGAGTATG CTAAAAATGT CGCAAGGATA TTAAAAGAAA ATGGGTTTAG GGTAGAAGAA
GATTACAGGT CAGAAACAGT AGGGTACAAG ATAAGAGATG CTCAGCTTCA AAAGATACCG
TATATGGTGA TTGTAGGTGA AAAGGAGCAA AAAGAGAATA CTGTGGCAGT AAGAGATAGG
AAAAAAGGAG ATTTGGGTTC GTTTACCATT GAAGATTTTA TTGCAATGGT AAAAGAAAAA
GTAGACAAAA AAGTGATAGA GTAA
 
Protein sequence
MDKIAVTLPD GKVLEAEKGI SGMEFIKTIS MRLYKEAVAC KIDGVLKDLW TSLERDCSFE 
VVTFSSDEGK KVYWHTTSHI LAQAVKRLFG DKVKLGIGPA IDNGFYYDFD IEESITREIL
EKIEEEMQKI IKEDLKIERF ELSREEAIKL MQQRGENYKV ELINDIPEGE IISFYRQGEF
VDLCTGPHLP STGRVKAFKL LSVAGAYWRG NSKNKMLQRI YGISFEKKSE LDEYLKKLEE
AKKRDHNRIG RELEIFTTSE VVGQGLPLLM PKGAKILQIL QRFVEDEEEK RGYLLTKTPY
MAKSDLYKIS GHWDHYRDGM FVIEEDENEV LALRPMTCPF QFLIYNSKQR SYRDLPIRYS
ETSTLFRNES SGEMHGLIRV RQFTLSDAHI ICRPDQVEEE FKGVLDLIQY IMKILGIEND
IWYRFSRWDP NNKEKYIDNP EAWEKTENDM KNILDKLGIN YKEAKGEAAF YGPKLDIQFK
NVYGKEDTII TIQIDFALAE RFDMTYVDRD GQKKRPIIIH RSSIGCYERT LAMLIEKYNG
AFPLWLAPVQ IRVIPVSDNF NEYAKNVARI LKENGFRVEE DYRSETVGYK IRDAQLQKIP
YMVIVGEKEQ KENTVAVRDR KKGDLGSFTI EDFIAMVKEK VDKKVIE
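
The sequences above are printed in space-separated blocks of ten characters. The following Python sketch shows one way to strip that layout, compute GC content, and translate the gene into protein. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and not part of the source database. The codon table is written out by hand and assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code, differing only in permitted start codons.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 is assumed
# to share these assignments (it differs in allowed start codons).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence from this record.
first_row = "ATGGACAAAA TAGCTGTGAC ACTTCCAGAC GGAAAAGTGT TGGAAGCAGA AAAAGGAATA"
print(translate(clean(first_row)))  # MDKIAVTLPDGKVLEAEKGI
```

The printed peptide matches the first twenty residues of the protein sequence. Applying `clean` to the full gene sequence and passing the result to `gc_fraction` and `translate` gives values that can be compared against the reported 37% GC content and the 647 aa protein.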