Gene Athe_2197 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2197 
Symbol 
ID7408393 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2324596 
End bp2325783 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content33% 
IMG OID643716565 
ProducttRNA pseudouridine synthase D TruD 
Protein accessionYP_002574045 
Protein GI222530163 
COG category[S] Function unknown 
COG ID[COG0585] Uncharacterized conserved protein 
TIGRFAM ID[TIGR00094] tRNA pseudouridine synthase, TruD family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000286837 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGTGGATTT TAAAGGGTGA TAGTAGTTTG AAGATTAAGG TTTTACCAGA GGATTTTGTT 
GTGAGAGAAA AACTTAAGAT TGAAATAAAA CCGTCGGGCA AATACAAAAT TTATCTTTTA
ACAAAAAGGC ACTGGAACAC TGTTGATGCT CTCAGGTTCA TTTCAAAAGA AAACAAGATT
CCACTTGAGA AGATTGGCTG TGCTGGGAGA AAAGATAGAC ACGCACTCAC TTTTCAGTAT
ATTTCTGTGC CAAGAGAATA TACCATAAAT TTTAACAAAG AAAATGTAAA GGCTCAGTTC
ATAGGGTATT CAGATGATTT TGTATCTCCT TTAATTCTCG AAGGAAATTT TTTTGAAATA
ACAATAAGAA AATTAAAAAA TAAGGATGAA AAAATTTTAC AAAGACTGAA CGAAGTACAG
CAGTTTGGTT TTCCCAACTA CTTTGATGAC CAGCGGTTTG GAAGCATCCA AAATGAAGAT
GAGTTCATAG GCGAAAAGAT TGTTAAAAAA CATTACAACG GTGCTCTCAA ACTTTATTTT
ACAACCATTC ATCCTGAAGA TAAAAAAGAA GAAAAGGAGA GGAAGAAAAA AATTTCTGAG
CTTTGGGGAG ATTTCGAAAA GATACTGCCT CTTTGTAAAA CGAAGGTAGA AAAGAATATT
ATAAAAACTC TCCTTAAAGG CAAGAGCAGA CACTATTTAA TTAGAGCGCT CAATCTTATT
TCCAAAGAAG AGATGTCTAT TTTTCTTTCT GCTTATCAGA GCTATATATG GAACAGGACA
TTGATTGCAG TTTTGCCCTA TTATGTTGAT TTGTTAAAAC CTGTAAAGGG GAAAATTATG
GACTATTTAA TTTACCCTAT ACTTTCAACA AAGTCTTTAA ATAACTTAAA AAACCTTCAG
ATTCCAACCG TCTCATCAAA AATACCATAT GTGAGTGATA TTGTAAATAA CGCTATTTTA
GAGATTTTAA ATGAAAGAGG AGTAAAGCCT TCTGATTTTG ATACCAAAAA GATCAGGAGC
TGGTATTTTA AATCGTTTTT AAGACCTGCC ATAGTCTTTC CAGAAAAGCT TGAAGTGTTC
GACTTTGAAG AAGATGACTT TTATGAAGGG TATTATAAGC TTAGGATAAA ATTCTATCTT
CCTGCAGGTT CTTTTGCGAC CATGCTTATG AAAAGTTTAA CTGTTTAA
 
Protein sequence
MWILKGDSSL KIKVLPEDFV VREKLKIEIK PSGKYKIYLL TKRHWNTVDA LRFISKENKI 
PLEKIGCAGR KDRHALTFQY ISVPREYTIN FNKENVKAQF IGYSDDFVSP LILEGNFFEI
TIRKLKNKDE KILQRLNEVQ QFGFPNYFDD QRFGSIQNED EFIGEKIVKK HYNGALKLYF
TTIHPEDKKE EKERKKKISE LWGDFEKILP LCKTKVEKNI IKTLLKGKSR HYLIRALNLI
SKEEMSIFLS AYQSYIWNRT LIAVLPYYVD LLKPVKGKIM DYLIYPILST KSLNNLKNLQ
IPTVSSKIPY VSDIVNNAIL EILNERGVKP SDFDTKKIRS WYFKSFLRPA IVFPEKLEVF
DFEEDDFYEG YYKLRIKFYL PAGSFATMLM KSLTV