Gene Athe_2198 details


Gene Information

Locus tag: Athe_2198
Symbol:
ID: 7408394
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 2325901
End bp: 2326968
Gene Length: 1068 bp
Protein Length: 355 aa
Translation table: 11
GC content: 34%
IMG OID: 643716566
Product: protein of unknown function DUF43
Protein accession: YP_002574046
Protein GI: 222530164
COG category: [R] General function prediction only
COG ID: [COG1568] Predicted methyltransferases
TIGRFAM ID:
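
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon that is not counted in the protein length. The variable names are chosen for this example and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2325901
end_bp = 2326968
reported_gene_length = 1068    # bp
reported_protein_length = 355  # aa

# Assumption: coordinates are 1-based and inclusive, hence the +1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1068 355
print(gene_length == reported_gene_length)
print(protein_length == reported_protein_length)
```

Under those assumptions both derived values agree with the reported ones.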


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000324238
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGATGTAT TAAAAGGTGC TGTTGATTTT GTACAGAACA AGACAAAGGT TGAAGTAAAC 
CAAAGAGATA TAGAAAAAAT ATTGTCGGCG CTGAACTCCA CAAACCATTT TTGGGAAGTT
ATTTTCCTTT CACAAAAACC ATTTGCTGTG GTAAGAGAAA CAATCAACTA TCTTATTTCA
ATAGATTTTG TTAAGACGGA TGAATCGGGT AACTTGATAT TAACAGAAAA AGGAAAAGAA
TTTATAAGCG CTAACAATAT TCCCGTTGTA AAAAATTACA CTTGTTCCTA CTGTGAAGGA
AGAGGAATAG TCTTTTCTGA AATCAAGGAT GCTTATGAAA AGTTTAAAGA GATTGTCAAG
ACAAGACCTG ATGCAATAGT TGAATATGAC CAAGGTTATG TAACAGAAGA GACAGCTTTC
TCAAGAATTG CTCTTATGAT TAAGAAGGGC GATTTAGTAG GAAAAAGGCT AATAGTATTT
GGTGACGATG ACCTTGTGTC AATCGCAGCA GCACTAACAA AACTTCCAAA AGAGGTCATA
GTTTTAGAGA TAGATAAGCG TCTTGTTGAG TTTATAAATC AGGCTGCAAA AGAACACAAT
TTAAACCTCA AAGCTATTGA ATATGACTTT AGAAACAAAC TTCCTGATGA TTTTGTAAAA
AGCTTTGACA CATTTACAAT AGATCCACCT GAGACAATTG AAGCTTTGGA CCTTTGCTTT
ACAAGGACAA TTTCAAGCTT AAAAGGTGCA GGCTGTGCAG GCTACTTTGG TCTTACAAAC
ATCGAAGCTT CACTTTCAAA ATGGCATGAA TTTCAAAAAC TTCTTTTGAA CAAGTTCAAT
GCGGTTATTA CAGACATCAT TGAGAATTTC AATCATTATG TAAACTGGAA CTATCTCTTG
CCATCGCTTG AGAGTAGCCT TACTTTTGTA AATGTTCAAC CAAAGCTCAA CTGGTACACA
TCAAGTATGT ACAGGATTGA GCTTGTAAAG GATGTAGACA TTAAAAATGA ATTTATTAAT
TGTGAACTTT ATATAGACAA CGAAGCTATA CTTTATAAGG AAAATTAA
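
A GC content figure such as the one reported for this gene is the share of G and C bases in the nucleotide sequence. The helper below is a minimal sketch of that calculation; `gc_percent` is a hypothetical name introduced here, and the example runs only on the first ten bases of the gene, so its result is not the whole-gene value.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    # Remove the spaces and line breaks used to group bases in blocks of ten.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = bases.count("G") + bases.count("C")
    return 100.0 * gc_count / len(bases)


# First ten bases of the gene sequence above (4 G, 0 C out of 10).
first_block = "GTGGATGTAT"
print(gc_percent(first_block))  # 40.0
```

Passing the full gene sequence, with or without the block spacing, gives the whole-gene percentage.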
 
Protein sequence
MDVLKGAVDF VQNKTKVEVN QRDIEKILSA LNSTNHFWEV IFLSQKPFAV VRETINYLIS 
IDFVKTDESG NLILTEKGKE FISANNIPVV KNYTCSYCEG RGIVFSEIKD AYEKFKEIVK
TRPDAIVEYD QGYVTEETAF SRIALMIKKG DLVGKRLIVF GDDDLVSIAA ALTKLPKEVI
VLEIDKRLVE FINQAAKEHN LNLKAIEYDF RNKLPDDFVK SFDTFTIDPP ETIEALDLCF
TRTISSLKGA GCAGYFGLTN IEASLSKWHE FQKLLLNKFN AVITDIIENF NHYVNWNYLL
PSLESSLTFV NVQPKLNWYT SSMYRIELVK DVDIKNEFIN CELYIDNEAI LYKEN
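
The protein sequence can be related back to the gene sequence by translating it codon by codon with translation table 11. The sketch below is a simplified illustration, not the database's own pipeline: `translate_cds` is a hypothetical helper, the amino acid assignments follow the NCBI ordering for table 11, and only ATG, GTG and TTG are treated as initiators (a subset of the start codons the table allows). An initiator codon in first position is read as M, which is why the gene starts with GTG while the protein starts with M.

```python
from itertools import product

# NCBI-style layout: bases in TCAG order, third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a reduced set of table 11 initiators, enough for this example.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    if len(bases) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for index in range(0, len(bases), 3):
        codon = bases[index:index + 3]
        if index == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGGATGTAT TAAAAGGTGC TGTTGATTTT"))  # MDVLKGAVDF
```

The printed residues match the first ten residues of the protein sequence listed above.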