Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_2198 |
Symbol | |
ID | 7408394 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 2325901 |
End bp | 2326968 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 643716566 |
Product | protein of unknown function DUF43 |
Protein accession | YP_002574046 |
Protein GI | 222530164 |
COG category | [R] General function prediction only |
COG ID | [COG1568] Predicted methyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00000324238 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGATGTAT TAAAAGGTGC TGTTGATTTT GTACAGAACA AGACAAAGGT TGAAGTAAAC CAAAGAGATA TAGAAAAAAT ATTGTCGGCG CTGAACTCCA CAAACCATTT TTGGGAAGTT ATTTTCCTTT CACAAAAACC ATTTGCTGTG GTAAGAGAAA CAATCAACTA TCTTATTTCA ATAGATTTTG TTAAGACGGA TGAATCGGGT AACTTGATAT TAACAGAAAA AGGAAAAGAA TTTATAAGCG CTAACAATAT TCCCGTTGTA AAAAATTACA CTTGTTCCTA CTGTGAAGGA AGAGGAATAG TCTTTTCTGA AATCAAGGAT GCTTATGAAA AGTTTAAAGA GATTGTCAAG ACAAGACCTG ATGCAATAGT TGAATATGAC CAAGGTTATG TAACAGAAGA GACAGCTTTC TCAAGAATTG CTCTTATGAT TAAGAAGGGC GATTTAGTAG GAAAAAGGCT AATAGTATTT GGTGACGATG ACCTTGTGTC AATCGCAGCA GCACTAACAA AACTTCCAAA AGAGGTCATA GTTTTAGAGA TAGATAAGCG TCTTGTTGAG TTTATAAATC AGGCTGCAAA AGAACACAAT TTAAACCTCA AAGCTATTGA ATATGACTTT AGAAACAAAC TTCCTGATGA TTTTGTAAAA AGCTTTGACA CATTTACAAT AGATCCACCT GAGACAATTG AAGCTTTGGA CCTTTGCTTT ACAAGGACAA TTTCAAGCTT AAAAGGTGCA GGCTGTGCAG GCTACTTTGG TCTTACAAAC ATCGAAGCTT CACTTTCAAA ATGGCATGAA TTTCAAAAAC TTCTTTTGAA CAAGTTCAAT GCGGTTATTA CAGACATCAT TGAGAATTTC AATCATTATG TAAACTGGAA CTATCTCTTG CCATCGCTTG AGAGTAGCCT TACTTTTGTA AATGTTCAAC CAAAGCTCAA CTGGTACACA TCAAGTATGT ACAGGATTGA GCTTGTAAAG GATGTAGACA TTAAAAATGA ATTTATTAAT TGTGAACTTT ATATAGACAA CGAAGCTATA CTTTATAAGG AAAATTAA
|
Protein sequence | MDVLKGAVDF VQNKTKVEVN QRDIEKILSA LNSTNHFWEV IFLSQKPFAV VRETINYLIS IDFVKTDESG NLILTEKGKE FISANNIPVV KNYTCSYCEG RGIVFSEIKD AYEKFKEIVK TRPDAIVEYD QGYVTEETAF SRIALMIKKG DLVGKRLIVF GDDDLVSIAA ALTKLPKEVI VLEIDKRLVE FINQAAKEHN LNLKAIEYDF RNKLPDDFVK SFDTFTIDPP ETIEALDLCF TRTISSLKGA GCAGYFGLTN IEASLSKWHE FQKLLLNKFN AVITDIIENF NHYVNWNYLL PSLESSLTFV NVQPKLNWYT SSMYRIELVK DVDIKNEFIN CELYIDNEAI LYKEN
|
| |