Gene Information |
Locus tag | Athe_0322 |
Symbol | |
ID | 7407639 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 366987 |
End bp | 367964 |
Gene Length | 978 bp |
Protein Length | 325 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 643714712 |
Product | protein of unknown function DUF199 |
Protein accession | YP_002572235 |
Protein GI | 222528353 |
COG category | [S] Function unknown |
COG ID | [COG1481] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR00647] conserved hypothetical protein |
|
|
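The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that adds no residue to the protein; neither convention is stated in the record itself, but both agree with the numbers listed.

```python
# Values copied from the Gene Information table.
start_bp = 366987
end_bp = 367964

# Assumption: coordinates are inclusive on both ends, so add 1.
gene_length = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
# Assumption: the last codon is the stop codon and encodes no residue.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 978 0 325
```

The result matches the listed Gene Length (978 bp) and Protein Length (325 aa).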
Plasmid Coverage information |
Num covering plasmid clones | 39 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTTTTT CATCGACTGC AAAAGCAGAA GTGAGCAAAA AACTTTCTCA AAATTCCTGC TGCGCAAGAG CTGCAGCTGC AGCATTTTTG AAATTCACAG GGAACATATA TGAACTTGAT GGTACTTTTT CATTTAAAAC TTCATTTGAA AATGCTCAGA CTGCAAGGTC TTTTTTCCTT CTCATGAAAA ACGGGTTTTC AAAACACTGT GAGGTAAGTA TCAAGAAAAA TAGTAAGCTG CAGAAAAACT ATGTATATAC TATCTTTTTA CCTCCAAGCA GCGACAACAT AGGCGTTTTG AAAGACCTTC ATTTTGTTAG AAAAGGTGCA AAAGAGTATC ATCTCAGTTT TTCGCTAAAA GAGGAGCTTG TGAGGAAAAA ATGCTGCAGA AAGGCTTTTT TGCAGGCAAC ATTTTTGTCG TGCGGTTCTA TTACAAACCC TGAGAAGATG TATCATTTGG AGTTTGATGT GAAAACAAAG GATGATGCGG AGTTTTTGCA GAAGGTTTTA AAGAGTTTTG AGTTTGAGGC AAAAATTGTT GAAAGAAAGT CTCATTATGT AGTATACTTA AAAGAAGGTG ATAGAATAGT AGACTTTTTA AACATAATTG GAGCACATTC TTCACTTTTA GAGCTTGAAA ATATTCGCAT AGTAAAAGAG CTCAGAAACA ATGTCAACCG TCTTGTCAAT TGTGAAACAG CTAATTTAGA AAAGACTATA AATGCTTCTA TGAGGCATAT AGAAAATATT GAATTTATTG AAAGAACAAT TGGTATTGAA AACCTTCCAC AGAATCTGCA AGAGATAGCA CGCCTTAGGA TTAAATATAA AGATGCCTCT TTGAAAGAAT TAGGTAATAT GCTTGAAAAA CCACTTGGCA AATCTGGTGT CAATCATAGG CTGAGGAAAA TAGATAAAAT TGCCGAGGAA CTTAGAAAAG GAGGAGTAGT ACATGCAAAG CCTTCACATG AAGATTGA
|
Protein sequence | MSFSSTAKAE VSKKLSQNSC CARAAAAAFL KFTGNIYELD GTFSFKTSFE NAQTARSFFL LMKNGFSKHC EVSIKKNSKL QKNYVYTIFL PPSSDNIGVL KDLHFVRKGA KEYHLSFSLK EELVRKKCCR KAFLQATFLS CGSITNPEKM YHLEFDVKTK DDAEFLQKVL KSFEFEAKIV ERKSHYVVYL KEGDRIVDFL NIIGAHSSLL ELENIRIVKE LRNNVNRLVN CETANLEKTI NASMRHIENI EFIERTIGIE NLPQNLQEIA RLRIKYKDAS LKELGNMLEK PLGKSGVNHR LRKIDKIAEE LRKGGVVHAK PSHED
|
| |
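The Sequence fields can be cross-checked against the GC content and Translation table entries. The following is a minimal sketch, not the database's own method: the helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and the codon table is the standard one, whose codon-to-amino-acid assignments translation table 11 shares (table 11 differs in the set of permitted start codons, which does not matter here because the gene begins with ATG). Only the first 30 bases of the gene sequence are used as input.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence in this record.
fragment = "ATGTCTTTTT CATCGACTGC AAAAGCAGAA"
print(translate(fragment))  # MSFSSTAKAE
```

The translated fragment matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the same comparison against the GC content and the complete protein sequence.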