Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_1619 |
Symbol | |
ID | 7409449 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 1720468 |
End bp | 1721568 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 643715988 |
Product | protein of unknown function DUF34 |
Protein accession | YP_002573486 |
Protein GI | 222529604 |
COG category | [S] Function unknown |
COG ID | [COG0327] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR00486] dinuclear metal center protein, YbgI/SA1388 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.15807 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGTAAGTG CTCAGGAGAT AATTTCGTTT ATAGAAACCT ATTTTCCCAA AAAACTCTCA TATGAATGGG ACAACTGCGG TCTTCAGGTT GGAAGTTATT CAGACAAAGT AGATTCAGTT TTGATATGTG TGGATGTGAC AGAGGAGGTC TTAAAAGAAG CTATCTTGCT TGGAGCAAAG CTTATAATTT CCCATCATCC ACTTATTTTT CAGGGAATCA AAAGTATAAA AGATGACACA CCAGAAGGAA GGATTATCAT AGATGCTATC AAAAACGGCA TAAATATATA TTCTGCTCAC ACCAGTGCAG ATGTCTCGAA ACATGGTATA AACTACTGGC TTGCCAATCT CATAGGTCTT GAAAACATTG AGGGTTTGAA CATCAAACAA AAAAGTGGGT ATTTTAAAGT TGTTGTGTAT GTACCAGTAG ACTATGTACA AAATGTGTTA GAGGCAATGG CAAATGAAGG TGCGGGCTTT GTTGGGAAAT ACAGCCATTG CTTTTTTGCA GTCGAAGGTG AAGGAAGTTT TAAACCTCAA GAAGGTGCAA AACCTTTTTT AGGACAGGTG GGGAGGCTTG AAAAGGTTAA AGAGGTAAGA CTTGAGAGCA TAGTGCCTGA AGATAAGCTC AAAAATGTAA TAAAATCGAT GTTAAAAGCT CATCCTTATG AAGAAGTTGC ATATGACATA TACCGGCTTG AAAATGATAT ATCATATGAA AGTTTAGGAG TTGTTGGAGA GAGAGAGGTT TTGGCAAAAG AACTTATCTT AGAGCTAAAA CAAAAACTAA ACCTTGATTT TGTAAAAGCA AGCATTCAAA AAGATGCTTT TAAGAAGATA GCCATTGTCA GTGGTTCTGG TAAAGACCTT ATAAAAGATG CATATTTCAA AGGTGCAGAC TGTCTTATCA CAGGCGAAGT TGGTCACCAC GGGATTTTGC TGGCAAAGTC GCTATCGATG AGTATAATAG AGCTTGGACA TTATGAGAGC GAGAAGGTGT TTGTGGATAT CGTTTACAGC CTTTTTGAAG ACTTTAAGAA AAAAGATGAT CTGAAAATAT ATAAATCCAA AATCAATACC AGCTTTACAA ACATTTACTA A
|
Protein sequence | MVSAQEIISF IETYFPKKLS YEWDNCGLQV GSYSDKVDSV LICVDVTEEV LKEAILLGAK LIISHHPLIF QGIKSIKDDT PEGRIIIDAI KNGINIYSAH TSADVSKHGI NYWLANLIGL ENIEGLNIKQ KSGYFKVVVY VPVDYVQNVL EAMANEGAGF VGKYSHCFFA VEGEGSFKPQ EGAKPFLGQV GRLEKVKEVR LESIVPEDKL KNVIKSMLKA HPYEEVAYDI YRLENDISYE SLGVVGEREV LAKELILELK QKLNLDFVKA SIQKDAFKKI AIVSGSGKDL IKDAYFKGAD CLITGEVGHH GILLAKSLSM SIIELGHYES EKVFVDIVYS LFEDFKKKDD LKIYKSKINT SFTNIY
|
| |