Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_0312 |
Symbol | |
ID | 7407629 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 356356 |
End bp | 357819 |
Gene Length | 1464 bp |
Protein Length | 487 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 643714702 |
Product | LmbE family protein |
Protein accession | YP_002572225 |
Protein GI | 222528343 |
COG category | [S] Function unknown |
COG ID | [COG2120] Uncharacterized proteins, LmbE homologs |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCACCGT TTAAAGCTAA AAGTTCTGAA GGTGAAGTTT ACAGAAAACC TCAGAATGAA AGAAGAATAA GATTTAAGCG AAAGTACATT TTATATGCAT TTGTTATATG GATTGTTGCA AATGTTCTTG TATTTTTGGC AAGAGAATAT ATTTCAAACT GCTTTTTTTC CGCTCAGCTT CCGCAGCTTC GGGTTGAAAA TTACAAACGT ATACTCGTCT TTGCCCCACA CTGTGATGAT GAAACTTTGT CTTCTGCAGG TGTGATACAA AAAGCTCTTT TGAGTGGGAG TAAAGTTAAA GTTGTTGTTA TGACAAATGG AGATGGCTTT ACACGTGCTG CAGGGCAGAA CTTTGGCAAG ATAAGACTTA CTCCTGATGA CTATATAAGA TTTGGTTATC TTCGACAGAA TGAAACAATC CATGCGCTTG AAGACTTAGG TTTGAAGAGG GAAGATATAA TTTTTTTAGG ATATCCTGAT AGGGGTTTGC GGTTTTTATG GGAAAAATAT TTTGATTCTA AGATAGGGTA TTTTAATCCT TTAACACGCA CATTCAAAAG TCCGTATTCA AACTCCTATC AGAGGGCAGT AGAATACAAG GGAATAAACG TTGTGAAAAA TATTCAGAGC ATAATAAAAT TATTTGAACC TGATTTAGTA ATCTATCCAT ACTCACGCGA CCAGCATCCT GACCACTGGG CAACATCAGC ATTTGTCAAG TTTTCAATCC TGACGCTAAA TTATAAATGT GAAGAGTGGC AATATCTTGT GCACAGAGGA GACTGGCCGA CTCCTTTTGG CAAACACCCT CAGATGTATC TTGTTCCGCC TTTCAAGCTT GCATTTACAG ATACAAAGTG GTATCAGGTA CCACTGGATG ACTATATGAT AGAAAGAAAA TCAAATTCAA TCTCAGACTA CCACTCTCAG ATGAAGGTTA TGAGAGGATT TTTGGAGGCA TTTGTTCGTC AAAACGAATT ATTCGCAAAA TCAGATAGCA AAGATGCAAA AAAATATGAG GGTGATAACC TGTTTTCAGA CAAGTATTTA GTTAGCAAGG AGCCAACGCA CGACATATGG TCACTCATTT TTGAAAAAGG TGCTGATATT GAAGCGATAT TTGCGGCTCA CGATAGCAAG AACATCTATA TAGGTATCGA AATGGTTGGT TCTGCTAAAA AACTTATTTC GTATTATCTT CACATAAGAG CATTTGAAGA TTATAAATAT CTTGGCAGAA TATATATTTT GGTTTCTGGA AACAAGATGA ATGTAGTAAA GACTATGACA ACCCCTGCTT TTTCATTTCA AAATGCAAAG ATGCGCAGAA AAAAGAATCA GATTGAGATA GTATTTTCCA AAAAAGATTT ACAAAATCCG AACATGCTTT ATTTGAGCGT GCGCACAGAA TTTCTTGGTC GTCAACTTGA CAGAAGCGCA TGGAAGGTAG TAAAGCTCAA ATAA
|
Protein sequence | MPPFKAKSSE GEVYRKPQNE RRIRFKRKYI LYAFVIWIVA NVLVFLAREY ISNCFFSAQL PQLRVENYKR ILVFAPHCDD ETLSSAGVIQ KALLSGSKVK VVVMTNGDGF TRAAGQNFGK IRLTPDDYIR FGYLRQNETI HALEDLGLKR EDIIFLGYPD RGLRFLWEKY FDSKIGYFNP LTRTFKSPYS NSYQRAVEYK GINVVKNIQS IIKLFEPDLV IYPYSRDQHP DHWATSAFVK FSILTLNYKC EEWQYLVHRG DWPTPFGKHP QMYLVPPFKL AFTDTKWYQV PLDDYMIERK SNSISDYHSQ MKVMRGFLEA FVRQNELFAK SDSKDAKKYE GDNLFSDKYL VSKEPTHDIW SLIFEKGADI EAIFAAHDSK NIYIGIEMVG SAKKLISYYL HIRAFEDYKY LGRIYILVSG NKMNVVKTMT TPAFSFQNAK MRRKKNQIEI VFSKKDLQNP NMLYLSVRTE FLGRQLDRSA WKVVKLK
|
| |