Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_0578 |
Symbol | |
ID | 7406919 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 652880 |
End bp | 654358 |
Gene Length | 1479 bp |
Protein Length | 492 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 643714961 |
Product | PBS lyase HEAT domain protein repeat-containing protein |
Protein accession | YP_002572477 |
Protein GI | 222528595 |
COG category | [C] Energy production and conversion |
COG ID | [COG1413] FOG: HEAT repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.94571 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATTCAG CTATGGAAAT AATTGAAATG CTAAATTCAG ATGACTATCT ATTACAAAAA GAAGCTATTG AAAATGCACC AAAGTTTAAG GAACCTGAAA TTGTTGATAG ATTAATAGAT CTTTTTATAA AAACAAACAA TAAAATGATT GAGGAGCACA TTACAGAAGC TTTAAAACAA ATAGGTGGCA GTTATACTGT TGAAAAATTA TTGAGGCTTT TAGACCATGA AGAAGCAAGG GTTAGGGTTT TTGCTTTTGA GGTTTTAAGT AAAATAGGTA ATGATAATAT TCACGCAATA ATCAAAGAGG CGGAAAATCC AGATAAAAAT GTTAGGAAAT TTGTAGTGGA TATTTTAGGT GCTCTTAAAA ACAAAGAAGC AGTTGATACT TTATTGAAAA GGTTATCAGA TGATGATGTA AATGTAGTAC AGGGGGCCAT TGAGGCACTT GGCAATATTG GGGATGTTGA AGCTCTCAAA AAGGTAATTG AGTTTTTACC TTCTGCTCAT CTGTGGGTAC AGTGGACAAT AATTGAGAGC ATAAAAAAAG TGAACAATAG AGAACTAATT TCAGAGGTTT TAAATCTGCC ATGGGAGATT GAGGATATCA TCTTTGACAG TATTTTTGAC ATGGTGAAAG AAAATGGTAC CCTTGAGAAT GTGGAGGATG CAGTAAATCT TTATTTAAAG CTTTCAACAC AGCTGAGGAT AAAGGTTTTA GATACCATAT ATTCGATTTA TATAAAGTCA GATAAACAAA AAGTTGAAAA AGTTTTATCA AATACAAGCT TTTTTGATGA AATAAAAACC ATTTTGATAT ATGGTTCTGA TACACAAAAA TTTGAGATTT TTAACTATAT GGGTAGCATA GAAGATAAGG ATTTTGTAAG GTTTATAAAG AGTAGGGTGT TTGATGAAAC AGTTATTCTA AGCGCTATAA AACTCTATTA CGCATCGAAT ACTTTAGAGA AAAGAGAATT AGTTAGAGTG TTTAAATATT TTAATAAGTC AAAACTGGTT GAATATATGA GAGAGATATT CAAAGGTGAT GATAATATCT TAAAGCTTAG CGGGTTGAAG ATTATGAGAT ACAATGGTAT CAAGGAGGCA GCAGATGTGT TGCCAGAGAT GATAAAAGAA GGAGAGCTTT TACCAGAGGT CTTAAAAACT GTGATAGAAC TAAACTTGAA GGAGTTATTT GATAGAATTT ATGAAGAATA TTTCAATGTC AAAAGCGATG ACTTAAGACT TTTGATGCTT GAATGTATGG TTGAACTAAG ACCTGACGAT GCAAAGGTTG TGGCTCTAAT TAAAGATGAG CTTGCAAATG AGTATTTATC TGATGTTCAT ATTCTCAAGC TTTTACAACT TGTAAGAAAA ATTAATGATA AAGAACCATT TAGATTACAG TTAGAATATT TGGCTGACCA TCCTAATATA GAGATTTCTA TTGAAGCTCA GGACCTGCTT GGAGGCTAA
|
Protein sequence | MNSAMEIIEM LNSDDYLLQK EAIENAPKFK EPEIVDRLID LFIKTNNKMI EEHITEALKQ IGGSYTVEKL LRLLDHEEAR VRVFAFEVLS KIGNDNIHAI IKEAENPDKN VRKFVVDILG ALKNKEAVDT LLKRLSDDDV NVVQGAIEAL GNIGDVEALK KVIEFLPSAH LWVQWTIIES IKKVNNRELI SEVLNLPWEI EDIIFDSIFD MVKENGTLEN VEDAVNLYLK LSTQLRIKVL DTIYSIYIKS DKQKVEKVLS NTSFFDEIKT ILIYGSDTQK FEIFNYMGSI EDKDFVRFIK SRVFDETVIL SAIKLYYASN TLEKRELVRV FKYFNKSKLV EYMREIFKGD DNILKLSGLK IMRYNGIKEA ADVLPEMIKE GELLPEVLKT VIELNLKELF DRIYEEYFNV KSDDLRLLML ECMVELRPDD AKVVALIKDE LANEYLSDVH ILKLLQLVRK INDKEPFRLQ LEYLADHPNI EISIEAQDLL GG
|
| |