Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_2581 |
Symbol | |
ID | 7409535 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 2718467 |
End bp | 2721457 |
Gene Length | 2991 bp |
Protein Length | 996 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 643716945 |
Product | hypothetical protein |
Protein accession | YP_002574419 |
Protein GI | 222530537 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.520562 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTGATT TAGAAAAACT CAAAAACCCA GATAATTTTT ACAGGTGCGC ACCTTTCTGG AGTTGGAATG ACAATTTAAA AGAGGAAGAG CTTTTGCGCC AGATAACTGA AATGCACCAA AAAGGTTATG GCGGTTTTTT CATGCACTCA AGGGTTGGGC TTGTGACAGA GTATCTGTCT GAAGAGTGGC TAAATCTTGT CAAAAAATGT ATAGAGCATG CAAAAAAGCT AAACATGCTT GCATGGCTTT ATGATGAGGA CAAGTGGCCG TCTGGTTTTG CTGGGGGTGC TGTGGCTTTT AAAAATCCTT CATATAGGCA CAAGTTTTTA GTACTTTTGA AAGAGGACCA GGTTGAACAC GACGATGAGC TGCTCTCAAG CTTTGTCCAC AGAAACACAA AGTACATTGT TGCAAAGCGC ACAATGAAGC TTGGCGACAA GTGGTTCAAT GGAAGCTGCT ATGTCGACCT TCTTTCAAAA GAGGCAACTT TGGAGTTTAT AAACCTCACA CATGAGAGAT ACAAAAGCTA TTGCCAAGAC TATTTTGGTG ATGCAATGCC GGGAATATTC ACCGATGAGC CAACATATTT AAGGGTACAC TACAAAGATA TTGCTACATT GCCATGGACA GAAAAGCTTC CAGAAAGGTT TTTGCAGAAA AAAGGATATG ACATAAAAGA GCACTTTGAA GAGCTCTTCT TCAATGTGGG TAATTACCAC AAGGTCAGGT TTGACTTTTT TGATATTGCC CTTGAAATGT TCATAGAAAA CTTTACCATT CCTTATGCAA AGTGGTGCGA AGAAAATGGT ATTATGATGA CAGGACACTA CATGGCAGAA GATACAATGC GCGGCCAGAT TGAATGGATA GGCGCTGCAA TGCCCCACTA TGAGTACATG CAGCTTCCTG GAGTTGACAA GCTTGCAAGA CATTTAGAGC AGGTAGTTAC AATGAAACAG GTGTCATCGG TTGCAGAGCA GCTTGGCAAA AAAGGGGTAC TGTGCGAAAC CTTTGGAACA ACAGGTCAGC ATGTAAGTTT TCTTCACAGA AAATGGATAT CAGACTGGCA GGCAGTGCTT GGTATAACAT ACATAAACCC ACATCTTAGC CTGTATTCCA TGAGAGGTGA GAGAAAAAGA GACTATCCAC CAAACCTTTT TTACCAGCAG CCATGGTGGG AAGATGAAAA GTTCTTTGCA GACTATCTTG CAAGAATTTC GTACATAGCA GGGCTCGGCA AAAGAGATGT TGATGTGCTT GTTCTCCACC CCATATCATC TGCCTGGGCA GAGTATTCTA AGTTTGATGA CAGCGTGGAC AAGCTGGATA TTCTGCTTGA CAAAACGGTA AAAGAGCTTA TAGCAAACAA GATTGACTTT CACTTTGGCG ATGAGATAAT CCTATCAAAG TATGCACAAG TTGAAAATTG CAAAATTAAA GTTGGCAGTT ATAGCTACAA AGTAGTTGTC CTGCCGCCTC TTACAAACTT GAGAAAGACA ACACTTAAGC TTTTGCAAAA CTTTGTAAAT ACCGGTGGCA AGATAGTTGC ACTTAAAGAT TTCATGTTTT CAAGATTTGA ACTTTGCATG ATAGACGGCA GCAAGTGTGA CATTCCTTTA AAAGAAAAAT TCAAAAATGC TTCAGATTTA AACGAACTTG TGAACATCTT AAAAGAAGAG GTTTCAAACT ATATTGAAGT GATTGACAAA AAGACAGGTC AAAACGCTAA AAAAATCATA TTTCAGAACA GAAAGCTAAA TGATGGAAAC AGAATAATAT TTTTGGCAAA CACAGGCCTT CAAAGAGAAG CAGAAATCAC AATAAAAATT CTTACTGACA AAAATGTTTT TGTCGCAGAC CTTGTAGATT TTGGTGTGTT CAAGATTCCA ACATTAAGAA GGGAAAATGG TTGTGCTGTG ATTGATGCAA CAATGTATCC TGCATCAAGT TTGTGTCTTT TGGTATCTAG CAAAGAGCTT TCGCAAAACA CAAAAAATGT CATTTCGGGC GTTGTATTTG ACAATAGTTT TGAATACAAA ACAGGCAGTT GTGAGTTTGA CATAGCGCTT AAAAACTACA ATACATTGAT ACTTGATAGA ATAAAATATG AGGTTGATGG CAAGGTAATT TTTGAAGATT GTTATTGTGC CCAGGTTTGG CACAAGCATT TTTATAATCT CCCGGAAGGA ACACCTTTCA AAGCCACGTA CCATTTTGAG CTTGAAAAAG TGCCGTCAAA GCTTTTTGTG GCAATAGAGT GTGCAGAGAA TCTTGACATG ATTCTTGTAA ATAATCAGCC TGTCAAATTT GAAAGAAAAA GCGAAAGTTT TTCGCCCGAC CAAAACTTTT TGGATGTGAA CATTGGCAAG ATTGATATTA CATCTTTTGT CAAAGAAGGC AAAAATGAAA TAGTACTCAG TGGCAGAAAG TCAAACAACA TCACAGCACC TGGCTGCCAT GAGAGGGTAA AAGACCCAAA AAACCACAGA CCAACTGAGG TAGAAGCCAT TTATCTTGTA GGAAGCTTTT CTCTTATTTG CGTTGATGAG ACAAGGTTTA TTTTAACCGA GCCTAAAAAA CCATGTCATT TTGATATAAC AAAAGACGGG TATCCTTTTT ATGTCGGGTC GGTAAGCTTA AAAAGCTCTT TTGAGATTAC AAAAGAAAGT TCAAAAAGAG TTTATATAAA GCTCAATAGC GTCAGCGCAG CAGTTGCCAA AGTTTTTGTA AACGGCAAGA AAACATGCAC CCTTTTTTCC CAGCCGTTTT TGGCTGACAT AACAGAGTAT GTAAATGACG GTAAAAATGA GCTTGAGATA GTTTTAACAA ACACACTTTT TAACCTCATA GAAGCAAACC ACAAAGCAGA CGTGTTTGAT GAGCTGTACA GACGTCCCCA GAGCTTTATT GACTTTGAAA ATTTCACATC ACGATATATG ATCTTGCCAT TTGGCCTTGG AAGTTATAGT ATATTAACAT CCAATGTTTA A
|
Protein sequence | MLDLEKLKNP DNFYRCAPFW SWNDNLKEEE LLRQITEMHQ KGYGGFFMHS RVGLVTEYLS EEWLNLVKKC IEHAKKLNML AWLYDEDKWP SGFAGGAVAF KNPSYRHKFL VLLKEDQVEH DDELLSSFVH RNTKYIVAKR TMKLGDKWFN GSCYVDLLSK EATLEFINLT HERYKSYCQD YFGDAMPGIF TDEPTYLRVH YKDIATLPWT EKLPERFLQK KGYDIKEHFE ELFFNVGNYH KVRFDFFDIA LEMFIENFTI PYAKWCEENG IMMTGHYMAE DTMRGQIEWI GAAMPHYEYM QLPGVDKLAR HLEQVVTMKQ VSSVAEQLGK KGVLCETFGT TGQHVSFLHR KWISDWQAVL GITYINPHLS LYSMRGERKR DYPPNLFYQQ PWWEDEKFFA DYLARISYIA GLGKRDVDVL VLHPISSAWA EYSKFDDSVD KLDILLDKTV KELIANKIDF HFGDEIILSK YAQVENCKIK VGSYSYKVVV LPPLTNLRKT TLKLLQNFVN TGGKIVALKD FMFSRFELCM IDGSKCDIPL KEKFKNASDL NELVNILKEE VSNYIEVIDK KTGQNAKKII FQNRKLNDGN RIIFLANTGL QREAEITIKI LTDKNVFVAD LVDFGVFKIP TLRRENGCAV IDATMYPASS LCLLVSSKEL SQNTKNVISG VVFDNSFEYK TGSCEFDIAL KNYNTLILDR IKYEVDGKVI FEDCYCAQVW HKHFYNLPEG TPFKATYHFE LEKVPSKLFV AIECAENLDM ILVNNQPVKF ERKSESFSPD QNFLDVNIGK IDITSFVKEG KNEIVLSGRK SNNITAPGCH ERVKDPKNHR PTEVEAIYLV GSFSLICVDE TRFILTEPKK PCHFDITKDG YPFYVGSVSL KSSFEITKES SKRVYIKLNS VSAAVAKVFV NGKKTCTLFS QPFLADITEY VNDGKNELEI VLTNTLFNLI EANHKADVFD ELYRRPQSFI DFENFTSRYM ILPFGLGSYS ILTSNV
|
| |