Gene Athe_2581 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2581 
Symbol 
ID7409535 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2718467 
End bp2721457 
Gene Length2991 bp 
Protein Length996 aa 
Translation table11 
GC content39% 
IMG OID643716945 
Producthypothetical protein 
Protein accessionYP_002574419 
Protein GI222530537 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.520562 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTGATT TAGAAAAACT CAAAAACCCA GATAATTTTT ACAGGTGCGC ACCTTTCTGG 
AGTTGGAATG ACAATTTAAA AGAGGAAGAG CTTTTGCGCC AGATAACTGA AATGCACCAA
AAAGGTTATG GCGGTTTTTT CATGCACTCA AGGGTTGGGC TTGTGACAGA GTATCTGTCT
GAAGAGTGGC TAAATCTTGT CAAAAAATGT ATAGAGCATG CAAAAAAGCT AAACATGCTT
GCATGGCTTT ATGATGAGGA CAAGTGGCCG TCTGGTTTTG CTGGGGGTGC TGTGGCTTTT
AAAAATCCTT CATATAGGCA CAAGTTTTTA GTACTTTTGA AAGAGGACCA GGTTGAACAC
GACGATGAGC TGCTCTCAAG CTTTGTCCAC AGAAACACAA AGTACATTGT TGCAAAGCGC
ACAATGAAGC TTGGCGACAA GTGGTTCAAT GGAAGCTGCT ATGTCGACCT TCTTTCAAAA
GAGGCAACTT TGGAGTTTAT AAACCTCACA CATGAGAGAT ACAAAAGCTA TTGCCAAGAC
TATTTTGGTG ATGCAATGCC GGGAATATTC ACCGATGAGC CAACATATTT AAGGGTACAC
TACAAAGATA TTGCTACATT GCCATGGACA GAAAAGCTTC CAGAAAGGTT TTTGCAGAAA
AAAGGATATG ACATAAAAGA GCACTTTGAA GAGCTCTTCT TCAATGTGGG TAATTACCAC
AAGGTCAGGT TTGACTTTTT TGATATTGCC CTTGAAATGT TCATAGAAAA CTTTACCATT
CCTTATGCAA AGTGGTGCGA AGAAAATGGT ATTATGATGA CAGGACACTA CATGGCAGAA
GATACAATGC GCGGCCAGAT TGAATGGATA GGCGCTGCAA TGCCCCACTA TGAGTACATG
CAGCTTCCTG GAGTTGACAA GCTTGCAAGA CATTTAGAGC AGGTAGTTAC AATGAAACAG
GTGTCATCGG TTGCAGAGCA GCTTGGCAAA AAAGGGGTAC TGTGCGAAAC CTTTGGAACA
ACAGGTCAGC ATGTAAGTTT TCTTCACAGA AAATGGATAT CAGACTGGCA GGCAGTGCTT
GGTATAACAT ACATAAACCC ACATCTTAGC CTGTATTCCA TGAGAGGTGA GAGAAAAAGA
GACTATCCAC CAAACCTTTT TTACCAGCAG CCATGGTGGG AAGATGAAAA GTTCTTTGCA
GACTATCTTG CAAGAATTTC GTACATAGCA GGGCTCGGCA AAAGAGATGT TGATGTGCTT
GTTCTCCACC CCATATCATC TGCCTGGGCA GAGTATTCTA AGTTTGATGA CAGCGTGGAC
AAGCTGGATA TTCTGCTTGA CAAAACGGTA AAAGAGCTTA TAGCAAACAA GATTGACTTT
CACTTTGGCG ATGAGATAAT CCTATCAAAG TATGCACAAG TTGAAAATTG CAAAATTAAA
GTTGGCAGTT ATAGCTACAA AGTAGTTGTC CTGCCGCCTC TTACAAACTT GAGAAAGACA
ACACTTAAGC TTTTGCAAAA CTTTGTAAAT ACCGGTGGCA AGATAGTTGC ACTTAAAGAT
TTCATGTTTT CAAGATTTGA ACTTTGCATG ATAGACGGCA GCAAGTGTGA CATTCCTTTA
AAAGAAAAAT TCAAAAATGC TTCAGATTTA AACGAACTTG TGAACATCTT AAAAGAAGAG
GTTTCAAACT ATATTGAAGT GATTGACAAA AAGACAGGTC AAAACGCTAA AAAAATCATA
TTTCAGAACA GAAAGCTAAA TGATGGAAAC AGAATAATAT TTTTGGCAAA CACAGGCCTT
CAAAGAGAAG CAGAAATCAC AATAAAAATT CTTACTGACA AAAATGTTTT TGTCGCAGAC
CTTGTAGATT TTGGTGTGTT CAAGATTCCA ACATTAAGAA GGGAAAATGG TTGTGCTGTG
ATTGATGCAA CAATGTATCC TGCATCAAGT TTGTGTCTTT TGGTATCTAG CAAAGAGCTT
TCGCAAAACA CAAAAAATGT CATTTCGGGC GTTGTATTTG ACAATAGTTT TGAATACAAA
ACAGGCAGTT GTGAGTTTGA CATAGCGCTT AAAAACTACA ATACATTGAT ACTTGATAGA
ATAAAATATG AGGTTGATGG CAAGGTAATT TTTGAAGATT GTTATTGTGC CCAGGTTTGG
CACAAGCATT TTTATAATCT CCCGGAAGGA ACACCTTTCA AAGCCACGTA CCATTTTGAG
CTTGAAAAAG TGCCGTCAAA GCTTTTTGTG GCAATAGAGT GTGCAGAGAA TCTTGACATG
ATTCTTGTAA ATAATCAGCC TGTCAAATTT GAAAGAAAAA GCGAAAGTTT TTCGCCCGAC
CAAAACTTTT TGGATGTGAA CATTGGCAAG ATTGATATTA CATCTTTTGT CAAAGAAGGC
AAAAATGAAA TAGTACTCAG TGGCAGAAAG TCAAACAACA TCACAGCACC TGGCTGCCAT
GAGAGGGTAA AAGACCCAAA AAACCACAGA CCAACTGAGG TAGAAGCCAT TTATCTTGTA
GGAAGCTTTT CTCTTATTTG CGTTGATGAG ACAAGGTTTA TTTTAACCGA GCCTAAAAAA
CCATGTCATT TTGATATAAC AAAAGACGGG TATCCTTTTT ATGTCGGGTC GGTAAGCTTA
AAAAGCTCTT TTGAGATTAC AAAAGAAAGT TCAAAAAGAG TTTATATAAA GCTCAATAGC
GTCAGCGCAG CAGTTGCCAA AGTTTTTGTA AACGGCAAGA AAACATGCAC CCTTTTTTCC
CAGCCGTTTT TGGCTGACAT AACAGAGTAT GTAAATGACG GTAAAAATGA GCTTGAGATA
GTTTTAACAA ACACACTTTT TAACCTCATA GAAGCAAACC ACAAAGCAGA CGTGTTTGAT
GAGCTGTACA GACGTCCCCA GAGCTTTATT GACTTTGAAA ATTTCACATC ACGATATATG
ATCTTGCCAT TTGGCCTTGG AAGTTATAGT ATATTAACAT CCAATGTTTA A
 
Protein sequence
MLDLEKLKNP DNFYRCAPFW SWNDNLKEEE LLRQITEMHQ KGYGGFFMHS RVGLVTEYLS 
EEWLNLVKKC IEHAKKLNML AWLYDEDKWP SGFAGGAVAF KNPSYRHKFL VLLKEDQVEH
DDELLSSFVH RNTKYIVAKR TMKLGDKWFN GSCYVDLLSK EATLEFINLT HERYKSYCQD
YFGDAMPGIF TDEPTYLRVH YKDIATLPWT EKLPERFLQK KGYDIKEHFE ELFFNVGNYH
KVRFDFFDIA LEMFIENFTI PYAKWCEENG IMMTGHYMAE DTMRGQIEWI GAAMPHYEYM
QLPGVDKLAR HLEQVVTMKQ VSSVAEQLGK KGVLCETFGT TGQHVSFLHR KWISDWQAVL
GITYINPHLS LYSMRGERKR DYPPNLFYQQ PWWEDEKFFA DYLARISYIA GLGKRDVDVL
VLHPISSAWA EYSKFDDSVD KLDILLDKTV KELIANKIDF HFGDEIILSK YAQVENCKIK
VGSYSYKVVV LPPLTNLRKT TLKLLQNFVN TGGKIVALKD FMFSRFELCM IDGSKCDIPL
KEKFKNASDL NELVNILKEE VSNYIEVIDK KTGQNAKKII FQNRKLNDGN RIIFLANTGL
QREAEITIKI LTDKNVFVAD LVDFGVFKIP TLRRENGCAV IDATMYPASS LCLLVSSKEL
SQNTKNVISG VVFDNSFEYK TGSCEFDIAL KNYNTLILDR IKYEVDGKVI FEDCYCAQVW
HKHFYNLPEG TPFKATYHFE LEKVPSKLFV AIECAENLDM ILVNNQPVKF ERKSESFSPD
QNFLDVNIGK IDITSFVKEG KNEIVLSGRK SNNITAPGCH ERVKDPKNHR PTEVEAIYLV
GSFSLICVDE TRFILTEPKK PCHFDITKDG YPFYVGSVSL KSSFEITKES SKRVYIKLNS
VSAAVAKVFV NGKKTCTLFS QPFLADITEY VNDGKNELEI VLTNTLFNLI EANHKADVFD
ELYRRPQSFI DFENFTSRYM ILPFGLGSYS ILTSNV