Gene Athe_0746 details

Gene Information

Locus tag: Athe_0746
Symbol:
ID: 7408440
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 834397
End bp: 835752
Gene Length: 1356 bp
Protein Length: 451 aa
Translation table: 11
GC content: 37%
IMG OID: 643715118
Product: hypothetical protein
Protein accession: YP_002572634
Protein GI: 222528752
COG category:
COG ID:
TIGRFAM ID:
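
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are hypothetical and not part of the database.

```python
# Values copied from the record above.
START_BP = 834397
END_BP = 835752
GENE_LENGTH_BP = 1356
PROTEIN_LENGTH_AA = 451


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```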


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000789691
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCAACA TCATAGAACT TGAGACACTT TTAAATTCAG CTCAAAACAA TCAAGAGTTT 
GAAGATTTGG ACAATATAGA GATAAAAGAA TTTCCTCAGC CTATGCCTGC AGAGGTATAT
CGAGGAATGG TAGGTACTAT TGTAAAATAT CTTGAAAACT TTACTGAAGC AGCTCCTGAA
GCTTTGCTTA TCAACTTATT GGTAGTATTT GGTGCCATTG TTGGGAAGGA AGCATGGATA
GAGGTAGGAG GCGACAGGCA TTATCCAAAT CTCTTTGCTG TCTTGGTAGG AGATACTGCA
AGTGGACGAA AAGGGTCAAG TTGGTCAATT ATTGAAAGGG TATTGGAGAA AGCTGACAAA
AATTTTGTAT TAAACAATTT AAGAAATGGT ACAGTGTCAG GTGAGGGTAT TATATATCAT
GTCAGAGACC CTATTTTCAA GTGGGACAAG AACTCTGAGA CTTATGAAAT GATAGACCCC
GGCGTTGAGG ATAAGAGGTT ACTTATTATT GAGTCTGAGT TTGCCTCTCT ACTTAGGGTT
ATGAAAAGAG AAGGGAACAC AATTTCTCCA TTGTTAAGGA ATGCATGGGA TGGCAAATAC
AAATTAGAGA CACTCTCAAA AACAAATTAC ACAAAGGCAA CTAATGCTCA TATCTCATTG
ATTGGGCATA TAACGTTTGA TGAACTGAAA AAAGAATTAT CAGACGTTGA GAAAATGAAC
GGTTTTGGCA ACAGGTTTTT ATGGGTATGT ACACGAAGAA GCAAACTGTT ACCTAATCCA
CCGTTATTAC CAGAGGACAA GCTTACAGGT TGGGGATTAT TATTAAGAGA GAGTATTTCA
AAAGCACCAA AAGGTTTAAT TACAAAAACT CCTGCAGCTG AAGAAGCTTG GGCACTTATA
TATGAAAAAT ACGCAGACAA GGGAGAAGGT GAGACAGCGG CTTTAATAGG CAGGGCAGAA
GCACAGATTT TGAGATTAAG CCTAATATAT GCTTTATTAG ATGGGAGCGA GAAAATTACT
CATGAACATA TATGCACTGC AAGGTTGGTG TGGGAGTACT GTCAAAAATC TGTTGAATTC
ATTTTCAGTG AATTCAACAG AGAAAAAGAA AGCTCAATGG TTTTAAATTT ATTGAGCGCA
CTAAAAGAAA AACCATTGAG CCAAAGCGAA ATTTATGAGG TTTTCAACAA ACATATCAAT
GCCAAGAAAA TGGCTTATTT GCTAAAAAAG ATGAGTACAA AAGGTTATAT AGAAGCAAAG
AAAGAAAGAA GCAACGGCCG ACCAAAAACA CGCTGGTACA TTACACCACT TGGTCTAAAG
AAACTGGAAT CTTCTAATAT CGATTTTGCT TCTTAA
 
Protein sequence
MSNIIELETL LNSAQNNQEF EDLDNIEIKE FPQPMPAEVY RGMVGTIVKY LENFTEAAPE 
ALLINLLVVF GAIVGKEAWI EVGGDRHYPN LFAVLVGDTA SGRKGSSWSI IERVLEKADK
NFVLNNLRNG TVSGEGIIYH VRDPIFKWDK NSETYEMIDP GVEDKRLLII ESEFASLLRV
MKREGNTISP LLRNAWDGKY KLETLSKTNY TKATNAHISL IGHITFDELK KELSDVEKMN
GFGNRFLWVC TRRSKLLPNP PLLPEDKLTG WGLLLRESIS KAPKGLITKT PAAEEAWALI
YEKYADKGEG ETAALIGRAE AQILRLSLIY ALLDGSEKIT HEHICTARLV WEYCQKSVEF
IFSEFNREKE SSMVLNLLSA LKEKPLSQSE IYEVFNKHIN AKKMAYLLKK MSTKGYIEAK
KERSNGRPKT RWYITPLGLK KLESSNIDFA S
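
The gene and protein sequences above are printed in space-separated blocks of ten, so they need cleaning before any computation. The sketch below strips that layout, computes a GC fraction, and translates DNA to protein. It is an illustration under stated assumptions, not the pipeline that produced this record: the helper names are hypothetical, and the codon table is the standard amino-acid assignment, which translation table 11 shares (table 11 differs only in its permitted start codons, which this sketch does not handle specially).

```python
# Standard codon assignments in TCAG order; table 11 uses the same
# amino-acid assignments and differs only in allowed start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence above.
first_blocks = "ATGAGCAACA TCATAGAACT TGAGACACTT"
print(translate(first_blocks))  # prints MSNIIELETL
```

The printed residues match the first ten residues of the protein sequence in this record. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.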