Gene Athe_2114 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2114 
Symbol 
ID7408823 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2241500 
End bp2243590 
Gene Length2091 bp 
Protein Length696 aa 
Translation table11 
GC content31% 
IMG OID643716479 
ProductKWG repeat protein 
Protein accessionYP_002573962 
Protein GI222530080 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTTTAA AAAGAATTCA AGATAATATT TTGTACAAAT CAAGATTGAT TTTAATTATA 
GCAATTGTAA TTTTGTTGTT AACTGAATTT AAAGTTGCTT TTGCAAACAT TGAAATTGAA
AGATTTCCAG TTAAGAAATT TGTTTATATT CACCCTCAAT TTACCGAAGT TAAATATTTA
GATTGGGTAC ATGGTGATTG GTCAAGAGAC TATGAAATAA GGGTTTTGGT TAAAAAAGAT
GGTAAATGGG GTATTTTTTT AAAAAAAGCA AGAGTACTTG TGAAACCTCA ATTCGATGAA
ATAGAACAGC TAAGCACCGG GTTTAAGGTA AAGAAAAACG AAAAATGGGG ATTTATTGAT
AATACAGTGA AGGTATTAGT TGAACCTGTA TTTGATGATG TCTATGATAT ATACAACGGT
CTTTTAAAAA TAAAAGTTGG CAATAAGTTT GGTTTTGTAA ATGAAAATGG TAAGATTGAA
ATTGAGCCCA AGTTTGAAGA CGCTAAATAT TTTATTGGAA ACATGGCTCC TGTAAAACAA
AACGGGAAAT GGGGCATAAT TGATAAAACT GGAAGGTTTA TTGCTGAACC TATGTATGAT
GAATGTGTTA TACCAAATTT ACCTCCATAT GACAAAATAA TTATAATTTC AAAAGATGAA
AAATATGGTT ATGTTTCAGC TGCTGGGACT ACAGTTGTTC AGCCACAGTT TGAAGAAGTA
GAACTACTAA ATGAGAACTT GGTTGCAATA AAAAAAGAAG GAAAAATCGG GTTTGCAGAT
ATAAGTGGAA AAATTCTCAT TAATCCAGAG TATGATAAAT ACTATTCTAT TGTTGGAGGA
TCTGATAATA TACGAATAAT AGCAGTATCT AAAAATAATC ACATTGGTGC TGTTTCTATG
ACTGGACATG TTATGTTTGA ACCTGCTTAT GAAGATATCT CAGTTGTTTC AAAGAATATA
TTGATCGCAA AGAAAAACGG AAAATGGGGA TTTATAACTT TCGACGGTAA GGTGAAGGTT
GATTTTAAGT ATGATGAGTT TGAACGGTTG ATAGATAAAA ACTTTATATT AATTAAAAAA
GGAAAAAAGG TTGGGGTTGC AAATTTGAAT GGTGAAATTA TTGCTGAGCC TCAATATGAT
TATGTCGGTG ACCCCTTTTT GGGTCCGACT AAAAAAGATG CTTTGATGAC AGGCTCGAAA
GGAAAAAGAG GAATTATTTA CAACAAAATT GTTGTTCCTC CTCAATTTGA TGTCATTAAA
TTTTGTTCAA CCTCTAAAAA TGCTACTATA TTAACTGCTG TAAAAAAAGA TGGTAAGTGG
ACTTATATAA ATAAATATGG GAAACTTATT ACTCAACCTC AGTTTGACAG TGTAGACGAA
TATTTTTATT CTGGTGTAGC AAAGATTATA GAAAACAACA AAATTGGATT TATAAATGAA
AATGGTAAAA TTATAACCAA ACCACAGTTT GATGGTGTTA CACCATTTGA TGACTGTGGG
TTTGCAGGAG TTAATCAAAA AGGTAAGTGG GGCTTCATTG ATAAAAGCGG AAAGCTCATT
ATAAAACCTC AGTTTGAAGA AATATCTAAT TTTACAGCGG ATGGTTTGGC AAGAATAAAG
CTAAAAGGTA AATGGGGGTA CATTGAAAAA GGTGGGAAAG TTATAATCAA ACCTAAGTTT
AATCAATTGG GTATTTTCAA AGAAGGGTTA GCTCCTGCAA AACTTGGTGG GAAATGTGGT
TATATAGACA GGAAAGGTAA TTTTGCAATT AAACCGCAAT ATGAAGATGC ATTGTATTTT
GTTGGTGATA CTGCTGCTGT CAAACTGAAT GGCAAATGGG GTTTTATTGA TAAAAAAGGC
AGGTTTAAAA TAAAACCTCA ATATGATGAA GTGATAAACG TTTATATTAT GGGATATAAG
GATTTAAGAG TTATAATTAA AAACAACAGA ACTGGACTAA TTGATTCGAA AGGAAATATT
CTAATTGATC CAAATTTCGA ATCAATAGAA GGTGGTAATT TACAGTTTTT AGGTTATGTA
CTTTTAAAAT CAACAGACAA TAAGTACGGT TTTTTGCTTG ATGAAGAATA A
 
Protein sequence
MFLKRIQDNI LYKSRLILII AIVILLLTEF KVAFANIEIE RFPVKKFVYI HPQFTEVKYL 
DWVHGDWSRD YEIRVLVKKD GKWGIFLKKA RVLVKPQFDE IEQLSTGFKV KKNEKWGFID
NTVKVLVEPV FDDVYDIYNG LLKIKVGNKF GFVNENGKIE IEPKFEDAKY FIGNMAPVKQ
NGKWGIIDKT GRFIAEPMYD ECVIPNLPPY DKIIIISKDE KYGYVSAAGT TVVQPQFEEV
ELLNENLVAI KKEGKIGFAD ISGKILINPE YDKYYSIVGG SDNIRIIAVS KNNHIGAVSM
TGHVMFEPAY EDISVVSKNI LIAKKNGKWG FITFDGKVKV DFKYDEFERL IDKNFILIKK
GKKVGVANLN GEIIAEPQYD YVGDPFLGPT KKDALMTGSK GKRGIIYNKI VVPPQFDVIK
FCSTSKNATI LTAVKKDGKW TYINKYGKLI TQPQFDSVDE YFYSGVAKII ENNKIGFINE
NGKIITKPQF DGVTPFDDCG FAGVNQKGKW GFIDKSGKLI IKPQFEEISN FTADGLARIK
LKGKWGYIEK GGKVIIKPKF NQLGIFKEGL APAKLGGKCG YIDRKGNFAI KPQYEDALYF
VGDTAAVKLN GKWGFIDKKG RFKIKPQYDE VINVYIMGYK DLRVIIKNNR TGLIDSKGNI
LIDPNFESIE GGNLQFLGYV LLKSTDNKYG FLLDEE