Gene Athe_2018 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2018 
Symbol 
ID7408230 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2128594 
End bp2129967 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content33% 
IMG OID643716385 
Producthypothetical protein 
Protein accessionYP_002573869 
Protein GI222529987 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTTATA AAGGTGAATT TTCGAACAAA ATAATGCATG AGCGCTTGAT TAAAAATCCT 
GAATTTCAAA AGAGACTTAA AGAGTTAAGA GTGGCTTATG ATACTCCAAA TAGAGACATA
ACCTCGGAAA TTAGACAAAT GTTTAGAATA ATCGATACTG AAGATGCTTC AAAAAACGTG
AAAATCGTTT TTGCGGTAGA TGGATCATAT ACTGACATTC CCATAAACAA CAATATTCCT
TCTGCAAGAA TAGGTATTGC TAACTTTTGT GCATCAGTTG TTAAGTTAGA TGAATTAAAA
AAAAGTGCTC AATATGAATT TCTAGATCCC CACGAATTCA ATGACACTTA CACTACTGGT
TTACTTACTT TTGTTGGTCC TCTTGCTAAT ATAATTGAAG AGGGTAAAAC TACAACATCT
CAATCTATTC GCTATGCAAT TTATAAATTC ATGTGCAACA AACCATTCGA TGAAACATTA
CCTTTAATTA ACACACTCTA TACAATTTTA AAAGAAGGTA ATGACAAAAC TGTTGAATCA
TTTAATTGTC CAAATCCAGA GTGTAACGAA CACATTGAAT GGGACTTAGA AAAAGACAAT
ATTAATCCAA AAAAATGTCC TGGGTGTGGG GAAGAGGTGT ATTTAACTGA TTGGTTACGA
CTGCATGAAG CAGTTGAAGA AGATTTTGAA AGCACCTCAA TACTTTCGCG CTTAACGCAA
GTTGTGGAAC ATTTACTCGT ATTTAATCTT ATTCAGACTT GTTTAAGTAA CCAAACATTA
GTTTCTCTAC CTTTTTCAAT GGCCTTTATT TTAGATCGTC CATTGGCAAT ATACGGCGAA
CCAGCAAAGT TACATAGATA TATTTTGAAA TATTACCATA AGCTGATGCA AAACAAAAAT
ACTCCTTTAA TAATTTTTGG ATTAGCTAAA AGCGGCAGAC TAAAAGACCA CTTTGAACTG
CTTGAAAGGA GAATGAAAGA AATAGGAGAA GAGATTCCTA AAAATGCAGT TATGTTAGTA
AGTGATGCAT ATAGATTTAA ATATATTCAG CAAAGACCTA AGCGAAATGA ATATTTTGGT
CAAGAAATTA GCTGGGGACA GGATTTTTTG TTCTATTCTA AGGAAGCCAA AAAATTTGTG
GTTTCTCTAC CTTATTCCGT CGATGAGAAA AAGAAAGAAT ACTATGAAAA AATGATTTTC
AATATTGATT CATACTCAAC ACTTCCCACT GTTTTGGATT TGATTAATAA GATAACTACT
GACTTATACG AAGATGCAAT ATTGCCTGTT GCGTTAGCCC ATCGTTATGC TTCTATTAGC
TTAAAACCGA GCAAACAAAT ATTGGAGATG TTTGCCAGAG AGCTTATAAA ATAG
 
Protein sequence
MPYKGEFSNK IMHERLIKNP EFQKRLKELR VAYDTPNRDI TSEIRQMFRI IDTEDASKNV 
KIVFAVDGSY TDIPINNNIP SARIGIANFC ASVVKLDELK KSAQYEFLDP HEFNDTYTTG
LLTFVGPLAN IIEEGKTTTS QSIRYAIYKF MCNKPFDETL PLINTLYTIL KEGNDKTVES
FNCPNPECNE HIEWDLEKDN INPKKCPGCG EEVYLTDWLR LHEAVEEDFE STSILSRLTQ
VVEHLLVFNL IQTCLSNQTL VSLPFSMAFI LDRPLAIYGE PAKLHRYILK YYHKLMQNKN
TPLIIFGLAK SGRLKDHFEL LERRMKEIGE EIPKNAVMLV SDAYRFKYIQ QRPKRNEYFG
QEISWGQDFL FYSKEAKKFV VSLPYSVDEK KKEYYEKMIF NIDSYSTLPT VLDLINKITT
DLYEDAILPV ALAHRYASIS LKPSKQILEM FARELIK