Gene Athe_1140 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1140 
Symbol 
ID7408722 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1234420 
End bp1236084 
Gene Length1665 bp 
Protein Length554 aa 
Translation table11 
GC content38% 
IMG OID643715506 
Productprotein of unknown function DUF87 
Protein accessionYP_002573014 
Protein GI222529132 
COG category[R] General function prediction only 
COG ID[COG0433] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAGTA GTATGTATGA AGAGAACAGA ATTGGCAAAA TCATAGGTGG TTCGTATTCA 
GAAGGTCTTG CAATAAAAGT CGAGGATGAT TCTGTTGTGG AAAGTACAAG GATTGGAGCA
ATTCTTGTTA GCCAAACAGA AAAGAGGAAG TACTACTGTA TGCTTACCGA CATGGTAATA
GAGGGCATGA ACAAGCAAGC TTTGACAGAA CTTCCGCGAG GAAACTCAAG CCTGCTTTTG
AACAGAATTA CAAGGGGGAC TTCAATTTAT ACTGTGTTCA AGGCACAGCC AGTCCTTTCT
TACGACCTTG AGGAAAAGAA AAATCAGCCT ATAAGAAACA TACCCGTTCA TGCTTCAAGT
GTTAGAAGAG CTACCTATGA TGATATTTCA GATGTGTTTG GAAGTTTTGA AAAAAGTCCA
AGACGTTATT TTCCAGTTGG AAGTGTTCTT GACATGGACG AAAGCTCTAC AGTATGCATA
GACATGGAAA GATTTATTGA ACGAAGCAGT GGCATTTATG GGAGGACTGG TACAGGAAAA
TCATTTATTG CAAGATTATT GATGGCGGGG ATTATCCTTT GTGATAAGGC ATCGCTTCTC
ATTTTTGATG CTCACTCAGA CCATGGACCT GACAGCGTTG ATGAGGAAAA CCGTCCTGTT
AAAGGGCTTA AAAGTCTTTT TGGAAGCAAA GTCCAGATAA TGACAATTGA AAATTCCTCG
TCAATGGCAG GTGTTTTGCC GATTGAGATT GATGTTAGAG ATGTTGAGAT CGAGGATATT
TTATCAATTG CAGAAGAGCT AAACCTCAAT GAGACAGCAC AACAGGTTAT GATTGCACTG
AAAAATAAAT TGGAGACAGA GGGTAAACAC TGGCTTGAAG AGATACTTAT AAATGGTGAG
GACTTAGCAG AGAGGTTTAA AGACAGCGAA GCAGTTGTTA ACAGAAGTTC GCTTTTGGCA
CTTATCAGAA AGCTTTCTGT GTTAAAAGAA TTACCCTACC TTAGATATGA TAGACGACCT
GGTACAAACT CAATTGATAT TATTTTAAAC TATCTTCAAA AAGGTATAAG TGTTGATATA
ACATTTGGCA AAAGTGATAA ATTACTTAAT TACCTCTTTG TTACAAACGT ATTATCAAGA
CGTATTTACC AAAGATATAT GGAGATGTAC GAAAGGTATA TCTCAAACAG ACAAAAATAT
TCTCCTCCAA GGCCACTTGT GATTGCTATT GAAGAAGCAC ACAGATTTTT ATCGCCCGAT
GTTGCAAAGC AGACAATATT TGGAACAATA GCAAGAGAGA TGAGAAAAGC TAAGGTAAGT
CTTATGTGTA TAGACCAGAG ACCTTCTCAG ATAGACAGCG AGATTGCATC GCAAATTGGA
ACAAGGATTA TTTTATCTCT TTCTGATGAA GCTGACATTA CAAGTGCACT TGCTGGTATG
AAAAATAGTA AACAGCTGAG GGCAATTATA GAGTCGCTTG ATTCGAAACA GCAGGCTTTG
TTAATAGGTC ATGCAGTTCC TATGCCAATT GCGATAAAAA CGAGGGGGTA TGATAGTAGC
TTTTATGATT TTGTTTCAAT TTATTTCAAA GAAGATGAAG TGGATGAAAA GTATGAAAGA
ACTTTAGAGG CATCTAAGAA GTGGCTTGAT GAGATGTGCT ATTAA
 
Protein sequence
MASSMYEENR IGKIIGGSYS EGLAIKVEDD SVVESTRIGA ILVSQTEKRK YYCMLTDMVI 
EGMNKQALTE LPRGNSSLLL NRITRGTSIY TVFKAQPVLS YDLEEKKNQP IRNIPVHASS
VRRATYDDIS DVFGSFEKSP RRYFPVGSVL DMDESSTVCI DMERFIERSS GIYGRTGTGK
SFIARLLMAG IILCDKASLL IFDAHSDHGP DSVDEENRPV KGLKSLFGSK VQIMTIENSS
SMAGVLPIEI DVRDVEIEDI LSIAEELNLN ETAQQVMIAL KNKLETEGKH WLEEILINGE
DLAERFKDSE AVVNRSSLLA LIRKLSVLKE LPYLRYDRRP GTNSIDIILN YLQKGISVDI
TFGKSDKLLN YLFVTNVLSR RIYQRYMEMY ERYISNRQKY SPPRPLVIAI EEAHRFLSPD
VAKQTIFGTI AREMRKAKVS LMCIDQRPSQ IDSEIASQIG TRIILSLSDE ADITSALAGM
KNSKQLRAII ESLDSKQQAL LIGHAVPMPI AIKTRGYDSS FYDFVSIYFK EDEVDEKYER
TLEASKKWLD EMCY