Gene Athe_1424 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1424 
Symbol 
ID7409167 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1505467 
End bp1506870 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content41% 
IMG OID643715787 
ProductF0F1 ATP synthase subunit beta 
Protein accessionYP_002573295 
Protein GI222529413 
COG category[C] Energy production and conversion 
COG ID[COG0055] F0F1-type ATP synthase, beta subunit 
TIGRFAM ID[TIGR01039] ATP synthase, F1 beta subunit 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAACAAA ATGTAGGGTA TGTCGTCCAG ATTATAGGAC CTGTTATTGA TATACGATTT 
GAGAGTGAAA ATTTACCAGC CATCAATAAT GCTATTGAAA TTCACTTTGA TGGTAAAAAA
CTTGTTGCTG AAGTTGCTCA GCATCTCGGG AATGACACTG TTCGATGTGT GGCTTTGGGT
TCAACAGACG GACTTAGAAG AGGTGTGAAG GCAATAGATA CAGGTGGACC TATTAAAGTA
CCTGTGGGGA GAGGAACGCT GGGTAGGATA TTTAATGTAT TGGGAGAGCC TATTGACAAT
AAAGGTGAAG TAGTAGCTTC AGATTACTGG CCCATTCACA GAAGTGCACC GTCGTTTGAA
GAACAGGTAC CTGCAGTTGA AATTTTCGAA ACGGGTATAA AAGTTATTGA TCTTTTAGCT
CCGTACGCAA AAGGTGGTAA GATAGGACTT TTTGGCGGTG CGGGCGTTGG TAAGACTGTC
CTTATAATGG AGCTTATAAG AAATATAGCG ACAGAGCACG GGGGTTTTTC AATTTTCACA
GGTGTGGGTG AAAGGACAAG AGAAGGTAAC GACCTGTGGC TTGATATGAA TGAGTCTGGT
GTTATAGAAA AGACTGTATT GGTGTTTGGT CAGATGAACG AGCCGCCTGG GGCAAGAATG
AGAGTAGCTC TGACCGGGCT TACCATGGCA GAATATTTCA GAGATGTAGA AGGGCAAGAT
GTTCTTTTGT TCATTGACAA TATCTTCAGG TTCATCCAAG CAGGCTCTGA AGTGTCAGCG
CTTTTAGGAA GAATTCCTTC GGCAGTTGGG TATCAACCAA CACTTGCAAA CGAGGTAGGG
GCATTACAGG AAAGAATAAC ATCTACAAAA AAAGGCTCAA TTACATCTGT TCAAGCAATA
TATGTTCCTG CAGACGATTT AACTGATCCA GCACCTGCTA CAACTTTTGC TCATTTGGAT
GCAACAACAG TTTTGTCAAG ACAGATTGCT GAGCTTGGAA TATATCCTGC GGTTGATCCT
CTTGATTCAA CCTCACGTAT ACTTGATCCG CGAATTGTGG GAGAAGAACA CTATTATGTT
GCAAGGACTG TGCAGCAAAT ACTTCAAAGA TATAAAGAGC TGCAGGACAT TATTGCTATT
TTGGGTATGG ATGAGCTATC AGAAGAAGAT AAACTGATTG TCTACAGAGC AAGAAAAATT
CAGAGATTTT TATCACAGCC ATTCTTTGTT GCTGAAGCTT TCACAGGAAG ACCTGGAAGG
TATGTGAAGT TGAAAGACAC TATAAGAGGG TTCAAAGAGA TAATTGAAGG GAAGATGGAC
CATATTCCTG AACAGTATTT CTACATGGTA GGAACAATAG ATGAGGTATA TGAAAACTAC
GAAAAAGACA TGAAAGGCAA ATAA
 
Protein sequence
MEQNVGYVVQ IIGPVIDIRF ESENLPAINN AIEIHFDGKK LVAEVAQHLG NDTVRCVALG 
STDGLRRGVK AIDTGGPIKV PVGRGTLGRI FNVLGEPIDN KGEVVASDYW PIHRSAPSFE
EQVPAVEIFE TGIKVIDLLA PYAKGGKIGL FGGAGVGKTV LIMELIRNIA TEHGGFSIFT
GVGERTREGN DLWLDMNESG VIEKTVLVFG QMNEPPGARM RVALTGLTMA EYFRDVEGQD
VLLFIDNIFR FIQAGSEVSA LLGRIPSAVG YQPTLANEVG ALQERITSTK KGSITSVQAI
YVPADDLTDP APATTFAHLD ATTVLSRQIA ELGIYPAVDP LDSTSRILDP RIVGEEHYYV
ARTVQQILQR YKELQDIIAI LGMDELSEED KLIVYRARKI QRFLSQPFFV AEAFTGRPGR
YVKLKDTIRG FKEIIEGKMD HIPEQYFYMV GTIDEVYENY EKDMKGK