Gene Moth_2378 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2378 
Symbol 
ID3832017 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2503058 
End bp2504446 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content61% 
IMG OID637830297 
ProductF0F1 ATP synthase subunit beta 
Protein accessionYP_431203 
Protein GI83591194 
COG category[C] Energy production and conversion 
COG ID[COG0055] F0F1-type ATP synthase, beta subunit 
TIGRFAM ID[TIGR01039] ATP synthase, F1 beta subunit 


Plasmid Coverage information

Num covering plasmid clones53 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00260932 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGAACGAAG GACAGGTGGT CCAGGTTATT GGCCCGGTGG TTGACGTCGA ATTCGCCAGC 
GACCGGCTGC CCGACCTGTA TAACGCCATC ACCATTAAAA CCGATAAGAT TAATATAACC
ATGGAGGCCA TGCAGCACCT GGGCAACAAC ACCGTCCGCT GTGTGGCCCT CTCCTCGACC
GACGGCCTGC AGCGAGGGAT GAAGGCCGTT GACACCGGCC AGCCAATCAC CGTACCGGTA
GGCCGGGCTA CCCTGGGACG GCTCTTTAAC GTCCTGGGAG AACCCATTGA CAACCAGGGA
CCGGTAGAAA CCACCGAGAG GCTGCCCATT CACCGGCCGG CGCCCTCCTT TGAAGAGCAA
CAGCCTTCTA CCGAGGTCCT GGAGACTGGC ATCAAGGTGG TCGACCTCCT GGCGCCCTAC
GCCAAGGGCG GCAAGATCGG CCTCTTCGGC GGCGCCGGGG TCGGCAAGAC GGTCCTCATC
ATGGAACTCA TCCGCAACAT CGCCTATGAA CACGGCGGCT TTTCCGTCTT CAGCGGCGTG
GGCGAGCGTA CCCGCGAGGG TAACGACCTC TACCTGGAGA TGAAGGAATC CGGGGTTCTC
GAAAAGACGG CCCTGGTCTT TGGCCAGATG AACGAACCCC CGGGTGCCCG CCTGCGGGTG
GGTCTTACAG GCCTGACTAT GGCCGAGTAC TTCCGGGACG CCGAGGGCCA GGACGTTCTC
CTCTTCATCG ACAATATCTT TCGCTTCGTG CAGGCCGGTT CCGAGGTTTC CGCCCTCCTG
GGCCGGATGC CCTCGGCGGT GGGTTATCAG CCCACCCTGG CCACAGAGAT GGGGGCCCTG
CAGGAACGGA TTACCTCCAC GAAAAAGGGT TCCATCACCT CCGTGCAAGC TATCTATGTG
CCGGCCGACG ACCTGACCGA CCCGGCCCCG GCGACGACCT TCGCCCATCT GGACGCCACC
ACGGTTCTGT CCCGGCAGAT CGCTGAGCTG GGCATCTACC CGGCCGTCGA CCCCCTGGAC
TCCACCTCCC GTATCCTGGA CCCGCGCGTC CTGGGAGAAG AGCACTACCA GGTGGCCCGG
GGCGTCCAGC AGGTACTGCA GCGTTATAAA GAACTTCAGG ACATTATCGC CATCCTGGGA
ATGGATGAGC TGTCCGAAGA AGATAAACTC ATAGTTGCCC GGGCACGCAA GATCCAGCGT
TTCCTCTCCC AGCCCTTCCA CGTAGCCGAG GCTTTTACCG GCCAGCCCGG GGTTTATGTG
CCCCTGAAGG AAACCATTCG CGGTTTCAAA GAGATCCTGG AGGGCCGCCA CGACAACCTC
CCCGAGCAGG CCTTCTATAT GGTCGGGACC ATCGACGAAG CCGTCAAGAA GGGCCAGGAG
TTGATGTAG
 
Protein sequence
MNEGQVVQVI GPVVDVEFAS DRLPDLYNAI TIKTDKINIT MEAMQHLGNN TVRCVALSST 
DGLQRGMKAV DTGQPITVPV GRATLGRLFN VLGEPIDNQG PVETTERLPI HRPAPSFEEQ
QPSTEVLETG IKVVDLLAPY AKGGKIGLFG GAGVGKTVLI MELIRNIAYE HGGFSVFSGV
GERTREGNDL YLEMKESGVL EKTALVFGQM NEPPGARLRV GLTGLTMAEY FRDAEGQDVL
LFIDNIFRFV QAGSEVSALL GRMPSAVGYQ PTLATEMGAL QERITSTKKG SITSVQAIYV
PADDLTDPAP ATTFAHLDAT TVLSRQIAEL GIYPAVDPLD STSRILDPRV LGEEHYQVAR
GVQQVLQRYK ELQDIIAILG MDELSEEDKL IVARARKIQR FLSQPFHVAE AFTGQPGVYV
PLKETIRGFK EILEGRHDNL PEQAFYMVGT IDEAVKKGQE LM