Gene Moth_2380 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2380 
Symbol 
ID3832019 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2505324 
End bp2506847 
Gene Length1524 bp 
Protein Length507 aa 
Translation table11 
GC content60% 
IMG OID637830299 
ProductATP synthase F1, alpha subunit 
Protein accessionYP_431205 
Protein GI83591196 
COG category[C] Energy production and conversion 
COG ID[COG0056] F0F1-type ATP synthase, alpha subunit 
TIGRFAM ID[TIGR00962] proton translocating ATP synthase, F1 alpha subunit 


Plasmid Coverage information

Num covering plasmid clones42 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0176105 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGCATTC GACCCGACGA GATAACCAGT ATTTTAAAGA ACCAGATTGA ACAATACCAG 
CTGGAAGTAG AAATGGCCGA GGTGGGAACC GTTACCCAGG TCGGTGACGG TATCGCCCGC
ATCTACGGCC TGGACCGGGC CATGGCCGGC GAGCTGCTGG AGTTCCCCGG CGATATCTAT
GGCATGGTCC TGAACCTGGA AGAAGATAAC GTCGGCGCCG TTATCCTCGG TCCCTATACC
CATATCAAAG AGGGCGACCA GGTCAAACGT ACCGGGCGTA TTGTCGAGGT GCCGGTGGGC
GAAGCCCTCA TCGGCCGGGT GGTCAACGCC ATGGGCCAGC CCATAGACGG CAAGGGGCCT
ATCCAGACGG ATAAATTCCG CCCGGTGGAA TCCCCGGCGC CGGGCGTGGT CTACCGCCAG
CCGGTCAATA CTCCCTTACA AACGGGCCTC AAGGCCATTG ACTCCATGGT CCCCATCGGC
CGCGGTCAGC GGGAGCTGAT TATCGGTGAC CGCCAGACGG GGAAGACGGC CATTGCCGTG
GACACCATCA TCAACCAAAA GGGGCAGAAC GTTATCTGCA TCTATGTGGC CATCGGCCAG
AAGGCTTCTA CAGTGGCGGG CGTAGTCCAG CGTCTGGAAG AGGCCGGAGC TATGGAATAT
ACCATCGTCG TTATGGCTAC AGCCAGCGAA CCGGCGCCCA TGCTCTACAT TGCCCCCTAC
GCCGGCTGCA CCATGGGCGA ATACTTTATG TATGAGCAGC ACCGGGACGT TCTCTGCGTT
TATGACGACC TTTCCAAGCA CGCAGCAGCC TACCGGGAAC TCTCCCTGCT TCTGCGGCGG
CCGCCGGGCC GTGAGGCTTA CCCCGGGGAT GTCTTCTATC TCCACTCCCG GTTGCTGGAG
CGGGCCGCCC GCCTGAACGA CTCCCTGGGT GGCGGTTCCC TCACTGCCCT GCCGGTCATT
GAGACCCAGG CTGGCGATGT CTCCGCTTAC ATTCCGACCA ATGTTATCTC CATCACCGAC
GGCCAGATCT TCCTGGAGTC TGATCTCTTC TATGCCGGCC AGCGTCCGGC CATTAACGTC
GGCCTCTCGG TATCCCGGGT GGGCGGCGCC GCCCAGATCA AGGCCATGAA ACAGGTGGCC
GGCCGCCTGC GCCTGGACCT GGCCCAGTAC CGCGAGCTGG CGGCCTTCGC CCAGTTCGGT
TCCGACCTGG ATAAAGCCAC CCAGGCGAGA TTGGCCCGGG GCGAGCGCAT GATGGAGATT
TTGAAACAAG ACCAGTACCA ACCCATGCCC GTCGAAGAAC AGGTGGTCGT CCTCTATGCT
GCCGTCAATG GCTTCCTGGA CGACCTGCCT GTAGCCCGGG TGCGCGCCTT TGAAAAGGAC
TTCCTGCGCT TCCTCCGCAA CGAGAGGCCT GAGGTCCTGG CCGGCATCCG CGAAAAACGC
CAGCTGGACG ATAACCTCCA GGAACAACTG AAAAAGAGCA TTGAAGACTT CAAAGGCAGC
TTTACCGCTG CCGGAGAATC ATAA
 
Protein sequence
MSIRPDEITS ILKNQIEQYQ LEVEMAEVGT VTQVGDGIAR IYGLDRAMAG ELLEFPGDIY 
GMVLNLEEDN VGAVILGPYT HIKEGDQVKR TGRIVEVPVG EALIGRVVNA MGQPIDGKGP
IQTDKFRPVE SPAPGVVYRQ PVNTPLQTGL KAIDSMVPIG RGQRELIIGD RQTGKTAIAV
DTIINQKGQN VICIYVAIGQ KASTVAGVVQ RLEEAGAMEY TIVVMATASE PAPMLYIAPY
AGCTMGEYFM YEQHRDVLCV YDDLSKHAAA YRELSLLLRR PPGREAYPGD VFYLHSRLLE
RAARLNDSLG GGSLTALPVI ETQAGDVSAY IPTNVISITD GQIFLESDLF YAGQRPAINV
GLSVSRVGGA AQIKAMKQVA GRLRLDLAQY RELAAFAQFG SDLDKATQAR LARGERMMEI
LKQDQYQPMP VEEQVVVLYA AVNGFLDDLP VARVRAFEKD FLRFLRNERP EVLAGIREKR
QLDDNLQEQL KKSIEDFKGS FTAAGES