Gene Mpe_A0195 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A0195 
Symbol 
ID4783520 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp209381 
End bp210934 
Gene Length1554 bp 
Protein Length517 aa 
Translation table11 
GC content66% 
IMG OID640088744 
ProductF0F1 ATP synthase subunit alpha 
Protein accessionYP_001019392 
Protein GI124265388 
COG category[C] Energy production and conversion 
COG ID[COG0056] F0F1-type ATP synthase, alpha subunit 
TIGRFAM ID[TIGR00962] proton translocating ATP synthase, F1 alpha subunit 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAACTGA ATCCCGCTGA AATTTCCGAA CTGATCAAGA GCCGCATCGA GGGGCTCGGC 
GCCTCGTCGG ACATCCGCAA CCAGGGCACC GTGGTGTCGG TGACCGACGG CATCGTGCGC
GTGCACGGCC TGTCGGAGGT CATGGCCGGC GAAATGCTGG AGTTCCCCGC CACCAAGGAC
GGCCAGCCGA CCTTCGGTCT CGCGCTGAAC CTCGAGCGCG ACTCGGTCGG TGCCGTGATC
CTGGGCGAGT ACGAGCACAT CTCCGAAGGC GACACGGTCA AGTGCACCGG CCGCATCCTG
GAAGTGCCGG TCGGCCCGGA ACTGATCGGC CGCGTGGTCA ATGCGCTGGG TCAGCCGATC
GACGGCAAGG GACCGATCAA CGCCAAGATG ACCGACGTGA TCGAGAAGGT CGCGCCGGGC
GTGATCGCAC GGAAGTCGGT GGACCAGCCG GTGCAGACCG GCCTGAAGTC GATCGACTCC
ATGGTGCCGG TGGGCCGCGG CCAGCGCGAG TTGATCATCG GCGACCGCCA GACCGGCAAG
ACCGCGGTGG CGATCGACGC GATCATCAAC CAGAAGGGTC AGAACATGAC CTGCGTCTAC
GTCGCGATCG GGCAGAAGGC CTCGTCGATC AAGAACGTGG TGCGCTCGCT CGAAGCGGCT
GGTGCGATGA GCTACACCAT CGTCGTGGCG GCCTCCGCCT CGGAATCGGC AGCGATGCAG
TACGTGTCGG CCTACTCGGG CTGCACGATG GGCGAGTACT TCCGCGACCG CGGCGAAGAC
GCGCTGATCA TCTATGACGA CCTGTCCAAG CAGGCGGTCG CCTACCGCCA GGTCTCGCTG
CTGCTGCGCC GCCCGCCGGG CCGTGAAGCC TACCCCGGTG ACGTGTTCTA TCTCCACAGC
CGTCTGCTCG AGCGCGCCGC GCGCGTGAAC GCCGATTACG TCGAGAAATT CACCAATGGC
GCCGTCAAGG GCAAGACCGG CTCGTTGACC GCGCTGCCGA TCATCGAGAC CCAGGCCGGT
GACGTGTCGG CCTTCGTGCC GACCAACGTG ATCTCGATCA CCGACGGCCA GATCTTCCTC
GAGACCAACC TGTTCAACGC CGGCATCCGC CCCGCGATCA ACGCCGGTAT CTCGGTGTCG
CGCGTGGGGG GCGCCGCCCA GACCAAGCTG GTCAAGGGCC TGTCGGGCGG CATTCGGACC
GACCTTGCGC AGTACCGTGA ACTGGCCGCC TTCGCGCAGT TCGCCTCCGA CCTGGACGAT
GCCACCCGCA AGCAGCTCGA CCGCGGCGCG CGCGTGACCG AGCTGCTCAA GCAGCAGCAG
TACCAGCCGC TGCCGATCAG CCTGATGGCT GCGACGCTGT ACTCGGTGAA CAAGGGCTTC
CTCGACGACG TTGACGTGAA GAAGGTGCTC GCCTTCGAAT CGGGCCTGCA CCAGTTCCTG
AAGACCAGCT ACGCCGCGCT GCTGAAGAAG CTCGAGGACA GCAAGGCGCT GGACAAAGAC
AGCGAAGCCG AGCTCGCCGC CGCCATCGGT GCCTTCAAGA AGTCCTTCGC GTAA
 
Protein sequence
MQLNPAEISE LIKSRIEGLG ASSDIRNQGT VVSVTDGIVR VHGLSEVMAG EMLEFPATKD 
GQPTFGLALN LERDSVGAVI LGEYEHISEG DTVKCTGRIL EVPVGPELIG RVVNALGQPI
DGKGPINAKM TDVIEKVAPG VIARKSVDQP VQTGLKSIDS MVPVGRGQRE LIIGDRQTGK
TAVAIDAIIN QKGQNMTCVY VAIGQKASSI KNVVRSLEAA GAMSYTIVVA ASASESAAMQ
YVSAYSGCTM GEYFRDRGED ALIIYDDLSK QAVAYRQVSL LLRRPPGREA YPGDVFYLHS
RLLERAARVN ADYVEKFTNG AVKGKTGSLT ALPIIETQAG DVSAFVPTNV ISITDGQIFL
ETNLFNAGIR PAINAGISVS RVGGAAQTKL VKGLSGGIRT DLAQYRELAA FAQFASDLDD
ATRKQLDRGA RVTELLKQQQ YQPLPISLMA ATLYSVNKGF LDDVDVKKVL AFESGLHQFL
KTSYAALLKK LEDSKALDKD SEAELAAAIG AFKKSFA