Gene GM21_4037 details


Gene Information

Locus tag: GM21_4037
Symbol:
ID: 8139411
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 4620828
End bp: 4622336
Gene Length: 1509 bp
Protein Length: 502 aa
Translation table: 11
GC content: 62%
IMG OID: 644871653
Product: F0F1 ATP synthase subunit alpha
Protein accession: YP_003023811
Protein GI: 253702622
COG category: [C] Energy production and conversion
COG ID: [COG0056] F0F1-type ATP synthase, alpha subunit
TIGRFAM ID: [TIGR00962] proton translocating ATP synthase, F1 alpha subunit
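
The start, end, gene length, and protein length listed above agree with one another if the coordinates are read as inclusive and 1-based and the final codon is a stop codon. Both of those readings are assumptions made for this sketch, not something the record states, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming inclusive 1-based start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record for GM21_4037.
length = gene_length_bp(4620828, 4622336)
print(length, protein_length_aa(length))  # prints: 1509 502
```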


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Fosmid covering clones (Num covering fosmid clones): 53
Fosmid unclonability p-value: 0.00278502
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGAAATCA AAGCGGAAGA AATCAGCGAG ATTATCAGGA AGCAGATCAA GGAATACGGC 
ACTGAAGTAG CTGTTGCCGA AACGGGGACC ATTATCTCCA TCGGCGACGG TATCGCACGT
ATCCACGGTC TTGACAAGGC GATGGCAGGC GAGCTCCTCG AGTTCCCCGG CGGCATCACC
GGCATGGTTC TGAACCTCGA GGAAGACAAC GTCGGCGCGG CGATCCTCGG CGAGTTCTCC
GAGATCAAGG AAGGTGACTC GGTCAAGCTG ACCGGCAAGA TCGTCGAGGT TCCGGTAGGC
CCGGCCCTGA TCGGCCGCGT GGTCGACGCG ATCGGTAACC CGATCGACGG CCTCGGCCCG
ATCAACACCG ACACCTTCGG CAAGGTGGAA GTGAAGGCCC CCGGCATCGT CAAGCGTAAG
TCGGTGCATC AGCCGATGCA GACCGGCCTC AAGGCGATCG ACTCCATGGT TCCGATCGGG
CGTGGACAGC GCGAGCTGAT CATCGGCGAC CGTCAGACCG GCAAGACCGC CGTCGCGATC
GACACCATCA TCAACCAGAA GGGTGGCGAC GTGGTCTGCA TCTACGTCGC AATCGGCCAG
AAGCGCTCCA CGGTCGCCCA GGTTGTTTCC AAGCTGAAAG AGCACGGCGC CATGGATTAC
ACCATCGTCG TCGCCGCAAC CGCCTCCGAG CCGGCACCGC TGCAGTTCAT CGCACCGTAC
ACCGGCGTCA CCATGGGCGA GTTCTTCCGC GACTCCGGCA AGCACGCCCT CATCATCTAC
GATGACCTCT CCAAGCAGGC CGTCGCTTAC CGCCAGCTCT CCTTGCTCCT TCGCCGTCCG
CCGGGGCGCG AAGCCTATCC GGGCGACGTC TTCTACCTGC ACAGCCGTCT CCTCGAGCGT
GCCTGCAAGG TTTCCGACGA CTGCGGCGCC GGCTCCCTGA CGGCTCTCCC GGTCATCGAG
ACCCAGGCGG GCGACGTTTC CGCGTACATC CCGACCAACG TGATCTCGAT CACCGACGGC
CAGATCTACC TGGAGAGCGA CCTGTTCTAC TCCGGCGTAC GTCCCGCCAT CAACGTCGGC
CTCTCCGTTT CCCGCGTCGG CGGCTCGGCT CAGGTTAAGG CGATGAAGCA GGTCGCAGGT
ACCCTCCGTC TGGCACTGGC TCAGTACCGC GAGATGGCGG CCTTCGCCCA GTTCGGCTCC
GACCTGGACA AGGCTACCCA GATGCAGCTC GCCCGCGGCG CACGCCTGGT CGAGATCCTG
AAGCAGCCGC AGTACCGTCC GATCCCGAAC GAGAAGCAGG TCCTGATCAT CTTCGCTGCC
AACAACGGCT TCGTCGATGA GTACCCGATC GGCTCCCTCG GCCGCTACGA GACCGAACTC
TACGCATTCT TCGACTCCAG GAAGGCGACT CTCCTGGGCG AACTGCGCGA CAAGAAAGCG
ATCGACGACG CCATGAAGGG CGAGATCATC GCTTCCCTCG AAGAGTTCAA GAAGGAATTT
ACTGCCTAA
 
Protein sequence
MEIKAEEISE IIRKQIKEYG TEVAVAETGT IISIGDGIAR IHGLDKAMAG ELLEFPGGIT 
GMVLNLEEDN VGAAILGEFS EIKEGDSVKL TGKIVEVPVG PALIGRVVDA IGNPIDGLGP
INTDTFGKVE VKAPGIVKRK SVHQPMQTGL KAIDSMVPIG RGQRELIIGD RQTGKTAVAI
DTIINQKGGD VVCIYVAIGQ KRSTVAQVVS KLKEHGAMDY TIVVAATASE PAPLQFIAPY
TGVTMGEFFR DSGKHALIIY DDLSKQAVAY RQLSLLLRRP PGREAYPGDV FYLHSRLLER
ACKVSDDCGA GSLTALPVIE TQAGDVSAYI PTNVISITDG QIYLESDLFY SGVRPAINVG
LSVSRVGGSA QVKAMKQVAG TLRLALAQYR EMAAFAQFGS DLDKATQMQL ARGARLVEIL
KQPQYRPIPN EKQVLIIFAA NNGFVDEYPI GSLGRYETEL YAFFDSRKAT LLGELRDKKA
IDDAMKGEII ASLEEFKKEF TA
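
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence and by recomputing its GC fraction. The following is a minimal sketch, not the database's own method. It assumes the 10-base blocks are separated only by whitespace, and it uses the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code (table 11 differs in its permitted start codons, and this sketch does not handle alternative start codons). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# NCBI codon ordering: bases T, C, A, G with the third position varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence from the record.
first_row = "ATGGAAATCA AAGCGGAAGA AATCAGCGAG ATTATCAGGA AGCAGATCAA GGAATACGGC"
print(translate(first_row))  # prints: MEIKAEEISEIIRKQIKEYG
```

Passing the full gene sequence to `translate` and `gc_fraction` gives values to compare with the listed protein sequence and GC content.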