Gene GM21_4035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_4035 
Symbol 
ID8139409 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4618483 
End bp4619895 
Gene Length1413 bp 
Protein Length470 aa 
Translation table11 
GC content62% 
IMG OID644871651 
ProductF0F1 ATP synthase subunit beta 
Protein accessionYP_003023809 
Protein GI253702620 
COG category[C] Energy production and conversion 
COG ID[COG0055] F0F1-type ATP synthase, beta subunit 
TIGRFAM ID[TIGR01039] ATP synthase, F1 beta subunit 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value0.000097844 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGTCAGA ACTTCGGAAA AATATCGCAG GTCATCGGCG CAGTCATCGA CGTCGAATTC 
GAGCCGGGCA AGCTTCCCCC GATCTATCAA GCTCTCAGGG TCACCAACCC TGCGATCGAC
GATCAGGAGT TCAACCTTGT TCTGGAAGTC GCACAGCACC TGGGCGAGAA CGCGGTCAGG
ACCATCGCAA TGGACTCCAC CGACGGTCTG GTTCGTGGCC AGCAGGTCAA AGACATGGGC
AAGCAGATCT CGGTACCGGT CGGCAAGAAG ACCCTGGGCC GCATCCTGAA CGTCATCGGC
GAGCCGGTTG ACGAGATGGG CCCGATCGGC AACGAGAAAG AGTACGGCAT CCACCGCGAA
GCCCCGGCCT TCGTGAACCA GTCGACCAAG GTCGAGGCGT TCACCACCGG GATCAAGGTC
GTCGACCTCC TCGCACCGTA CGCAAGGGGG GGCAAGATCG GCCTCTTCGG CGGCGCAGGC
GTCGGCAAGA CCGTTCTCAT CATGGAGCTG ATCAACAACA TCGCCAAGCA GCACGGCGGT
TTCTCCGTCT TCGCTGGCGT CGGCGAGCGT ACCCGTGAAG GGAACGACCT CTGGATGGAG
ATGAAGGAAT CCGGCGTTCT CGACAAGGCG GCACTCGTGT ACGGCCAGAT GAACGAGCCC
CCGGGGGCGC GCGCCCGCGT CGCTCTCTCC GCGCTCTCCA TCGCAGAGTA CTTCCGCGAC
GAGGAAGGCC AGGACGTGCT CCTCTTCGTC GACAACATCT TCCGCTTCAC CCAGGCGGGT
TCCGAGGTTT CCGCACTCCT CGGCCGGATC CCCTCCGCCG TCGGTTACCA GCCGACCCTG
GCTACCGAAA TGGGCGAGCT GCAGGAGCGC ATCACCTCGA CCAACAAGGG TTCCATCACC
TCGGTCCAGG CGATCTACGT TCCGGCCGAC GACTTGACCG ACCCGGCTCC GGCTACCGCC
TTCGCCCACC TGGACGCGAC CACCGTTCTC TCCCGTCAGA TCGCCGAGCT CGGCATCTAC
CCGGCAGTGG ACCCGCTCGA CTCCACCTCC AGGATCCTCG ATCCGCAGGT AATCGGCGAC
GAGCACTACG CCATCGCGCG CCAGGTTCAG TACGTGCTGC AGAAGTACAA GGATCTCCAG
GACATCATCG CGATCCTCGG TATGGACGAG CTCTCCGAGG AGGACAAACT GGTGGTCGCA
CGCGCCAGGA AGATCCAGAA GTTCCTCTCC CAGCCGTTCC ACGTCGCGGA AGCCTTCACC
GGCTCCCCGG GCAAATACGT CGAACTGAAG GACACCATCA AGGGCTTCTC CGAGATCATC
GCCGGCAAGC ACGACGACCT CCCCGAGCAG GCCTTCTACA TGGTCGGCAC CATCGAGGAA
GCTATCGAGA AGGCCCAGAA GCTCGCGGTG TAA
 
Protein sequence
MSQNFGKISQ VIGAVIDVEF EPGKLPPIYQ ALRVTNPAID DQEFNLVLEV AQHLGENAVR 
TIAMDSTDGL VRGQQVKDMG KQISVPVGKK TLGRILNVIG EPVDEMGPIG NEKEYGIHRE
APAFVNQSTK VEAFTTGIKV VDLLAPYARG GKIGLFGGAG VGKTVLIMEL INNIAKQHGG
FSVFAGVGER TREGNDLWME MKESGVLDKA ALVYGQMNEP PGARARVALS ALSIAEYFRD
EEGQDVLLFV DNIFRFTQAG SEVSALLGRI PSAVGYQPTL ATEMGELQER ITSTNKGSIT
SVQAIYVPAD DLTDPAPATA FAHLDATTVL SRQIAELGIY PAVDPLDSTS RILDPQVIGD
EHYAIARQVQ YVLQKYKDLQ DIIAILGMDE LSEEDKLVVA RARKIQKFLS QPFHVAEAFT
GSPGKYVELK DTIKGFSEII AGKHDDLPEQ AFYMVGTIEE AIEKAQKLAV