Gene Rsph17029_4103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_4103 
Symbol 
ID4895039 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009040 
Strand
Start bp44742 
End bp46235 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content73% 
IMG OID640110505 
ProductATP synthase F1, alpha subunit 
Protein accessionYP_001041817 
Protein GI126464841 
COG category[C] Energy production and conversion 
COG ID[COG0056] F0F1-type ATP synthase, alpha subunit 
TIGRFAM ID[TIGR00962] proton translocating ATP synthase, F1 alpha subunit 


Plasmid Coverage information

Num covering plasmid clones64 
Plasmid unclonability p-value0.0134913 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones108 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGGTG ATCCGGGACT GGAGGCGCTG AAGGCGCGGG TGGCGGAGGT TCCGCTCGGC 
CCCGAGATCG AGGAGACCGG CCGCATCGCC ACGCTGGCCG ACGGGCTCGT AGAGGTCGAG
GGCCTGCCCG GCGCGCGGCT GGGCGAGGTG GTGCGTTTCG CGGGCGGCGC CGAGGGGCTG
GTGCTGACCC TCGATCCCGA GACGGTGCAG GTGGCGCTGC TCGATCCCGG CGCGGCCCTG
GGCTCGGGCA CCGAGGTGCG CCGCACCGGG CAGCTCCTGT CGGTGCCGGT GGGCCAGGGG
CTTCTGGGCC GCGTCGTCGA TCCGCTCGGC CGTCCGCTCG ACGGACTGCC CGCGATCCTG
CCCGAGGCCA GGCTCGAGAT CGAGCGCCCG GCCCCCGGCA TCGTCGACCG CGACATGGTG
GCCGAGCCGG TGGAGACGGG CCTTCTGGTG GTGGATGCGC TCTTCGCCGT GGGCCGCGGG
CAGCGCGAGC TCATCATCGG CGAGCGCGCC ACCGGCAAGA CCTCCCTCGC GGTCGATGCC
ATCGTGAACC AGGCCGCGAG CGACATCGTC TGCTTCTATG TGGCCATCGG CCAGCGCACG
ACGGCCGTCC GCCGGGTGAT CGAGACCGTG CGCGAGAAGG GGGCCTTCGC GCGCACGGTC
TTCGTGGTGG CGCCCGCGAC GGCTTCGCCC GGCCTGCGCT GGATTGCGCC CTTCGCCGCG
ACCTCCATGG CCGAATGGGT GCGCGACCGG GGCGGGCATG CGCTGATCGT CTATGACGAT
CTGACCAAAC ATGCGGCCGT CCACCGCGAG CTTGCGCTGC TCGCGCGCCA GCCGCCGGGG
CGCGAGGCCT ATCCGGGCGA CATCTTCTAC CTCCATGCGC GGCTTCTGGA GCGCTCGGCA
AAGTTGTCGG CTGTCAACGG CGGCGGCTCG CTCACCGCGC TGCCCATCGC CGAGATCGAG
GCGGGCAACC TCTCGGCCTA TATCCCGACC AACCTGATCT CGATCGCCGA TGGCCAGATC
GTGACTTCGG CCGCGCTCTT TGCCGCCAAC CAGCGCCCCG CGGTGGATAT CGGCCTGTCG
GTCAGCCGCG TGGGCGGCAA GGCGCAGCGG GGCGCGCTGA AGGCGGTGGC GGGGCGGGTG
CGGCTCGATT ATGCGCAATA TCTCGAGATG AAGATGTTCT CGCGCTTCGG CGGCTTCGGC
GATGCGGCCC TGCGCGCGCG TCTGGCGCGC GGAGAGCGGA TCGGCGCGCT TCTCGCCCAG
CCGCGCACGA CCCCGCTCTC GACTCCGGTG CAGGTGGCGC TGCTGGCCGC GCTGGCCGAG
GGCGCGCTCG ACGATGTGCC GCTCGAGGAT CTGACCCGGC TCAAGGCCGC GCTCGGGCCG
GTGCTGGCCG CGGATGCCTC GCTCGGCCCC TTCTGCGCGG CCCCCGACCG GCTGGAGCCC
GAGACCCGCG CGGCGCTTCT GGCCTGTGTC CGCCGCGCGC GGGAGGCGCC ATGA
 
Protein sequence
MSGDPGLEAL KARVAEVPLG PEIEETGRIA TLADGLVEVE GLPGARLGEV VRFAGGAEGL 
VLTLDPETVQ VALLDPGAAL GSGTEVRRTG QLLSVPVGQG LLGRVVDPLG RPLDGLPAIL
PEARLEIERP APGIVDRDMV AEPVETGLLV VDALFAVGRG QRELIIGERA TGKTSLAVDA
IVNQAASDIV CFYVAIGQRT TAVRRVIETV REKGAFARTV FVVAPATASP GLRWIAPFAA
TSMAEWVRDR GGHALIVYDD LTKHAAVHRE LALLARQPPG REAYPGDIFY LHARLLERSA
KLSAVNGGGS LTALPIAEIE AGNLSAYIPT NLISIADGQI VTSAALFAAN QRPAVDIGLS
VSRVGGKAQR GALKAVAGRV RLDYAQYLEM KMFSRFGGFG DAALRARLAR GERIGALLAQ
PRTTPLSTPV QVALLAALAE GALDDVPLED LTRLKAALGP VLAADASLGP FCAAPDRLEP
ETRAALLACV RRAREAP