Gene Rsph17025_2203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_2203 
Symbol 
ID5084107 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp2233807 
End bp2235345 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content66% 
IMG OID640483766 
ProductF0F1 ATP synthase subunit alpha 
Protein accessionYP_001168398 
Protein GI146278239 
COG category[C] Energy production and conversion 
COG ID[COG0056] F0F1-type ATP synthase, alpha subunit 
TIGRFAM ID[TIGR00962] proton translocating ATP synthase, F1 alpha subunit 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTATCC AAGCTGCTGA GATCTCTGCG ATCCTCAAGG AGCAGATCAA GAACTTCGGC 
CAGGCTGCCG AAGTCGCCGA AGTGGGGCGC GTTCTCTCGG TGGGCGACGG GATTGCCCGC
GTTCATGGCC TCGACAACGT GCAGGCCGGC GAGATGGTGG AGTTCCCCGG CGGGATCCGC
GGCATGGCGC TCAACCTCGA GGTTGACAAC GTCGGCGTCG TGATCTTCGG CGACGACCGC
TCGATCAAGG AAGGCGACAC CGTCAAGCGC ACCAAGTCCA TCGTGGACGT GCCGGCCGGT
GATGCGCTGC TCGGACGCGT GGTCGATGGC CTCGGCAACC CGATTGACGG CAAGGGTCCG
ATCGCGGCGA CCGAGCGTCG CGTGGCCGAC GTGAAGGCCC CGGGCATCAT CCCTCGGAAA
GGCGTGCACG AGCCGATGGC GACCGGCCTC AAGTCGGTGG ACGCCATGAT CCCGATCGGC
CGCGGCCAGC GCGAGCTGAT CATCGGTGAC CGCCAGACCG GCAAGACCGC GATCGCGCTC
GACACGATCC TGAACCAGAA GAGCTACAAC GAAGCCGCCG GCGACGACGA GAGCAAGAAG
CTCTACTGCA TCTATGTCGC GATCGGCCAG AAGCGTTCGA CCGTCGCGCA GCTGGTGAAG
AAGCTGGAAG AGACCGGCGC CATCGCCTAC ACGCTCGTCG TGGCCGCGAC CGCCTCGGAC
CCGGCGCCGA TGCAGTTCCT CGCGCCCTAC GCCGCGACCG CGATGGCGGA ATATTTCCGC
GACAACGGCC GCCACGCGCT GATCATCTAC GATGACCTCT CGAAGCAGGC CGTCGCCTAC
CGCCAGATGT CGCTGCTGCT CCGCCGTCCG CCGGGGCGCG AAGCCTACCC GGGCGACGTG
TTCTACCTCC ACTCGCGCCT GCTCGAGCGT TCGGCCAAGC TGAACAAGGA GCATGGCTCG
GGCTCGCTGA CCGCGCTGCC GATCATCGAG ACGCAAGGCG GCGACGTGTC GGCCTTCATC
CCGACCAACG TGATCTCGAT CACCGACGGC CAGATCTTCC TTGAAACCGA ACTGTTCTAT
CAGGGCATCC GCCCGGCCGT TAACACCGGT CTGTCGGTGT CGCGCGTGGG CTCCTCGGCC
CAGACCGACG CGATGAAGTC GGTCGCGGGC CCGGTGAAGC TGGAACTGGC GCAATATCGC
GAGATGGCGG CCTTCGCCCA GTTCGGTTCG GACCTCGACG CCGCCACCCA GCAGCTGCTG
AACCGTGGCG CCCGCCTGAC CGAGCTGATG AAGCAGCCGC AATATGCGCC GCTGACCAAC
GCCGAGATCG TCTGCGTGAT CTTCGCCGGC ACCAAGGGCT ACCTCGACAA GGTTCCGGTG
AAGGACGTCG GCCGCTGGGA GCAGGGCCTG CTCAAGCACC TGCGCACCAA CGCCCGCGAT
CTGCTGGCGG ACATCACCAA CAACGACCGC AAGGTCAAGG GTGAGCTGGA AAACAAGATC
CGCGCCGCGC TCGACACCTA CGCCAAAGAC TTCGCCTGA
 
Protein sequence
MGIQAAEISA ILKEQIKNFG QAAEVAEVGR VLSVGDGIAR VHGLDNVQAG EMVEFPGGIR 
GMALNLEVDN VGVVIFGDDR SIKEGDTVKR TKSIVDVPAG DALLGRVVDG LGNPIDGKGP
IAATERRVAD VKAPGIIPRK GVHEPMATGL KSVDAMIPIG RGQRELIIGD RQTGKTAIAL
DTILNQKSYN EAAGDDESKK LYCIYVAIGQ KRSTVAQLVK KLEETGAIAY TLVVAATASD
PAPMQFLAPY AATAMAEYFR DNGRHALIIY DDLSKQAVAY RQMSLLLRRP PGREAYPGDV
FYLHSRLLER SAKLNKEHGS GSLTALPIIE TQGGDVSAFI PTNVISITDG QIFLETELFY
QGIRPAVNTG LSVSRVGSSA QTDAMKSVAG PVKLELAQYR EMAAFAQFGS DLDAATQQLL
NRGARLTELM KQPQYAPLTN AEIVCVIFAG TKGYLDKVPV KDVGRWEQGL LKHLRTNARD
LLADITNNDR KVKGELENKI RAALDTYAKD FA