Gene Rsph17029_1874 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_1874 
Symbol 
ID4896597 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp1982960 
End bp1984060 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content72% 
IMG OID640112468 
ProductOmpA/MotB domain-containing protein 
Protein accessionYP_001043750 
Protein GI126462636 
COG category[N] Cell motility 
COG ID[COG1360] Flagellar motor protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0773204 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGCAA AGCCCAAGGT CATCCGGTTC CAGCCGCCCG TCCCCGACGA CGACGAGGGC 
GAGGACTGCC CGAAATGTCC GCCCCCCGGC GCGCCGGCAT GGCTTGCGAC CTTTGCCGAC
ATCGCGACCA ACCTCATGGC CTTCTTCGTG CTGATCCTGG GCTTCGCGAA GTTCGACGAG
CCCTCGTTCA GCAAGATGGC GGGGGCGATG CGGGAGACCT TCGGCTTCCA TTCGATCCGG
GATGCGACCT CGGGCAACAC GATGATCGAC TTCGGCCTGC CCACCGCCGA TCCCGACGGG
GCCCAGCCGG ACGAGAAGTC GGACACGGGC GGGTCGGAGG ACGGCGGCGA CGCGGCGGAG
CGGGTGGCCG AGGCGCTGAA GAAGGCGCTC GAGGACGGCA AGCTGCAGGT GCGCTCGGAC
GAGGGCGAGG TCGTGATCGA GCTGTCGGGC GAGGACGGAC GGCAGCAGGC GCAGAGCCTC
GCGCGGGCTC TGGCAGAGAC CGCGGGGCTT GGTCCGCTCC CCGAGCCGCA GACCACGGCC
CAGCCGCGGC CCGAGCCGAA GGCCGGGCCT GCGGGCCCCG GAGAGGGCAC GGGGGCGCCG
CCCGGGCCGC CCGTTGGCGG CGACACGGGC GCTGCGCTGC GCCAGTCGGT GCGGGCCGAA
CTCGATGCGC TCCGGCTGCG CAATGCGCTC GACCGCGAAG TGGCGGAGGG GCTGGTGAAG
GTGGAGCAGA CCGACGGCAA GGTGTTCGTG AGCCTCGGCG CGGGCGGATC CTTCCCCTCC
GGCTCCGACG ACCTCACGCC CGATGCGCGC GCGGTCATGG CCCGGATCGC CGAGGCCACG
CGCAACCCCG AACGCACCAT CACCGTGACG GGCCATACCG ACAATGTCCC CGTGTCGGGC
GGCGCCTTCC GGGACAATAT CGCGCTCGCC GCCGGGCGCG CCGCAAGCGT GGTGCGCGAG
CTTGTCGCCT CGGGCAGCGT CGATCCCGGA CGCATCACCG CGGTGAGCCG CGGCGAGTTC
GACCCGGTGG CGGACAATGC AACCGAGGAA GGCCGGGCGC AGAACCGCCG GATCGAGATC
GAGATTTCCT ACAAGGACTG A
 
Protein sequence
MSAKPKVIRF QPPVPDDDEG EDCPKCPPPG APAWLATFAD IATNLMAFFV LILGFAKFDE 
PSFSKMAGAM RETFGFHSIR DATSGNTMID FGLPTADPDG AQPDEKSDTG GSEDGGDAAE
RVAEALKKAL EDGKLQVRSD EGEVVIELSG EDGRQQAQSL ARALAETAGL GPLPEPQTTA
QPRPEPKAGP AGPGEGTGAP PGPPVGGDTG AALRQSVRAE LDALRLRNAL DREVAEGLVK
VEQTDGKVFV SLGAGGSFPS GSDDLTPDAR AVMARIAEAT RNPERTITVT GHTDNVPVSG
GAFRDNIALA AGRAASVVRE LVASGSVDPG RITAVSRGEF DPVADNATEE GRAQNRRIEI
EISYKD