Gene Rsph17025_4151 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_4151 
Symbol 
ID5086323 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009430 
Strand
Start bp195801 
End bp197108 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content68% 
IMG OID640485713 
Producttransposase, IS4 family protein 
Protein accessionYP_001170307 
Protein GI146280150 
COG category[R] General function prediction only 
COG ID[COG3550] Uncharacterized protein related to capsule biosynthesis enzymes 
TIGRFAM ID[TIGR03071] HipA N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACACGGC GGGTCCCGGT CTGGTTCGAC AGGCTCCATC TGGCGGATAT CGAGGTCGCA 
GCCGATGGCG CACTCTCGCT GCGCTATGCC GAGCGCTGGT GCCTCACCGA CGGGGCGTTT
CCACTCTCCG TCACCATGCC GCTGCGTGCC GAGCCTTATC CGTCCGAGGT CGTCGCGCCC
TGGCTCGCGA ACCTTCTGCC GGAAGAGGAG CCGCTCCGCA TCCTGACCCG CTCGCTGGGG
CTCGACCAGG CCGATGTACT GGCCCTGCTG GAACAGATCG GCGGGGATAC GGCGGGGGCG
CTGTCCTTTG GCACGCCCAC GGACCGCGCC CGTTGGGCCT GGCGGTCGCT GACCGACCAC
TATGGCCGCG ACGACCCGGC CGAGGCGCTC GAGCGTCATT TCGAAGATCT CGGGCGGCGA
CCCTTTCTGG TGGGAGAGGA GGGGGTTCGG CAATCGCTGG CGGGCGGCCA GAAGAAATCC
GCCCTTGCGG TGCTGGATGC GCAGGGAAAC CCCGTTCTGC GCCTGCCCGG ACCGGACGAT
GTGCTGGCCA TTCCCCTGAA CGGTGCGCCC TCGACCCTGA TCGTGAAGCC GGACAATCCG
AACCTGCCGG GCCTGACCGA GAACGAGGTC TGGTGCCTGA GACTGGCCTC TGCCATCGGC
ATCCCGGCGG CGGAGGCGAC GATCCTGCGG GCCTCGGGCC GCAGTGCCAT TGCCGTCCTG
CGCTATGACC GCAGACTGGG GCGACAGGGA CAATTGCAGC GCCTGCATCA GGAGGATTTC
GCGCAGGCCA ATGGCCTGCC GCCGGGGCGG AAGTACGAGC GCGGCACGCG ACCCGGACTC
AATCTCGCCA CCCTCCTGCG CACGGCACGG CATGTGAGCG TGACAGATGC CCTCGCGCTG
CTCGACCAGG TGATCTTCAA CATCCTCGTC GCCAACACCG ACGCCCATGC CAAGAACTAT
TCCCTGATCC TGCCGATTGC CGGGCCACCG CGTCTGGCAC CGCTCTATGA TGTCTCCTCC
GTGCTGTCCT GGCCGCATGT CGTGCAGGCC TACGCCCAGA ACATCGCCGG AAAGAAGCGG
ACATCGGAGG GGATCGCGGC GCGCCACTGG GCAGCCATCG CCAAAGAGGT CGGCTATCGC
CCGCGGGACG TGCTCAACCG TGTCCAGGAC CTGATTGACA GGATCGTCGC GCATCGCGTC
GGGGTGACGG AGGAGGTCGC GCGTCTGCCC GGCGCGACCG AAGGGTATGT GGCGCAGACG
GCGGAGCTGG TGGACGGCAA CGCGCTTCGG ATGGCCGGGC GTCTTTAA
 
Protein sequence
MTRRVPVWFD RLHLADIEVA ADGALSLRYA ERWCLTDGAF PLSVTMPLRA EPYPSEVVAP 
WLANLLPEEE PLRILTRSLG LDQADVLALL EQIGGDTAGA LSFGTPTDRA RWAWRSLTDH
YGRDDPAEAL ERHFEDLGRR PFLVGEEGVR QSLAGGQKKS ALAVLDAQGN PVLRLPGPDD
VLAIPLNGAP STLIVKPDNP NLPGLTENEV WCLRLASAIG IPAAEATILR ASGRSAIAVL
RYDRRLGRQG QLQRLHQEDF AQANGLPPGR KYERGTRPGL NLATLLRTAR HVSVTDALAL
LDQVIFNILV ANTDAHAKNY SLILPIAGPP RLAPLYDVSS VLSWPHVVQA YAQNIAGKKR
TSEGIAARHW AAIAKEVGYR PRDVLNRVQD LIDRIVAHRV GVTEEVARLP GATEGYVAQT
AELVDGNALR MAGRL