Gene Rsph17025_4073 details

Gene Information

Locus tag: Rsph17025_4073
Symbol:
ID: 5086246
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17025
Kingdom: Bacteria
Replicon accession: NC_009430
Strand:
Start bp: 124835
End bp: 126340
Gene Length: 1506 bp
Protein Length: 501 aa
Translation table: 11
GC content: 73%
IMG OID: 640485636
Product: hypothetical protein
Protein accession: YP_001170230
Protein GI: 146280073
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1596] Periplasmic protein involved in polysaccharide export
TIGRFAM ID:
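The gene and protein lengths listed above follow from the start and end coordinates. The sketch below shows that arithmetic. It assumes the coordinates are 1-based and inclusive (which is consistent with the listed values) and that the final codon is a stop codon; the function names are illustrative and not part of the source database.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues for a CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(124835, 126340)
print(length)                     # 1506
print(protein_length_aa(length))  # 501
```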


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000220985
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 0.0992314
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGATGCCTG AAGGTTCTCG CCGCCTTTCC ACCAGCCGTC TTCTGCGCGT GGCAGCCCTC 
GTCGCCGGAG GTTTCGTCCT TGTCGTGGGC GCGACCCGGT TCGCCGACGA AGGGTTCCTC
CAGCGCCTGC GGGCGGATTA TCCGCTCGTC TTCGGCCGGG TCGAGACGCT CCTCGACACG
CGGCGGGCGG CCGGGCCCGT CGAGCCCGCG ATCCGCCCCG AGGAGGTGGT GGCGCTGACG
GTCGAGCGGC CGGGCCTGCC CGCCAACTGC CTCGAGCCCG GCCCGGCCCG GGCCTCGCTC
GCCGTGCCGG GCGACAGGCT TCGGATCCGC TTCTTCGAGA AGGGCGGCTT CTCGACCGGC
GAGGCCGGCG GAGCGGGCGC GGCGCAGACC CTCGTCTTCG AGCGCCTCGA CCTGTCGGGC
ACCTACGAGG TGGCCTCGGA CGGCAGCCTT GCCCTGCCGC TTCTGGGGCG GCTGCCCGTG
CAGGGGCGCG AGCTGGTCTG CATCGAGGCG GCGCTGGCCG ACGGTTACAT CGGGATCCTG
AACGCGCCTC TCGATGCCAC GGTCAGTTTC GAGAGCCGCC CCCCGGTGGT GCTGCGCGGC
CCGGTGCGCG CGCCCGGCAC CTATGGCTGG ACCGAGGGGC TGACGGTGGC CCGGCTGATC
GCCTCGGCCG GGTCGGCGGC GATGGGCGGC TATGACAGCC TCGGTCGCCG GGTCGAGCTG
GAGGCGCGGG TGCGCGAGCT GCGCGACCGG ATGCTGGGCG TCGCGCTGGA ACGGGCCCGG
ACCGAGGCCG CGATCGAGCG GCAGCGGGCG CTGAAGCTGC CGGCCTCGGA GCTTGACTAC
ATGGGGGTGG AGCTTGGCCT GCGGCGGATC GAGGGCGAGA CGCAGGCCCT CGTGGCCGAA
CTCGACGCCT TCGAGGCGAT CGAGAGCCGC TGGCAGACCG AGGTGGCCGA CCTCGGGCGC
CGCCTCGCCG AGATGCGGCG CCACCACCAG ATCGCGCAGG AGCAGCTCGA GGTGCTGCGC
CAGCGGCGCG AGGAACTGTC GGATCTCAGC GGGCGCGGCG TGACGACCGC GGCGCGGCTC
GACGCGGCCA CGCTGAACCT GATGGGCAGC GAGCGCGCCA TGCTCGAGAC CTTCGACGCC
CTGCTCGCGC TGGAATCGCA GCTGAACATC GCCCGACTGT CGCTGGAGCA GGCCCGCACC
GACCGGAGCC GGCGCCTCGC GGCTGAACTG CGCGAAGAGG CCGAGGAGGA GAACCTGCTC
AAGGGCCAGC TGCGCGCGGT GCAGGCCGAG ATCGCCCGGG TCGATCTGGG CGACGGGCTG
GTGGAGGGCT TCGTGCCCGT CGTCGAGATC GAGCGGCCGG GCCCCGAGGG CGTCCGCCGG
ATCCAGGTCG CGCCCGAGGA CGAGGTCTTT CCGGCCGATC TGGTCACGAT CTCGATCCCC
GGCCGCGATC TCGTGATGCC GGTCCGCTCG TCGGAGGACG ACGGCCGGTC CAGCCTGCTG
CGGTAG
 
Protein sequence
MMPEGSRRLS TSRLLRVAAL VAGGFVLVVG ATRFADEGFL QRLRADYPLV FGRVETLLDT 
RRAAGPVEPA IRPEEVVALT VERPGLPANC LEPGPARASL AVPGDRLRIR FFEKGGFSTG
EAGGAGAAQT LVFERLDLSG TYEVASDGSL ALPLLGRLPV QGRELVCIEA ALADGYIGIL
NAPLDATVSF ESRPPVVLRG PVRAPGTYGW TEGLTVARLI ASAGSAAMGG YDSLGRRVEL
EARVRELRDR MLGVALERAR TEAAIERQRA LKLPASELDY MGVELGLRRI EGETQALVAE
LDAFEAIESR WQTEVADLGR RLAEMRRHHQ IAQEQLEVLR QRREELSDLS GRGVTTAARL
DAATLNLMGS ERAMLETFDA LLALESQLNI ARLSLEQART DRSRRLAAEL REEAEEENLL
KGQLRAVQAE IARVDLGDGL VEGFVPVVEI ERPGPEGVRR IQVAPEDEVF PADLVTISIP
GRDLVMPVRS SEDDGRSSLL R
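The protein sequence can be derived from the gene sequence, and the GC content can be recomputed from it. The following is a minimal sketch, not the database's own pipeline. It uses the standard codon assignments, which translation table 11 shares for non-initiator codons; alternative start codons are not handled, which does not matter here because the gene starts with ATG. The helper names (clean, gc_content, translate) are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence shown above.
first_30 = "ATGATGCCTG AAGGTTCTCG CCGCCTTTCC"
print(translate(first_30))  # MMPEGSRRLS
```

Passing the full gene sequence to translate and gc_content should allow a comparison against the protein sequence and the GC content listed in this record.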