Gene Rsph17029_2122 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_2122 
Symbol 
ID4895149 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp2250144 
End bp2251661 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content75% 
IMG OID640112716 
Producthypothetical protein 
Protein accessionYP_001043997 
Protein GI126462883 
COG category[S] Function unknown 
COG ID[COG2187] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGACC AGACCGAGGC GGTGCGCTTC CTCTCCGATC CCGCGACCCA TGGGGGCGCG 
CCGGTCGAGC GTGTCGAGAC CCATGGCGCC TTCGTCTTTC TCGCCGGATC TCAGGCCTTC
AAGATCAAGC GGGCGGTGCG CTACGACTAC ATGGACCTCT CGACGCTGGC GCTGCGGCAC
CGGATGCTCG CCCGCGAGTT GGAGCTGAAC CGGCCCGCCG CGCCCGCGAT CTACCGCGAT
CTGGTGGCGG TGACCCGCGA ACCGGAGGGG CTCGCCCTGG GCGGGCCGGG CGAGCCCGTA
GAGTGGGTGC TGCGCATGTG GCGCTTTCCG GCCGAGGACG AGCTCTCGGC CGTGGCCGAG
CGGGGCGGGC TCGACGACCG GCTGGCCGCG CGGCTCGGCC GGAGCCTTGC CGCCTATCAC
CGCGCGGCGC CCGTGCGCAC CTCCGACGGC GCGGCCCTCG TGGAGGCGAT CCTCGACGAG
CTCGGGCGGG TCTTCGCGGG CATGGAGGAT CGTCTCGGCG AGGAGCGGAT CGCGCGCTTC
GGATCGCAGG CGCGGGACAT GCTCGCGCGG ACGAAGCCCC TTCTGCGGGC GCGGTCGGAG
GCGGGCCGCG TGCGGCGGGG CCATGGCGAC CTGCATCTGC GCAATCTCGT CCTGATCGAG
GGCGAGCCCG TGCCGTTCGA CGCGCTCGAG TTCGACGAGA CGCTCGGCAC CTGCGACATC
CTCTACGATC TGGCGTTCCT CCTGATGGAT CTCGACCACC GCGGCCTGAC ACGCGCGGCC
GGGATCGCGC TCGCGGCCTG GCTCTTCGAG ATGCGGGGCG ACGAGGACGG AGGGCTGGCC
GCGCTGCCGC TGTTTCTCGC CGTCCGGGCG GCGATCCGGG CGATGGTGCT GGTGCAGACC
GATGCCGCGC GCGGCGCGCC GGGTCACAGC GATGCGGAGG CGCGAGGCCT GCTCGACGAG
GCGCTCCGGG CGCTCTCGCC GCCGCCGGCG CGGCTCCTGG CCATCGGTGG CGTCTCGGGC
AGCGGCAAGA CGGTGCTGGC GGCAGCGCTC GCGCCGCGGC TCGGGCTGCC GCCCGGGGCG
ATCCACCTGC GCTCGGATCT GGAGCGGAAG TCGCGGGCCG GGGTTCCGAC CGACCGCCGC
CTCGGCGGCG CGGCTTACGG CGAGGACGCG CGTCATGCGG TCTACGGGCA GATGCTGGCG
CGTGCCGGAG CGATCCTCGC CGCGGGCTGG CCGGTGATCC TCGATGCGAC CTTTCTCGAT
CCGGCGGACC GGGCGGCGGC GCGGGCGCTG GCCGCCGAGC GGGGCGTGCC GCTCGAGGGC
CTGTGGCTGG AGGCGCCGCC CGATCTGCTC GTCGCCCGCG TCTCGGCACG CCGGAGCGAT
GCCTCGGATG CGGACGAGGC CGTGGTGCGG GCGCAGCTTG CCCGCCAGCC CCGCGCCGGG
GATTGGCGGA TGCTCGACAC CGGCGGGCCG CCGGAGGAGG TGGCAAGCAG CGCCTGCCGC
GCCCTCGGCC TCGACTGA
 
Protein sequence
MQDQTEAVRF LSDPATHGGA PVERVETHGA FVFLAGSQAF KIKRAVRYDY MDLSTLALRH 
RMLARELELN RPAAPAIYRD LVAVTREPEG LALGGPGEPV EWVLRMWRFP AEDELSAVAE
RGGLDDRLAA RLGRSLAAYH RAAPVRTSDG AALVEAILDE LGRVFAGMED RLGEERIARF
GSQARDMLAR TKPLLRARSE AGRVRRGHGD LHLRNLVLIE GEPVPFDALE FDETLGTCDI
LYDLAFLLMD LDHRGLTRAA GIALAAWLFE MRGDEDGGLA ALPLFLAVRA AIRAMVLVQT
DAARGAPGHS DAEARGLLDE ALRALSPPPA RLLAIGGVSG SGKTVLAAAL APRLGLPPGA
IHLRSDLERK SRAGVPTDRR LGGAAYGEDA RHAVYGQMLA RAGAILAAGW PVILDATFLD
PADRAAARAL AAERGVPLEG LWLEAPPDLL VARVSARRSD ASDADEAVVR AQLARQPRAG
DWRMLDTGGP PEEVASSACR ALGLD