Gene Rsph17025_4004 details

Gene Information

Locus tag: Rsph17025_4004
Symbol:
ID: 5086179
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17025
Kingdom: Bacteria
Replicon accession: NC_009430
Strand:
Start bp: 34091
End bp: 35302
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 67%
IMG OID: 640485563
Product: hypothetical protein
Protein accession: YP_001170163
Protein GI: 146280006
COG category: [S] Function unknown
COG ID: [COG4102] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
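
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 34091
END_BP = 35302
GENE_LENGTH_BP = 1212
PROTEIN_LENGTH_AA = 403


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one terminal stop codon (assumed present)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values listed in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, 35302 - 34091 + 1 = 1212 bp, and 1212 / 3 - 1 = 403 aa, which matches the listed Gene Length and Protein Length.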


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGACC GGCGCATCTT CCTTCAGGGA CTTGCCGGCC TCGGCTGCTC GGTGGCGGCC 
CATCCCCTCG CCACGCCCGT GACCTTTGCC GGCACTCCAG CCGCGCTTGG CGAGAACCGG
CTTGTGGTGA TCGTCCTCCG CGGGGCGATG GACGGGCTCG ATCTGGTGCG ACCGGTCGGG
GATCCGCTCT TTGCCACCTA CCGGCCAAGG CTGGCGCAGG ATGGACAGAA GGGTGAGGGC
GGCGTGATCC CCCTCTCGGA AGGTTTCGTG CTTCATCCCT TGCTGAACGG GCTGTCCGCG
CTCTGGCGCA AGGGCGAACT GGGCTTCGTC CATGCTGTCT CCACTCCCTA CCGCGACAAG
CGCAGCCACT TCGATGGTCA GGACATGCTC GAGGCGGGCA CTGGACCGGA TGTTCCTCCC
GGCGTTGCTC GCGATGGGTG GCTGAACCGG ATGCTGCAGG CCGTGCCGGG TCTGGAAGCC
GAGACGGCCT GGGCCATCGG GCGCGATCCG CTACCGCTGA TGGAGGGCGA GGCCCCCGTG
CGCGTTTGGT CTCCGGACCT CGCGCTCGAC CTCTCGGCCC AGAACCGACG GCTTCTGGAA
GAGCTTTACC ACGAAGACCC GCTGTTCCGG GATGTAGGCA CCGAAGCCCT GGAACTGGCC
GCCAAACTGT CACATGGCGC GGGCGGCGAG GGTCCCCGCG ACGCGAAGAA GGATCTTGCG
GCCATTGACG AGCTCGCGGC CTTTGCGGCC TCTCAGTTGC GTGGGGCGGC CCGCATTGCC
GCCTTCTCTC TTTCGGGTTG GGACACGCAC CGCAACCAGG CTGCGGCCAT CCGCAAGCCG
CTCTTAAAAC TCCAGCGGAT GCTTCTTCAG CTGCAGGCGG AACTGGGACC GGTCTGGGAC
AGGACCGCCA TACTCGCCAT GACGGAGTTC GGCCGGACGG CCCGCGAGAA CGGGTCAGCG
GGGACCGATC ACGGCACGGC AGGTGCCATG ATCATGGCCG GCGGCGCGAT CCGGGGCGGG
CGGGTGCTGG GACGCTGGCC AGGACTTGAC GAGGCCGCCC TCTACGACCG GCGCGACCTC
ATGCCCACGT CGGACGTACG TGCCTGGGCG GCCTGGACCA TGCGGTCTCT CTACGGGTTT
GACCGGGAGC TGCTGGAGCG CAGTGTCTTT CCCGGCATCG AGATGAGAGA GGATCCGGGG
CTCGTGCTCT GA
 
Protein sequence
MIDRRIFLQG LAGLGCSVAA HPLATPVTFA GTPAALGENR LVVIVLRGAM DGLDLVRPVG 
DPLFATYRPR LAQDGQKGEG GVIPLSEGFV LHPLLNGLSA LWRKGELGFV HAVSTPYRDK
RSHFDGQDML EAGTGPDVPP GVARDGWLNR MLQAVPGLEA ETAWAIGRDP LPLMEGEAPV
RVWSPDLALD LSAQNRRLLE ELYHEDPLFR DVGTEALELA AKLSHGAGGE GPRDAKKDLA
AIDELAAFAA SQLRGAARIA AFSLSGWDTH RNQAAAIRKP LLKLQRMLLQ LQAELGPVWD
RTAILAMTEF GRTARENGSA GTDHGTAGAM IMAGGAIRGG RVLGRWPGLD EAALYDRRDL
MPTSDVRAWA AWTMRSLYGF DRELLERSVF PGIEMREDPG LVL
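
The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any downstream use. The following is a minimal sketch, assuming plain Python with no external libraries; `clean_sequence` and `gc_fraction` are hypothetical helper names, and `FIRST_LINE` holds only the first line of the gene sequence as a sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 to 1.0)."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


# First line of the gene sequence from this record, used as a sample input.
FIRST_LINE = (
    "ATGATCGACC GGCGCATCTT CCTTCAGGGA CTTGCCGGCC TCGGCTGCTC GGTGGCGGCC"
)

if __name__ == "__main__":
    sample = clean_sequence(FIRST_LINE)
    print(len(sample))          # number of bases in the sample line
    print(gc_fraction(sample))  # GC fraction of the sample line only
```

Running the same two functions over the full gene sequence would give a value that can be compared with the GC content field of the record.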