Gene Rsph17025_4071 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_4071 
Symbol 
ID5086244 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009430 
Strand
Start bp120988 
End bp122313 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content77% 
IMG OID640485634 
Producthypothetical protein 
Protein accessionYP_001170228 
Protein GI146280071 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.266813 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.0727287 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGACA GGATCCTGAT GCTGGGCGGC GGCGGCCTGA TGGCGCTTTT CGCGGTGCTG 
GCGCTGCGGC CCGCCGAGAC CGTCCGGCTC GAGGGCGAGC TTGGCTCGAT GACGCCCGAG
GCAGCGATGG CCCGGCTCGA GGCCATGGCG GGACGGGTGG AGCTGACCGA CAACCTGCGC
CTCGTGCAGG CCGACCTGGC GCTCAGGGCC GGCCATCCGG CCGCCGCCGA ACGGGCGCTG
GCCGGGGACC GGCCGCAGGA GGCGGGGGCG GGGGCCGAGG CCGTGCTGGC CGACCGGCGG
GCCGAGATCG CGCGCATGGC CGGCGATCTC GAGGCGGCGG TGCGCCACCT GCGGCGGGCC
CAGGAGCTGA AACCGGATGC CGGGCGGCGC CACAGGCTCG GCTACTGGCT GCGCCTGCTG
GGCGACGAGC CGGCCGAACT CGAGCTGCTG GCCGGCGTGC CGACGGTGCA GCTCCAGCCC
TGGGAGGCGG GGCGGCTCGC GCAGCTTCTG GTGCGGGCGG GCCGGACCGA GGAGGTCGAA
TGGCTGTTGC GCGCGGCCGC GGAGGGGCCG GACCCGCTGG CCGGGGCGAT GCGGCCGCGC
CTGCTCGATG TCCTGCTCGA GACGGGGCGC CGCGACGAGG TGGTGCCGCT CGCCCTCGCC
TGGTGCGCCG CCGCCGGAAA CACCGACCCG CTCGAGGCGG CCCTGCCGGT CCTGATCAAC
CGGGGCGCGC TGGCCGAGGC CTATGCGCTG GCGCGTTCGG CGCTCGAGCG CGAGCCGGCG
GCCGCCCACC GGCTGCTGCC CCTCTTTGCG CGCGGCGGCC ACCGGGCGAT GACCTTCGAC
CTGCAGGCGC GCTGGATCGC GGAAACCCCG GAGATGGATG CCCGCGGCTG GCGGACGCTG
CTCACCGTCA CCGAGATCAC CGGCGACCTG CGCGGCCTGC GCGGCGCGCT CGAACGCTCC
GGTCCGGGGC TCGAGCCGGA GATCACCGGC CGGGCGCTGC TGCAGTTCCT GCGCTTTCAG
GGGCCGTCGG CGCTGCTGCC CTGGCAGGAC CGGATGACGC CGGATCTGGT CCGGGCCGAG
CCGCTGGTGG GGGCGGCCTT CATGGGGCTG CAGGGCCGGC CCGAGGAGGT CCACCGCCTG
CTCGCCCTCG CCGCCGCGCG GCCGCTGTCG GAATGGGACC GCACGCTCTG GCTGTCGCTG
GCCGGATCGC TCCGCGGCAC GGCCGGCCAC GCCGATCTCA TGGCCCGCCC CGGCGCCAAC
CCGGACCTGC CGGCGGCGCT GCTCGCGACC TGGCGCAGCC CCGTCAGCGG ATTGCCGGGG
CCCTGA
 
Protein sequence
MRDRILMLGG GGLMALFAVL ALRPAETVRL EGELGSMTPE AAMARLEAMA GRVELTDNLR 
LVQADLALRA GHPAAAERAL AGDRPQEAGA GAEAVLADRR AEIARMAGDL EAAVRHLRRA
QELKPDAGRR HRLGYWLRLL GDEPAELELL AGVPTVQLQP WEAGRLAQLL VRAGRTEEVE
WLLRAAAEGP DPLAGAMRPR LLDVLLETGR RDEVVPLALA WCAAAGNTDP LEAALPVLIN
RGALAEAYAL ARSALEREPA AAHRLLPLFA RGGHRAMTFD LQARWIAETP EMDARGWRTL
LTVTEITGDL RGLRGALERS GPGLEPEITG RALLQFLRFQ GPSALLPWQD RMTPDLVRAE
PLVGAAFMGL QGRPEEVHRL LALAAARPLS EWDRTLWLSL AGSLRGTAGH ADLMARPGAN
PDLPAALLAT WRSPVSGLPG P