Gene Rsph17025_4093 details


Gene Information

Locus tag: Rsph17025_4093
Symbol:
ID: 5086266
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17025
Kingdom: Bacteria
Replicon accession: NC_009430
Strand:
Start bp: 143645
End bp: 144715
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 70%
IMG OID: 640485656
Product: hypothetical protein
Protein accession: YP_001170250
Protein GI: 146280093
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1596] Periplasmic protein involved in polysaccharide export
TIGRFAM ID:
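
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon. The function names are hypothetical and not part of any database tooling.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(143645, 144715)
print(length)                     # 1071
print(protein_length_aa(length))  # 356
```

Under those assumptions, both values agree with the Gene Length and Protein Length fields listed in the record.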


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.818341
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value: 0.104815
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGCACC GACTGACGGT TGTTCTTGTG GGAAGCCTGC CGGTTCTCGC AGGCTGTTCG 
GAAGGCCATG TGCGCTTTCC GGTGACGGAG AGCGCGCAGA AGGCGCTTCC GGAGAATGTG
CAGGTGATCC GGCTGGATGC GGAGAACATC CGCAGCTTCG AGGTGCCGGC GGAACCGCAC
CAGGCGACGC GGCTGCCGGC CGGCGGAGGA TGGGACTACC GGATCGGGGT GGGGGACATC
CTGGGGATCA CGGTGTTCGA CCATCCCGAA CTCATGCTGC CGGGGGGCGA GAAGACCGCC
GGGGAGAGCG GCTTCCGGGT GCAGGGGGAC GGGACGGTGG CCTTTCCCTA CGTGGGGGCG
GTGCGGGCGA AGGGCCGGGC GCCGGAGGAG GTGCGCGAGG AACTCCGGAC GCGGCTTGCG
GCCTTCATCC CCGAGCCGCA GGTGGATGTG CGGGTGACGG CCTTCAACTC GCAGGCGGTG
AGCGTGACGG GGGAGGTGAG AACCCCGAAC CGGCAGGCGC TGACCACGGT CGAACTGACG
CTTCTCGATG CCATCAACGC GGCGGGGGGA CTGGCCGAGA CGGCGGACGC GCGGCGGGTG
ACGGTCCGGC GCGGCACGAG CTCCTACAGG GTCGATCTCG AGGGGTTCCT GACCGCGGGG
CTCGGGAGCA ACAACCCGGT ATTGCGGCCG GGCGACATCG TCACGGTGCC GCGCCGGCAG
GCGCGCGAGG CCTATCTTCT GGGCGAGATC GTGAAGCCCG CGGCGGTCGA TCTTTCGGTC
GAGCCGCTGA CACTGACCCA GGCGCTGAGC CGGCAGGGCG GCATTCTCGA GCGGCGGGCG
GATGCGCGGG GGGTCTTCGT CTTCCGCGCG AACGGCGCTC CGGGCATGAA GGTGTTCCAG
CTCGATGCGC GCTCGCCCAC GGCGCTTCTC CTGGGGACAC GGTTCCTGCT GCAGCCGGGG
GATGTGGTCT ATGTGACGCG CGCGCCGCTC AGCCGCTGGA ACGACACGAT CAGCGACCTG
CTGCCCTCGG TGGGGATCAC CAGCAGCCTC GACCGGCTGG GGACGAACTG A
 
Protein sequence
MLHRLTVVLV GSLPVLAGCS EGHVRFPVTE SAQKALPENV QVIRLDAENI RSFEVPAEPH 
QATRLPAGGG WDYRIGVGDI LGITVFDHPE LMLPGGEKTA GESGFRVQGD GTVAFPYVGA
VRAKGRAPEE VREELRTRLA AFIPEPQVDV RVTAFNSQAV SVTGEVRTPN RQALTTVELT
LLDAINAAGG LAETADARRV TVRRGTSSYR VDLEGFLTAG LGSNNPVLRP GDIVTVPRRQ
AREAYLLGEI VKPAAVDLSV EPLTLTQALS RQGGILERRA DARGVFVFRA NGAPGMKVFQ
LDARSPTALL LGTRFLLQPG DVVYVTRAPL SRWNDTISDL LPSVGITSSL DRLGTN
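
Both sequences are printed in blocks of ten residues, so the spaces and line breaks need to be stripped before any analysis. The following is a minimal sketch of how that might be done, together with a plain codon-by-codon translation and a GC percentage helper. It assumes the standard amino acid assignments, which translation table 11 shares with the standard code (the two differ only in permitted start codons), and it does not handle alternative start codons. All function names are hypothetical.

```python
# Standard genetic code in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def strip_blocks(seq: str) -> str:
    """Remove the spaces and newlines that group residues in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in an unblocked sequence."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence and the matching start of the protein sequence.
first_gene_line = "ATGCTGCACC GACTGACGGT TGTTCTTGTG GGAAGCCTGC CGGTTCTCGC AGGCTGTTCG"
first_protein_blocks = "MLHRLTVVLV GSLPVLAGCS"

dna = strip_blocks(first_gene_line)
print(translate(dna) == strip_blocks(first_protein_blocks))  # True
```

The same helpers could be applied to the full gene sequence to compare against the listed protein sequence and GC content.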