Gene Rsph17025_4225 details

Gene Information

Locus tag: Rsph17025_4225
Symbol:
ID: 5086396
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17025
Kingdom: Bacteria
Replicon accession: NC_009430
Strand:
Start bp: 266611
End bp: 267942
Gene Length: 1332 bp
Protein Length: 443 aa
Translation table: 11
GC content: 65%
IMG OID: 640485786
Product: hypothetical protein
Protein accession: YP_001170380
Protein GI: 146280223
COG category: [R] General function prediction only
COG ID: [COG3550] Uncharacterized protein related to capsule biosynthesis enzymes
TIGRFAM ID: [TIGR03071] HipA N-terminal domain
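
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 266611
end_bp = 267942

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and ends in a stop codon,
# which is not translated into an amino acid.
codon_count, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codon_count - 1

print(gene_length_bp, remainder, protein_length_aa)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.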


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value: 0.43643
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGACGGC CGCCGGTTGT CAGGCGCCTT GATGTCCGCA TCAACGGGCG GCTGGCAGGA 
GAGTATCGCT TCACTCCTGC AGGCGGGGTC AGCTTCGCCT ACGATGCCGG CTGGCTGGCC
TGGGAGTTCG CTTTCCCCAT TTCGCGCCAG CTTCCGCTGA GGAGCGGCGC CCAGTCCGGT
ACCCATGTCA ATGCCGTCTT CGAAAACCTG CTGCCGGATA ACCCGGACCT GCGCCGCAGG
ATCGCCGAGC GTGCGGAGGC GCGCAGTGAC CGACCCCATG ATCTTCTGGC CGCCATCGGG
CGCGACTGCA TCGGAGCCAT GCAGTTCCTT CCGCATGGCG CCGATCCTGG CGACCCTTTT
CGGGTTGACG GTGTGCCCCA GGCGGAGGCG CAGATTGCGG CCGCCATCCG GGATCTGGCC
GAGTGGCCCC TGGGTATCCG CGCGGAGGAT CCCTTCCGCA TCTCCTTGGC GGGTGCGCAG
GAAAAGACCG CCTTTCTCTG GAAGGATGAC ACGTGGCTGA AGCCCGCCGG GCTGACACCC
ACCACGCATA TCTTCAAGCG TCGGATGGGC ATCGTTTCGC ACGGCATCGA CATGACCGAC
AGCGTCGAGA ACGAATGGCT CTGCCTGAAG CTTGCCGCAG CTCTTGGTCT GCCGGTAAAC
GAGGCCAGGA TCGAGACCTT CGAGGACCAG ACCGTTCTGG TTGTCACCCG TTTCGACCGC
ACGCCCCGCG CGAAGGGCGG TATCCTGCGC CTGCCGCAGG AGGATTTCCT GCAGGCGCTT
GGCTTCGAGT CCGGCCAGAA GTACCAGGAA CATGGGGGGC CCGGGATGCA GGATGGTTTG
CGCCTGCTGG AGGGCTCGAG CGAGCGCGCC GCCGATCAGC TTCTGTTCCT GAAGGCCCAG
ATCGTGAACT GGATACTTGC CGCCATCGAT GGGCATGCCA AGAACTACTC CCTGTTCCTG
GGGCCGGGTG GATTCAGGAT GACCCCGCTC TACGATATCG TGAGCGCCGC GCCTGCCATG
GCAAACGGCG CCTTCCGCAA CAGGGAACTG CGTCTTGCCA TGTCGGTTGG CCGACGTCGG
CATTATCGGC TGGATCAGAT CCGGCCGCGC CACTTCGAGG AGACATCCGA CCGCGCCCGC
GTACCCTCCG ACATCCGGCG TCGTGCCTTC GTCGACCTCG TGGAAACAGG GCTGGCGGCA
TTCGAAGAGG TTGCCAATGC CCTCCCCGCG GGATTTCCGG ATCGCGTGGC GGGGCCGATC
ATCGATCATG CCAGGGACCG CATGTCTCTT CTGACGGCCC GGTGCGGCGC CGGCTTGATC
GAAGGCCCAT AG
 
Protein sequence
MGRPPVVRRL DVRINGRLAG EYRFTPAGGV SFAYDAGWLA WEFAFPISRQ LPLRSGAQSG 
THVNAVFENL LPDNPDLRRR IAERAEARSD RPHDLLAAIG RDCIGAMQFL PHGADPGDPF
RVDGVPQAEA QIAAAIRDLA EWPLGIRAED PFRISLAGAQ EKTAFLWKDD TWLKPAGLTP
TTHIFKRRMG IVSHGIDMTD SVENEWLCLK LAAALGLPVN EARIETFEDQ TVLVVTRFDR
TPRAKGGILR LPQEDFLQAL GFESGQKYQE HGGPGMQDGL RLLEGSSERA ADQLLFLKAQ
IVNWILAAID GHAKNYSLFL GPGGFRMTPL YDIVSAAPAM ANGAFRNREL RLAMSVGRRR
HYRLDQIRPR HFEETSDRAR VPSDIRRRAF VDLVETGLAA FEEVANALPA GFPDRVAGPI
IDHARDRMSL LTARCGAGLI EGP