Gene Rsph17025_4173 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_4173 
Symbol 
ID5086345 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009430 
Strand
Start bp214172 
End bp216508 
Gene Length2337 bp 
Protein Length778 aa 
Translation table11 
GC content67% 
IMG OID640485735 
Producthypothetical protein 
Protein accessionYP_001170329 
Protein GI146280172 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID[TIGR03030] cellulose synthase catalytic subunit (UDP-forming) 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.703833 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGCCCC TGGTGGTGTT GCTGAGCGTT CCCTCCGATA CGCTCGCCCA GGGGCTGTTC 
GGCATCACCA CGATCCTGAT CGTGGCGCTT CTGAAGCCTT TCACCTGGCG CTCCACGCCG
CTCCGCTTCC TGATGCTGGC GACCGCCGGC ATGGTCGTGA TGCGCTACTG GATGTGGCGG
CTGTTCGAGA CCCTGCCCTC CTGGGAGACG CCCCTGTCGC TGACGGCGGC CGGGATGCTC
TTTGCGGTCG AGACCTATGC GATCGCGCTC TTCTTTCTCA ATGCCCTCCT GCTTGCGGAT
CCGCTGAAGC GGGGCCTGCC CGAACAGGTA CCGTCCGAGC GGCTGCCCAC GGTCGACATC
CTCGTGCCCT CCTACAACGA GCCCGTCGAT CTGCTTGCGG TGACGCTCGC CGCGGCCAGG
AACATCCGTT ACCCGCCGCA TCTGCTCAGG GTCGTGCTCT GCGATGATGG CGGCACCGAC
CAGAAATGCG CCTCCTCCGA CCCCGAAGTG GCCCGTGCCG CCCAGGAGCG GCGCCGGGTC
CTGCAGGCGC TTTGCGAGCG GCTGGGGGTG AGCTATCTCA CCCGCGAGCG CAATGTGAGC
GCCAAGGCCG GCAATCTGAA TGCGGCCCTC GAGCGAACCG GGGGGGAGTT CGTGGCGGTC
TTCGATGCCG ACCACATTCC CTCGAGCGAC TTCCTGGCCC GCACCGTGGG CTTCCTGGTC
AAGGATCCCA GGCTCTTTCT GGTCCAGACG CCGCATTTCT TCATCAACAG GGATCCGATC
CAGCGCAACC TCGGCCTGCC GGCCTCCTGC CCGGCCGAGA ACGAGATGTT CTATGCGCTC
ATCCAGCGCG GCCTCGACCG CTGGGACGGG GCCTTCTTCT GCGGCTCGGC GGCCCTTCTG
CGCCGTACGG CGCTCGAGGA GGTGGGGGGC TTTTCCGGCA AGACCATCAC CGAGGATGCC
GAGACCGCGC TCGACATCCA TGCGCGGGGC TGGAACAGCC TCTATGTCGA CCGGGCCCTG
ATCGCGGGGC TGCAGCCCGA GACCTTTTCC TCCTTCATCC GGCAGCGGGG CCGCTGGGCG
GTCGGGATGA TGCAGATCCT GCGGCTGAAG AACCCGATCT TCCGCCCCGG CCTGAGCCTC
GCGCAGCGGC TGTGCTACTA CAATTCGATC AGCTACTGGT TCTTCCCGTT GGTCCGGCTC
GTCTTCCTGC TCTCGCCGCT GCTCTATCTC TTCTTCGGGT TGCAAATCTT CGTCGCCACC
TTCGAGGAGG CCCTCGTCTA CACGCTGACC TATCTGGTGG TCAGCTTCCT CGTGCAGAAC
GCGCTCTTTT CGCGCGTGCG CTGGCCGCTG ATCTCCGAGA TCTACGAGAT TGCCCAGACC
CCCTATCTGC TGCGCGCCAC GCTGGGGGCC CTGCTGCGCC CGAAAGGGTT CCGCTTCGAG
GTGACGGCCA AGGACGAGAC CGTGGCCGAC AGTTTCCTCT CCCCGATCTA CAGGCCTCTT
CTGGCCCTCT TCCTGCTGAT GGCCGCGGGA GTCGCCGCGG CGGGGCTGCG CTGGTTCCTC
TTGCCCGACG ACCGCCAGGT GGTGCTGCTC GTCGGAGGCT GGGCGCTCTT CAACTTCCTG
ATCGCCGGGC TCAGCCTGCG CGCGGTCGTC GAGCAGCGCC AGCGCCGTGT GGCGCCCAGG
GTCGATCTGA ACGTGAAGGC GATCCTGGTG CCGGGCGAAG GCGAGACCCC TCTCGAGGTC
GAGGTGGCGG ACGCCTCGAC CTATGGCGCA CGCCTGCGCC TGCCCGATGC CGCCCGCCCG
GGGCTGCGCC TCACGATCGG ACAGGCTGTG GCGTTCAAGC CGGAGATCGC GCTCCTGCCG
CAGGTCAGCC GCATGATCCG GTGCGAACTG CGCTCCCTTC AGCACGAGGG CCCCGACCTG
TTTCTCGGGC TGCGTTTCCT GCCGGACCAG GACCCGGCCG CCCGGGAAAC GGTGGCCTGC
CTTGTCTTCT CCGATTCGAG GATCTGGGAC CGAATGCGCC GCAGGGCCTT CTCGGGACGC
GGGCTCGTGC CGGGACTGGG ATTTGTCGCC TGGCATGCGG CAACAACCAT TCCGCGCACC
GCCTACGACC TGGTCCGGCT GTCGCGGACG GTCCCCGAGG AGGAGGAGAC GGGCAAGGAC
GAAAGTCCCG CGCCCCATAT CCTGGCCTTC GGCGGAGAAC TGGCCGCACC GATCCTTCCG
GTGCCGGACA CGGGCCGGAA AGCGCGTGGC GCACGCAAAC GGCGGGCAGA GGGCCTGTCG
GGTCAGGACC TGGAACCTGC CGGCGAAAGC CCGGCCGCTG GCTGGACCGG AGGATGA
 
Protein sequence
MLPLVVLLSV PSDTLAQGLF GITTILIVAL LKPFTWRSTP LRFLMLATAG MVVMRYWMWR 
LFETLPSWET PLSLTAAGML FAVETYAIAL FFLNALLLAD PLKRGLPEQV PSERLPTVDI
LVPSYNEPVD LLAVTLAAAR NIRYPPHLLR VVLCDDGGTD QKCASSDPEV ARAAQERRRV
LQALCERLGV SYLTRERNVS AKAGNLNAAL ERTGGEFVAV FDADHIPSSD FLARTVGFLV
KDPRLFLVQT PHFFINRDPI QRNLGLPASC PAENEMFYAL IQRGLDRWDG AFFCGSAALL
RRTALEEVGG FSGKTITEDA ETALDIHARG WNSLYVDRAL IAGLQPETFS SFIRQRGRWA
VGMMQILRLK NPIFRPGLSL AQRLCYYNSI SYWFFPLVRL VFLLSPLLYL FFGLQIFVAT
FEEALVYTLT YLVVSFLVQN ALFSRVRWPL ISEIYEIAQT PYLLRATLGA LLRPKGFRFE
VTAKDETVAD SFLSPIYRPL LALFLLMAAG VAAAGLRWFL LPDDRQVVLL VGGWALFNFL
IAGLSLRAVV EQRQRRVAPR VDLNVKAILV PGEGETPLEV EVADASTYGA RLRLPDAARP
GLRLTIGQAV AFKPEIALLP QVSRMIRCEL RSLQHEGPDL FLGLRFLPDQ DPAARETVAC
LVFSDSRIWD RMRRRAFSGR GLVPGLGFVA WHAATTIPRT AYDLVRLSRT VPEEEETGKD
ESPAPHILAF GGELAAPILP VPDTGRKARG ARKRRAEGLS GQDLEPAGES PAAGWTGG