Gene Rsph17029_3693 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3693 
Symbol 
ID4898199 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp803212 
End bp804327 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content74% 
IMG OID640114301 
Productglycosyl transferase, group 1 
Protein accessionYP_001045555 
Protein GI126464442 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.419352 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGGTG ATCCGGTCAT CGCCCGCCAC TACGGGCCGG GCGGGCGCGA GCATGGCGGC 
GGCATCGGCC GGCTGATCGG CTATGTGGTG GATGAGGCGG CGCGGCGCGG CGAGCGGCAT
CTCGTCACCG ACACCCGCGG CGAGCGCCTC TCCCCCCGTT CGGCCCTGCG GTTCGCGGGC
GCGATGGGGC GCATGGCGCT CGACCGGGCG ACGGCGCCCG ACCGGATCGC CCATATCCAC
ATGGCCGGAC GCGGCAGCAC GGTCCGCAAA ATCCTGCTCT GCGGCTGGGC GCGCACCCTC
GGATGCCGCC ATGTGCTGCA TCTGCACGAT TACCATTATG CCGCCGACTA CGAGGCGCGG
CCGGGCTGGC AGCGGAGTCT GGTGCGCGCC ATGTTCGCCG GCGCCGACGC GGCGGTGGTG
CTGGGCGACC CGGACCGCCG CCTCGCGGTG CAGAGGCTTC AGGCCGATCC CCACCGCGTC
GTGGTCCTGC ACAATGCGGT GCCCGATCCG GGCGAGCGGC CCGCCCCGCC CCCCGGGCCG
CCCTGCATCC TCTTTCTCGG CCGCCTGAGC GAGCGCAAGG GCGTGCCCGA ACTGCTTCAG
GCGCTGGCCC GTCCGGGCAT GGCCTCGCTG CCCTGGCGGG CGGTGCTGGC GGGCGACGGC
CCGGTCGAGG ACTACCGCCG TCAGGCCGAG GCCCTCGGTC TGGCCGGCCG GATCGAAATG
CCGGGCTGGC TCGACCGCCC GGCCACCGAG GCCCTGTGCC GGCAGGCCGA TATCCTCGTG
CTGCCCTCGC ACGCCGAAGG CATGTCGATG GCGGTGCTGG AAGGCATGGC CCACGGTCTC
GCCGTCGTGA CCACGCCCGT CGGCTCGCAT CCCGAGGTGC TGCGCGACGG GGACAGCGGG
CTCTTCGTGA AGCCCGGCGA CGTGCAGGCG CTGGCCGAGG CGCTCGACCG GCTTCTCAGC
GCACCCGAGC TGCGCCGCGC CCTCGGCGCC CGCGCGCGGG CGCGGTTCCT ATCGGATTTC
AGCATGGCGG CCTACGGACG GCAGCTCGAT CGCCTCTATG CGGCGATCGG AGCCGAGCGC
GCTCCCGGCT CCGCAGGGGA AGGACAACGA CCGTGA
 
Protein sequence
MQGDPVIARH YGPGGREHGG GIGRLIGYVV DEAARRGERH LVTDTRGERL SPRSALRFAG 
AMGRMALDRA TAPDRIAHIH MAGRGSTVRK ILLCGWARTL GCRHVLHLHD YHYAADYEAR
PGWQRSLVRA MFAGADAAVV LGDPDRRLAV QRLQADPHRV VVLHNAVPDP GERPAPPPGP
PCILFLGRLS ERKGVPELLQ ALARPGMASL PWRAVLAGDG PVEDYRRQAE ALGLAGRIEM
PGWLDRPATE ALCRQADILV LPSHAEGMSM AVLEGMAHGL AVVTTPVGSH PEVLRDGDSG
LFVKPGDVQA LAEALDRLLS APELRRALGA RARARFLSDF SMAAYGRQLD RLYAAIGAER
APGSAGEGQR P