Gene Rsph17029_3664 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3664 
Symbol 
ID4898640 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp764439 
End bp765479 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content75% 
IMG OID640114272 
Productglycosyl transferase, group 1 
Protein accessionYP_001045526 
Protein GI126464413 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.032207 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.177473 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGGCGG TCTTTGCCAT TCCCGGGGAT CCCGACCGCC GCTCGGGCGG CTTTCTCTAC 
GAACGCGCCC TGCTCCGCGC GCTGAACGAG AGCGGCCGCG AGGTGGCCTA CCTGCGCCTT
CCCGCGGGCT TTCCCGATCC CGATCCGGCC GAGACCGTCG AGGCGGCCGG CCTGCTTGCG
GCCGTCCCCG AGGGCGTGCC CGTGATCCTC GACGGGCTCG TGCACGGCGC GATCGAGACG
GAGGCGCTGG CGCGGATGCG CGCGCCACTC GTGGCCATGA CCCACCATCC GCTGGCGCTC
GAGACGGGTC TGCCGCCCGC CCGCGCCGCC CTCCTGCGGG CGCGGGAGCG GGCGAACCTT
GCGCTTGCCG CTCATGTGCT GGTGCCGAGC CCGCATACGG CGCGGCTCCT CGTAGAGGAG
TATGGCGTGC CCGCCGCGCG GATCACGGTG GCGCTGCCGG GCTTTCCGCC CGCCGATCCG
GTGCGCGCGC CCGTGCAGCC GCCACTGATC CTGTCGGTGG GGATCCTCGT GCCGCGCAAG
GGGCACGACG TGCTGCTCGA AGCGCTTGCG CGGATCCGGG ATCTGGACTG GCAGGCGCGC
ATCGTCGGGG CGCCGTGGTT TGCCGAGACG GCCGCGGCGC TGCAGGCGCA GCGGACCGAT
CTGGGGCTCG AGGCTCGGGT CGCCTTCACC GGCGAGCTTG GCGAGGCCGA CCTGCGCGCC
CTCTTCCGGC AGGCCACGCT CTTCGCGCTG GCCACGCGGC ACGAGGGGTA CGGCATGGTC
TTTCCCGAGG CGCTGCTGAA CGGATTGCCC ATCGTCGCCT GCGCCACGGG GGCGGTGCCC
GATACGGTGC CTGCCGATGC GGGGCTTCTG GTGCCGCCCG ACGATCCGGC CGCCTTCGCA
GCGGCGCTCC GTCGCCTGCT GGAGGAGGCC CCCACCCGCC AGCGTCTGGC CGAGGCAGCC
ACCCGTGCAG GCGGCGCCCT GCCCCGGTGG GCGGACACGG CCGCCATCGC GGGCGCCGTC
CTCGACCGGC TTGCGCGCTG A
 
Protein sequence
MRAVFAIPGD PDRRSGGFLY ERALLRALNE SGREVAYLRL PAGFPDPDPA ETVEAAGLLA 
AVPEGVPVIL DGLVHGAIET EALARMRAPL VAMTHHPLAL ETGLPPARAA LLRARERANL
ALAAHVLVPS PHTARLLVEE YGVPAARITV ALPGFPPADP VRAPVQPPLI LSVGILVPRK
GHDVLLEALA RIRDLDWQAR IVGAPWFAET AAALQAQRTD LGLEARVAFT GELGEADLRA
LFRQATLFAL ATRHEGYGMV FPEALLNGLP IVACATGAVP DTVPADAGLL VPPDDPAAFA
AALRRLLEEA PTRQRLAEAA TRAGGALPRW ADTAAIAGAV LDRLAR