Gene Rsph17029_2994 details


Gene Information

Locus tag: Rsph17029_2994
Symbol:
ID: 4899017
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009050
Strand:
Start bp: 1624
End bp: 2730
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 71%
IMG OID: 640113596
Product: glycosyl transferase, group 1
Protein accession: YP_001044867
Protein GI: 126463754
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
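
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the gene record above.
START_BP = 1624
END_BP = 2730
GENE_LENGTH_BP = 1107
PROTEIN_LENGTH_AA = 368


def span_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Compare the derived values with the ones reported in the record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 2730 - 1624 + 1 gives the reported gene length of 1107 bp, and 1107 / 3 - 1 gives the reported protein length of 368 aa.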


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.767293
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCAGAT CGGAACTGTC CCGCCTTGCC GTTGTCGTCA GCGGCTCCGT CCCTCCGGAC 
ACCCGGACCC GGATCGACCG GGGCGCAAAT CCCCGGCTCG ACTTCACCCA CCTTCAGAGC
CTCGGCGCCA CGATCTTCAC CCGGGATTCG GCGCCCGAGG GCCGGGCCTT TCTCCGCCGA
ACGCTGGGCG AGCGGTTCGC CCCGGCCGAA GCCGTGGCCG AGGCGGCCGA CCGGTTCGAC
GCGATCTTCT GCGTGGCCGA AGACATCGGC GTGCCGGTGG CGCTGGCCCT GCGGCTCCGC
GGCAAGCGGA CCCCGCTGCT CGTGGGGGTG CACGGACACT ACCTCGTCAA CCGCAAGTTC
CGGCTCTGGG CGCTGGCCGC GCGCCACGAT GCGGCCACCC GCTTCCTGCC GCTGTCCGAG
CCGATCCGGG CGCGGCTGAT CGCCGAATTC GGCATTCCGG CCAGCCGCTG CCACACGCTC
TGCGTACCGA TCGACACCCG CTTCTTCGCG CCCGAGCCCG CGCCCGAGGC CGATCCGCCG
ATGATCCTGA GCGCGGGCGC CGCACAGCGC GATTATCCCA CGCTCATCGC CGTGATGGAG
GACGTGCCGG CGCGCTTCCG CATCGCCTCG GGGTCGAGCT GGATCGGCGA GGCCACGAAG
CTCGCCGTGC CCGAGACCTG CACGATGGGC AGTGCGGGCT CGATGCCGGG GCTGCGCGCG
CTCTATGCCG CCGCCGCCAT GGTGGTGCTG CCGCTGCAGG ATGTGGTTCA TGCCAGCGGC
TATGCGGTGG CGATGGAGGC CATGGCCATG GGCAAGGCCC TGATCGTGAC GCGCACCGAG
GCTCCGGCCG ATTTCTTCCT CGACGGCGAA ACCTGCCTGC TCGTACCGCC GGGCGACCCG
GCCGCGCTGC GCTCGGCGAT CCTGCGTCTT CTCGAAAATG CCGACCTCCG CATGCGGCTG
GGCCGTGCAG CGCGGCATCT GATGGAGGAG CGCTACGGGA TGGAGAGCTA CACGGCCGAT
CTCGCGCGGC TTCTGACGGA TGTAAGCCGC CCGCCGGCGC AGGCACAGGA CCCGGGCCAC
TGGGTCCGGC GGCCCCGGGG CGGCTGA
 
Protein sequence
MTRSELSRLA VVVSGSVPPD TRTRIDRGAN PRLDFTHLQS LGATIFTRDS APEGRAFLRR 
TLGERFAPAE AVAEAADRFD AIFCVAEDIG VPVALALRLR GKRTPLLVGV HGHYLVNRKF
RLWALAARHD AATRFLPLSE PIRARLIAEF GIPASRCHTL CVPIDTRFFA PEPAPEADPP
MILSAGAAQR DYPTLIAVME DVPARFRIAS GSSWIGEATK LAVPETCTMG SAGSMPGLRA
LYAAAAMVVL PLQDVVHASG YAVAMEAMAM GKALIVTRTE APADFFLDGE TCLLVPPGDP
AALRSAILRL LENADLRMRL GRAARHLMEE RYGMESYTAD LARLLTDVSR PPAQAQDPGH
WVRRPRGG
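
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch, in plain Python with no external libraries, of how such a block-formatted sequence could be cleaned, its GC content computed, and the CDS translated. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in permitted start codons, which this sketch does not handle). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# NCBI-style codon ordering: bases T, C, A, G at each of three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    gc = sum(1 for base in s if base in "GC")
    return 100.0 * gc / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Assumption: the sequence starts in frame; unknown codons become 'X'.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE.get(s[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three ten-base blocks of the gene sequence above.
    first_blocks = "ATGACCAGAT CGGAACTGTC CCGCCTTGCC"
    print(translate(first_blocks))
    print(gc_content(first_blocks))
```

Applied to the first three blocks of the gene sequence, `translate` yields `MTRSELSRLA`, which matches the first block of the protein sequence. Pasting the full gene sequence into the same functions would allow the listed protein sequence and GC content to be checked in the same way.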