Gene Rsph17029_2170 details

Gene Information

Locus tag: Rsph17029_2170
Symbol:
ID: 4897616
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 2300035
End bp: 2301135
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 74%
IMG OID: 640112764
Product: glycosyl transferase, group 1
Protein accession: YP_001044045
Protein GI: 126462931
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
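
The coordinate and length fields above are related by simple arithmetic. The following Python sketch shows that relationship under two assumptions: the start and end positions are 1-based and inclusive, and the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2300035, 2301135)
print(length, "bp")
print(protein_length_aa(length), "aa")
```

With the values in this record, the sketch reproduces the listed gene length of 1101 bp and protein length of 366 aa.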


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0477227
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAGCGGGG CGCGGCTGAC CATCCTCATG ACGGTCGATG CCGTGGGCGG CGTCTGGCGC 
TACGCGATGG ACCTCGCGGC CGGGCTGCGG GGGCAGGTGG ATGTGGTCTT CGCGGGCTTC
GGCCCCGAAC CGTCCGAGGC GCAGCGGCGC GAGGCCGAGG CGCTGGGTCC GCTCGACTGG
TGCGATGCGC CGCTCGACTG GCTGGTGGGC GGCGAATCCG AGCTTGCCGT GGTGCCGAAG
ATGATCGCGG GCGTCGCCCG GCGCCATCGG GTGGATCTGA TCCACCTGAA TCTGCCGTCG
CAGGCGGCGG GTCTGTCGGT GCCGGTGCCG GTGCTGGCGG TCTCGCATTC CTGCGTCGTG
ACCTGGTTCG CGGCAGTGCG CGACGGCGTA CTGCCCGCGG GGTGGCTGTG GCAGCGGCGG
CTGAACCGGC AGGGCCTTGC CGCGGCCGAT GTGGTGGTCA CGCCCACCCG CGCGCAGGCC
GACCTGATGG CGCGGTCCTA CGGGCCGATG CCCGAGGTGC GGGTGGTGGC CAATGCCAGT
CGCGTCGCGG CCCCCGCGCG GCGGATGGCG CGGCCGATGG TGCTGTCCGC GGGGCGCTGG
TGGGACGAGG GCAAGAATGC CGCCGTGCTC GACGCGGCGG CCCCGCTGAT CGACTGGCCG
GTGGTGATGG CCGGCGCTGC CGCCTCGCCA AAGGGACAGG CCGTGGCGAT CCGGGCGGCC
GAGGCCCGTG GCGAGATCAG CCATGCCGAG ATGCTCGAAC TGATGTGCGA GGCCTCGATC
TTCGTCTCGC CCTCGCGCTA CGAGCCCTTC GGTCTGGCCG TCCTCGAGGC CGCGCGGGGC
GGGCTGCCGC TCGTCCTGTC GGACATCCCC ACCTTCCGCG AACTCTGGGA CGGGGCGGCC
GTCTTCTTTC CGCCCGAGGA TCCGATGGCG CTGGCCGAGG CGGTCAACCG GCTCATCCGC
GACCCGGCCC GTCGCCGCAG GCTGGGACAG GCCGCGCAGG CCCGCGCCGC CCTCTACACG
CCCGAGCGGC AGGCGCGCGC CATGGCCGCC ATCTATGCCG AGCTCTGCCC CATTCCCGAA
ACTCTCCGCG CCGCGAGGTG A
 
Protein sequence
MSGARLTILM TVDAVGGVWR YAMDLAAGLR GQVDVVFAGF GPEPSEAQRR EAEALGPLDW 
CDAPLDWLVG GESELAVVPK MIAGVARRHR VDLIHLNLPS QAAGLSVPVP VLAVSHSCVV
TWFAAVRDGV LPAGWLWQRR LNRQGLAAAD VVVTPTRAQA DLMARSYGPM PEVRVVANAS
RVAAPARRMA RPMVLSAGRW WDEGKNAAVL DAAAPLIDWP VVMAGAAASP KGQAVAIRAA
EARGEISHAE MLELMCEASI FVSPSRYEPF GLAVLEAARG GLPLVLSDIP TFRELWDGAA
VFFPPEDPMA LAEAVNRLIR DPARRRRLGQ AAQARAALYT PERQARAMAA IYAELCPIPE
TLRAAR
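
The gene and protein sequences above can be checked against each other by translating the DNA codon by codon. The Python sketch below is one way to do that with the standard library only. It assumes that the amino acid assignments of translation table 11 match the standard genetic code (the two tables are understood to differ in permitted start codons, not in codon meanings), and it ignores the spaces used to group the sequence into blocks of ten. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Assumption: table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks and the final line of the gene sequence in the record.
print(translate("ATGAGCGGGG CGCGGCTGAC CATCCTCATG"))
print(translate("ACTCTCCGCG CCGCGAGGTG A"))
```

The first call yields the opening residues of the protein sequence and the second yields its final six residues. Passing the full gene sequence to `translate` and `gc_fraction` would allow the listed protein sequence and GC content to be compared in the same way.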