Gene Rsph17029_2355 details

Gene Information

Locus tag: Rsph17029_2355
Symbol:
ID: 4895141
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 2490430
End bp: 2491653
Gene Length: 1224 bp
Protein Length: 407 aa
Translation table: 11
GC content: 71%
IMG OID: 640112951
Product: glycosyl transferase, group 1
Protein accession: YP_001044229
Protein GI: 126463115
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
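
The gene and protein lengths listed above follow from the start and end coordinates. The short Python sketch below rechecks that arithmetic. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length; the record does not state either convention, but both are consistent with the reported numbers. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2490430
end_bp = 2491653

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not
# counted as a residue in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # 1224 bp
print(protein_length, "aa")  # 407 aa
```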


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.359383
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.146668
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGATCC TGTTCGTCCA TCAGAACTTC CCCGGCCAGT TCCTGCATCT CGCGCCCGAA 
CTGCAGGGCA GGGGCCACGA ATGCCTCGCC CTCACCGATG GCGAGAACCG GCGCGAGGCG
CCGATCCCGG TGCTGCGCTA CCGGCACGAG GCGCCCGCGC CCGACCCGCA GGCGACGCGG
CTCGGGCGGA ACTTCACCCA GATGAGCGAC CGGGGTGTGA GCGTGGCGCG GGCGGCGCTG
CAGCTCCGCG ATCAGCGCGG CTACCGTCCC GACGTGATCG TGGGCCATTC GGGCTGGGGC
GAGACCCTGT TCCTGAAGGA GGTCTGGCCC GAGGCGAAGC TGCTGATCTA TGCCGAGTTC
TACTATCGCG GCGTGGGGCA GGACGTGGGC TTCGATCCGG AGTTCGACCG GCGCGGCTTC
GACGGGGTGA TGATCGCGCA AGGCCGGGCG GCCCATCTGG GGCAGGCGCT GCTTCATGCC
GATGCGGGCC TGTCGCCCAC CGAATGGCAG GCCTCGACCT ATCCGCCCCC GCTGCGCCGG
ATGATCGAGG TGATCCACGA CGGCGTCGAT ACCGATGCCG TGGCGCCCGA TGCCTCTGCC
CGGTTCGAGC TGCCGGACGG GCGGGTGCTG CGGGCGGGCG AGGAGGTGCT GACCTTCGTC
AACCGCAATC TCGAGCCCTA CCGCGGCTAT CACATCTTCC TGCGCGCGCT GCCCGAGGTT
CTGGCCGCGC GGCCCGAGGC GCAGGTGGTG CTGGTGGGCG GCGACGGCGT GAGCTACGGC
CCCGCGCCCG CGGAGGGCAG CTGGAAGCAG CAGTTCGTGC GCGAAGTGGG GCCTCGGCTC
GACCTCTCGC GGGTGCATTT CGTGGGCCGC GTGCCCTACG ACCGGTTCAA GGCGCTGATG
CAGGTGAGCC GGGCACATGC CTATCTCACC TATCCGTTCG TCCTGTCCTG GTCGCTGCTC
GAGGCCATGT CGGCGGGCGC GCTCGTCGTG GGCTCGCGCA CGGCGCCGGT CGAGGAGCTG
ATCGAGGACG GGCGGAACGG GCTTCTGGTG GATTTCTTCG ACGGGCCGGG CTGGTCACGC
ACGCTGATCC GGGCGCTGGC CGAGCCCGAG CGGATGATGC CCTTGCGTGC CGCCGCGCGC
GCGACGATCC GGGACCGCTA CGATCTGCGC CGCATCTGCC TGCCGCGGCT GGTCGACTGG
GTGGAGCGTC ACGGTCCGCG CTGA
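
The GC content field in the record can be recomputed from the gene sequence above. The following is a minimal Python sketch, not the method the database itself uses: `gc_percent` is a hypothetical helper that ignores the spaces and line breaks in the listing and counts only A, C, G and T.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C among the A/C/G/T bases in seq.

    Whitespace and any other characters are ignored, so the sequence
    can be pasted in exactly as it is laid out in the record.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# Example with the first ten bases of the gene sequence only.
print(gc_percent("ATGAAGATCC"))  # 40.0
```

Passing the full gene sequence to `gc_percent` and rounding the result gives a value that can be compared with the GC content listed in the record.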
 
Protein sequence
MKILFVHQNF PGQFLHLAPE LQGRGHECLA LTDGENRREA PIPVLRYRHE APAPDPQATR 
LGRNFTQMSD RGVSVARAAL QLRDQRGYRP DVIVGHSGWG ETLFLKEVWP EAKLLIYAEF
YYRGVGQDVG FDPEFDRRGF DGVMIAQGRA AHLGQALLHA DAGLSPTEWQ ASTYPPPLRR
MIEVIHDGVD TDAVAPDASA RFELPDGRVL RAGEEVLTFV NRNLEPYRGY HIFLRALPEV
LAARPEAQVV LVGGDGVSYG PAPAEGSWKQ QFVREVGPRL DLSRVHFVGR VPYDRFKALM
QVSRAHAYLT YPFVLSWSLL EAMSAGALVV GSRTAPVEEL IEDGRNGLLV DFFDGPGWSR
TLIRALAEPE RMMPLRAAAR ATIRDRYDLR RICLPRLVDW VERHGPR