Gene Rsph17029_2286 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_2286 
Symbol 
ID4897904 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp2415986 
End bp2417254 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content70% 
IMG OID640112881 
ProductUDP-N-acetylglucosamine 1-carboxyvinyltransferase 
Protein accessionYP_001044160 
Protein GI126463046 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0766] UDP-N-acetylglucosamine enolpyruvyl transferase 
TIGRFAM ID[TIGR01072] UDP-N-acetylglucosamine 1-carboxyvinyltransferase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.261062 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATTCGA TTCTCGTAAA GGGCAATGGC GAGCTGCGCG GGCAGATCCC GATCGCGGGG 
GCGAAGAATG CCTGTCTGGC GCTGATGCCG GCCACGCTCC TGTCGGACGA ACCGCTGACG
CTGACCAATG CGCCGCGGCT GTCGGACATC CGCACGATGA CGCAGCTTCT GCAGTCGCTC
GGCGCGGAGG TGGCGAGCCT GCAGGGCGGG CAGGTGCTGG CGCTCTCGTC GCATGCGCTG
ACCGACCATC GGGCGGACTA CGACATCGTG CGGAAGATGC GCGCCTCGAT CCTCGTGCTG
GGGCCGATGC TCGCGCGCGA CGGCCATGCG GTCGTGTCGC TGCCCGGCGG CTGCGCCATC
GGTGCGCGGC CGGTGGATCT GCATCTGAAG GCGCTCGAGG CGATGGGGGC CGAGCTCGAC
CTGCGCGACG GCTATATCCA CGCCAAGGCC CCGGCGGGCG GGCTGAAGGG CGCGCGGGTG
GTCTTTCCGC TCGTCTCGGT CGGCGCGACC GAGAATGCGC TGATGGCCGC GACCCTCGCC
AAGGGCACGA CCGTGCTCGA GAATGCCGCG CGCGAGCCCG AGATCGTCGA TCTGGCCCGC
TGCCTGCGTC GGATGGGGGC CCAGATCGAG GGCGAGGGCT CCTCGACCAT CACCATCGAG
GGCGTGGACC GGCTGGGCGG GGCCACGCAC CCCGTCGTCA CCGACCGGAT CGAGCTCGGC
ACCTACATGC TCGCGCCCGC GATCTGCGGC GGCGAGGTCG AGCTTCTGGG TGGGCGGATC
GAGCTGGTCG GCGCCTTCTG CGAGAAGCTC GACGCGGCCG GCATCTCGGT CGAGGAGACC
GAGCGCGGGC TGCGCGTGGC GCGCAGGAAC GGCCGGGTGA AGGCCGTCGA TGTGATGACC
GAGCCCTTCC CGGGTTTTCC CACCGACCTG CAGGCGCAGA TGATGGCGCT TCTCTGCACC
GCCGAGGGCA CTTCCGTGCT CGAGGAGAGG ATCTTCGAGA ACCGCTTCAT GCATGCGCCG
GAGCTGATCC GGATGGGGGC GCGGATCGAG GTGCACGGCG GCACGGCCAC GGTGACGGGC
GTCGAGAAGC TGCGCGGGGC GCCGGTGATG GCCACCGACC TGCGCGCCTC GGTGAGCCTG
ATCCTCGCCG GGCTCGCCGC CGAGGGCGAG ACCATCGTGA GCCGGGTCTA CCACCTCGAT
CGCGGCTATG AGAGGGTGGA AGAGAAGCTG AGCGCCTGCG GCGCGCAGAT CAGAAGGATC
CCCGGCTGA
 
Protein sequence
MDSILVKGNG ELRGQIPIAG AKNACLALMP ATLLSDEPLT LTNAPRLSDI RTMTQLLQSL 
GAEVASLQGG QVLALSSHAL TDHRADYDIV RKMRASILVL GPMLARDGHA VVSLPGGCAI
GARPVDLHLK ALEAMGAELD LRDGYIHAKA PAGGLKGARV VFPLVSVGAT ENALMAATLA
KGTTVLENAA REPEIVDLAR CLRRMGAQIE GEGSSTITIE GVDRLGGATH PVVTDRIELG
TYMLAPAICG GEVELLGGRI ELVGAFCEKL DAAGISVEET ERGLRVARRN GRVKAVDVMT
EPFPGFPTDL QAQMMALLCT AEGTSVLEER IFENRFMHAP ELIRMGARIE VHGGTATVTG
VEKLRGAPVM ATDLRASVSL ILAGLAAEGE TIVSRVYHLD RGYERVEEKL SACGAQIRRI
PG