Gene Saro_0449 details

Gene Information

Locus tag: Saro_0449
Symbol:
ID: 3918317
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 491483
End bp: 492766
Gene Length: 1284 bp
Protein Length: 427 aa
Translation table: 11
GC content: 68%
IMG OID: 640443178
Product: UDP-N-acetylglucosamine 1-carboxyvinyltransferase
Protein accession: YP_495731
Protein GI: 87198474
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0766] UDP-N-acetylglucosamine enolpyruvyl transferase
TIGRFAM ID: [TIGR01072] UDP-N-acetylglucosamine 1-carboxyvinyltransferase
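
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which is consistent with the listed gene length) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record above.
n_bp = gene_length(491483, 492766)
n_aa = protein_length(n_bp)
print(n_bp, "bp;", n_aa, "aa")  # prints "1284 bp; 427 aa"
```

Both results agree with the Gene Length and Protein Length fields of the record.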


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.531296
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACAAGA TCATCATTCG CGGCGGAAAG CGTCTTTCGG GCGCCGTGCC GGTCTCGGGC 
GCCAAGAACT CGGCGCTTAC CCTGCTGCCC TGCGCGCTGC TGACCGACGA GCCGGTGACG
CTGCGCAACC TGCCGCGTCT TGCCGACATC GACGGGTTCC AGCACCTGAT GAACCAGTTT
GGCGTATCCA CGTCGATTGC GGGCGCAAGG CCGGAGGATT TCGGCCGCGT GATGACCTTG
CAGGCGACGC GCCTGACGTC GACCGTCGCG CCCTATGACC TGGTGCGCAA GATGCGCGCC
TCGATCCTCG TGCTGGGCCC GATGCTGGCA CGCGCGGGCG AGGCCACGGT TTCGCTGCCC
GGCGGCTGCG CCATCGGCAA CCGCCCGATC GACCTGCATC TCAAGGCGCT GGAAGCCCTA
GGTGCGCAGA TCGAACTGGC GGCAGGCTAT GTCCGCGCGA TCGCTCCCGA CGGCGGCCTG
CCGGGCGGGC GCTATTCCTT CCCGGTGGTC TCCGTCGGCG CGACCGAGAA CGCGCTGATG
GCCGCCGTGC TGGCCAAGGG CAAGTCCACC CTGCACAATG CGGCGCGCGA GCCGGAGATC
GTCGACCTGT GCAACCTGCT GGTTGCGATG GGCGCGCAGA TCGAGGGCAT CGGCACGTCG
GACCTCACGA TCCACGGCGT GGACCGCCTC CACGGCGCGA CCTACATGGT CATGCCGGAC
CGCATCGAGG CCGGCTCCTA TGCCTGCGCA GCCGCGATCA CCGGCGGCGA AGTGATGCTG
AACGGCGCGC GCATCGAGGA CATGGAAGCG ACCGTACAGG CACTGCGCGA CGCGGGCGTC
CATGTCGAAC CGCGCAAGGG CGGGATCTAC GTGGCGGCCG ACGGTCCGCT CAAGCCGGTC
ACGATCTCGA CCGCGCCCTA TCCGGGTTTC GCGACCGACA TGCAGGCGCA GCTCATGGCG
ATGCTGTGCC TTGCCCATGG TTCCTCGGTG CTGACCGAGA CCATCTTCGA GAACCGCTAC
ATGCACGTGC CGGAACTGAA CCGCATGGGC GCCCGGATCG AGACCAAGGG CCGCACCGCA
GTAGTCCACG GGGTGGAGAA GCTGACCGGC GCCGAAGTCA TGGCGACCGA CCTTCGCGCA
TCGATGAGCC TGGTCATCGC CGGGCTTGCG GCCGAAGGCG AGACGCAGGT CCACCGCCTC
TATCACCTCG ACCGCGGCTA TGAACGGCTT GAAGAGAAGC TCTCGCTGCT TGGCGCCGAA
ATCGAGCGCG TCGGCGGGGA CTGA
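
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before it can be analysed. The sketch below shows one way to do that and to recompute a GC percentage such as the one reported in the record. The helper names are hypothetical, and how the database itself rounds or computes its GC content figure is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Only the first displayed line of the gene is used here to keep the example
# short; paste all lines of the gene sequence to check the whole gene.
first_line = "ATGGACAAGA TCATCATTCG CGGCGGAAAG CGTCTTTCGG GCGCCGTGCC GGTCTCGGGC"
seq = clean_sequence(first_line)
print(len(seq), "bases, GC =", round(gc_percent(seq), 1), "%")
```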
 
Protein sequence
MDKIIIRGGK RLSGAVPVSG AKNSALTLLP CALLTDEPVT LRNLPRLADI DGFQHLMNQF 
GVSTSIAGAR PEDFGRVMTL QATRLTSTVA PYDLVRKMRA SILVLGPMLA RAGEATVSLP
GGCAIGNRPI DLHLKALEAL GAQIELAAGY VRAIAPDGGL PGGRYSFPVV SVGATENALM
AAVLAKGKST LHNAAREPEI VDLCNLLVAM GAQIEGIGTS DLTIHGVDRL HGATYMVMPD
RIEAGSYACA AAITGGEVML NGARIEDMEA TVQALRDAGV HVEPRKGGIY VAADGPLKPV
TISTAPYPGF ATDMQAQLMA MLCLAHGSSV LTETIFENRY MHVPELNRMG ARIETKGRTA
VVHGVEKLTG AEVMATDLRA SMSLVIAGLA AEGETQVHRL YHLDRGYERL EEKLSLLGAE
IERVGGD
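
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a plain-Python illustration, not the database's own procedure. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may initiate translation; since this gene starts with ATG, alternative start codons are not handled. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons in the conventional T, C, A, G order with their amino acids
# ('*' marks a stop codon); these assignments are shared by tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon.

    Whitespace used for display is ignored. A codon containing anything
    other than A, C, G or T raises KeyError.
    """
    seq = "".join(cds.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 9 bases of the gene sequence shown above.
print(translate("ATGGACAAGA TCATCATTCG CGGCGGAAAG"))  # prints "MDKIIIRGGK"
print(translate("GGGGACTGA"))                         # prints "GD"
```

The two outputs match the beginning and the end of the protein sequence listed above.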