Gene SeD_A3666 details

Gene Information

Locus tag: SeD_A3666
Symbol: murA
ID: 6872105
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 3515729
End bp: 3516988
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 58%
IMG OID: 642786644
Product: UDP-N-acetylglucosamine 1-carboxyvinyltransferase
Protein accession: YP_002217278
Protein GI: 198243198
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0766] UDP-N-acetylglucosamine enolpyruvyl transferase
TIGRFAM ID: [TIGR01072] UDP-N-acetylglucosamine 1-carboxyvinyltransferase
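
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which is consistent with the listed gene length) and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3515729
END_BP = 3516988


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return abs(end_bp - start_bp) + 1


def protein_length(length_bp: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1260
print(protein_length(length_bp))  # 419
```

Both results agree with the Gene Length (1260 bp) and Protein Length (419 aa) fields.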


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.003897
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 53
Fosmid unclonability p-value: 0.0724542
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATAAAT TTCGTGTACA GGGGCCAACG ACGCTCCAGG GCGAAGTCAC AATTTCCGGC 
GCCAAAAACG CCGCGCTGCC GATCCTGTTC GCCGCTCTGC TGGCAGAAGA ACCGGTAGAG
ATCCAAAACG TTCCAAAACT GAAAGACGTG GATACGTCGA TGAAGCTCCT GAGCCAATTA
GGCGCGAAAG TGGAACGCAA TGGATCGGTA CACATTGACG CCAGCCAGGT GAACGTGTTC
TGCGCGCCTT ATGATCTGGT TAAGACCATG CGCGCCTCGA TATGGGCGCT GGGGCCGCTA
GTTGCCCGTT TTGGTCAGGG ACAAGTCTCT CTGCCGGGGG GCTGTACTAT CGGCGCGCGT
CCGGTTGATT TGCATATTAC CGGTCTGGAA CAACTGGGTG CGACCATCAA GCTGGAAGAG
GGCTACGTAA AGGCCTCCGT CGAAGGACGT CTTAAAGGGG CGCATATCGT GATGGATAAG
GTCAGCGTCG GCGCGACCGT CACCATTATG TGCGCCGCCA CGCTGGCGGA AGGGACGACC
ATCATTGAAA ACGCCGCGCG CGAGCCGGAA ATTGTCGACA CCGCTAACTT CCTGGTTACG
CTGGGGGCGA AGATCGCCGG GCAGGGCACC GATCGTATTA CGATCGAGGG CGTAGAGCGT
CTGGGCGGCG GCGTTTATCG TGTGCTGCCG GATCGTATCG AAACGGGGAC TTTCCTGGTG
GCGGCAGCGA TCTCTCGCGG CAAAATCCTC TGCCGTAACG CGCAACCGGA CACTCTGGAC
GCAGTACTGG CGAAACTGCG CGATGCTGGC GCGGATATCG AAGTGGGCGA GGACTGGATT
AGTCTGGACA TGCACGGCAA ACGACCGAAG GCGGTCAATG TCCGTACCGC GCCGCACCCG
GCATTCCCGA CCGATATGCA GGCGCAGTTC ACGTTGCTGA ACCTGGTGGC GGAAGGGACG
GGCTTTATCA CTGAAACTGT CTTTGAAAAC CGTTTTATGC ACGTGCCTGA ACTCAGCCGC
ATGGGCGCGC GTGCGGAAAT TGAAAGCAAT ACCGTCATAT GCCATGGCAT AGAAACGCTC
TCTGGCGCCC AGGTTATGGC GACAGATCTG CGGGCATCTG CGAGCCTGGT GCTGGCGGGC
TGTATTGCGG AAGGCACGAC GATTGTGGAT CGCATTTATC ACATCGATCG CGGTTATGAG
CGCATCGAAG ACAAACTGCG CGCGCTAGGC GCGAATATTG AACGCGTGAA AGGCGAATAA
 
Protein sequence
MDKFRVQGPT TLQGEVTISG AKNAALPILF AALLAEEPVE IQNVPKLKDV DTSMKLLSQL 
GAKVERNGSV HIDASQVNVF CAPYDLVKTM RASIWALGPL VARFGQGQVS LPGGCTIGAR
PVDLHITGLE QLGATIKLEE GYVKASVEGR LKGAHIVMDK VSVGATVTIM CAATLAEGTT
IIENAAREPE IVDTANFLVT LGAKIAGQGT DRITIEGVER LGGGVYRVLP DRIETGTFLV
AAAISRGKIL CRNAQPDTLD AVLAKLRDAG ADIEVGEDWI SLDMHGKRPK AVNVRTAPHP
AFPTDMQAQF TLLNLVAEGT GFITETVFEN RFMHVPELSR MGARAEIESN TVICHGIETL
SGAQVMATDL RASASLVLAG CIAEGTTIVD RIYHIDRGYE RIEDKLRALG ANIERVKGE
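
The sequences above are printed in blocks of ten characters, so the spaces and line breaks need to be stripped before any analysis. The sketch below shows one way to do that, compute a GC fraction, and translate the nucleotide sequence. It assumes the amino acid assignments of translation table 11 are the same as those of the standard genetic code (the two tables differ in which start codons are permitted, not in codon meanings). The helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical.

```python
# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First two blocks of the gene sequence listed in this record.
first_blocks = clean_sequence("ATGGATAAAT TTCGTGTACA")
print(translate(first_blocks))  # MDKFRV
```

Passing the full gene sequence through `clean_sequence` and `translate` should reproduce the protein sequence listed in the record, and `gc_fraction` on the cleaned gene can be compared with the GC content field.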