Gene CPR_2159 details

Gene Information

Locus tag: CPR_2159
Symbol: murA
ID: 4206277
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008262
Strand:
Start bp: 2386312
End bp: 2387577
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 32%
IMG OID: 642566709
Product: UDP-N-acetylglucosamine 1-carboxyvinyltransferase
Protein accession: YP_699459
Protein GI: 110803637
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0766] UDP-N-acetylglucosamine enolpyruvyl transferase
TIGRFAM ID: [TIGR01072] UDP-N-acetylglucosamine 1-carboxyvinyltransferase
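
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the reported gene length) and that the CDS ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 2386312
END_BP = 2387577


def feature_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_length: int) -> int:
    """Residues in a complete CDS: number of codons minus the stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


gene_len = feature_length(START_BP, END_BP)
print(gene_len)                  # 1266
print(protein_length(gene_len))  # 421
```

Both results agree with the Gene Length and Protein Length fields of this record.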


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGTAAAA TAGTAGTACA AGGTGGAAAG AAATTAAAAG GAGAAGTAAA TATAAATACA 
GCTAAGAATT CAGTTTTACC AATAATTGCA GGAAGTATAT TAGCTACTGA TGGTGTACTA
ATAAATGAAT TACCAATGTT ACAAGATGTT TTTACAATTT GTAATGTTAT AGAGCAATTA
GGATATGATC TTAAAATAGA TAAGAAAGAA AACAAATTAA TTGTACCACC ACTAAATAAA
AATCCTCTAA TTCCAAGTGA GTATTTAGTA AAAAAAATGA GAGCTTCATT TTTAATAATG
GGGCCAATGA TAGCTAAATA TGGGGAGTTT AAATTAGCTA GGCCAGGTGG ATGTAATATA
GGTTCTAGGC CTATAGAATT ACACCTGAAA GGACTTAGAG CGCTAGGTGC TGAAAATACC
AATTGTGGAA ATGGATTTGT ATGTATAAAG GCAAAAAAAT TAACAGGAAG TAAAATATAT
TTAGACTTTC CATCAGTTGG AGCAACAGAG AATATAATGA TGGCAGCAAC TATGGCTAAA
GGGACTACGG TAATTGAAAA TGCAGCTCAG GAGCCAGAAA TAACTGATTT AATTAATTTT
TTAAATTCTA TGGGAGCTAA AATTTATATT GAAAAACCGG GTGAAATAAT AATAGAAGGT
GTTGATTCAT TAACTTCAAC TGAGTACACT CCTATATATG ATAGAATAGA AGCTGGAACA
TTTATGGTTG CAGCAGCAAT AACAGGATCA GAAATAAAAA TAAATGGAGT AAATAAAGAT
CATTGTTCAG CTATAATATC TAAGTTAAAA GAAGCTGGAA CAGAGTTTTT TGACGTCCAT
AATAATGAAA ATAGTATAAT AGTAAAGGGT AATGAAGAAA TAAAGCCAAT TAATATAAAA
ACAATGCCTT ATCCAGGATA TCCTACTGAT ATGCAATCAC AAATGATGAG TTTATTGAGC
ATAGCAAAAG GAAGCAGTAT CATAACTGAA AGTGTTTTTG AGAATAGATT TATGAATGTA
GATGAATTAA GACGTATGGG TGCAAACATA CAAATAGAAG GAAGAACGGC TTTAATTGAA
GGAGTAGATA ACCTAACTGG ATGTGAAGTT AAGGCAACTG ATTTAAGGGC AGGAGCTGCT
TTGATTTTGG CTGGACTTGT AGCAAAAGGA GAGACAATAG TTACTGATAT ATATCATATA
GATAGAGGTT ATGTTGAAAT AGAAAATAAG TTCAGAGCCT TAGGCGCTGA CATAAGTAGA
ATATAA
 
Protein sequence
MGKIVVQGGK KLKGEVNINT AKNSVLPIIA GSILATDGVL INELPMLQDV FTICNVIEQL 
GYDLKIDKKE NKLIVPPLNK NPLIPSEYLV KKMRASFLIM GPMIAKYGEF KLARPGGCNI
GSRPIELHLK GLRALGAENT NCGNGFVCIK AKKLTGSKIY LDFPSVGATE NIMMAATMAK
GTTVIENAAQ EPEITDLINF LNSMGAKIYI EKPGEIIIEG VDSLTSTEYT PIYDRIEAGT
FMVAAAITGS EIKINGVNKD HCSAIISKLK EAGTEFFDVH NNENSIIVKG NEEIKPINIK
TMPYPGYPTD MQSQMMSLLS IAKGSSIITE SVFENRFMNV DELRRMGANI QIEGRTALIE
GVDNLTGCEV KATDLRAGAA LILAGLVAKG ETIVTDIYHI DRGYVEIENK FRALGADISR
I
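
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It is not the database's own tooling. It assumes the sequence is given 5' to 3' on the coding strand, and it builds the codon table from the standard NCBI amino-acid string, whose assignments translation table 11 shares with the standard code (table 11 differs in its permitted start codons, which does not matter here because the gene starts with ATG). The helper names `clean`, `gc_fraction` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (NCBI layout); '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence in tens."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty sequence)."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 60 bases of the gene sequence, as printed in the record.
first_line = "ATGGGTAAAA TAGTAGTACA AGGTGGAAAG AAATTAAAAG GAGAAGTAAA TATAAATACA"
print(translate(first_line))  # MGKIVVQGGKKLKGEVNINT
```

The output matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.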