Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPR_2144 |
Symbol | nagA |
ID | 4204116 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | + |
Start bp | 2377874 |
End bp | 2379007 |
Gene Length | 1134 bp |
Protein Length | 377 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 642566694 |
Product | N-acetylglucosamine-6-phosphate deacetylase |
Protein accession | YP_699451 |
Protein GI | 110801541 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1820] N-acetylglucosamine-6-phosphate deacetylase |
TIGRFAM ID | [TIGR00221] N-acetylglucosamine-6-phosphate deacetylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTAATAA AAGACTGTAA TGTAGTATAT TTAGATAAGA TTGAAAAAGG AAATATACTT ATAGAAAACG GAAAGATAAA GGCTATAAAT CCTAAAGAAT GTAATTGTGA AATAATTGAT GGAGAGGGCT TATTCCTATC TCCAGGATTT ATAGATGTTC ATATACACGG TGCTGGTGGT CACGATACTA TGGATGGTAC TTATGAAGCT ATAAATGAAA TATCAAAAGT TATAGTTAAA CATGGTACTA CTTCATTTTT ACCAACAACA ATGACTGTAG CTGCTGAAGA CGTATGCAAA TCAATGGAAG CTATTCATAA AGCAAAAACT AAAGGAACAG ATGGAGCTAA TGTTTTAGGT GCTCACTTAG AAGGACCATT TATAAGCCCA AGTGCAATAG GTGCTCAAAA TCCTGATTTC TTAATCCCTC CAACAAAAGA AAACTTCTAT AAATTAGTTG GAGAACATGA AGATGATGTT GTTTCAATAA CTCTTGCTCC TGAAGTAGAA GGCGCTAAAG AACTTACTAA ATTCTTAAGT GAAAAAGGTA TAGTTGTTTC AATGGGACAT ACTAAAGCTA CTTATGAAGA AGCTATGGAA GGAATAAAAT GTGGATGCTC TCATGCCACT CATTTATTCA ATGCAATGAC TCCATTTACT CATAGAGAAC CTGGAGTTGT AGGTGCTGTT TTTGATAGTG AAATAACTAC TGAAACAATC TCAGATGGTA TTCACATAGC ATATCCAAGT TTAAGAGTTG CCTATAAACA AAAAGGAACA GATAAGGTTT TATTAATAAC TGATGCTATG ATGGCTTGCT GTATGCCTGA TGGAATGTAT TCATTAGGAG GACAAGATGT TGAAGTTAAA AATGGAGCTG CAAGATTATT GAGTGGTTCT TTAGCAGGAT CAATTTTAAC TTTAGATGTG GCAGTTAAAA ATATATTTAA AAACACTAAT TATTCACTTA ATGAAGTTAT AAAAATGGCT ACTTATAACG GAGCTAAACA CTGCAAAGTT GATGACAAAA AAGGATTAAT TAAAGAAGAC TATGATGCTG ACTTAATTCT TTTTGATGAT AATATAGATA TAAAATATGT AATAGTTAAT GGAAAATTAG TTCATAAAGC TTAA
|
Protein sequence | MLIKDCNVVY LDKIEKGNIL IENGKIKAIN PKECNCEIID GEGLFLSPGF IDVHIHGAGG HDTMDGTYEA INEISKVIVK HGTTSFLPTT MTVAAEDVCK SMEAIHKAKT KGTDGANVLG AHLEGPFISP SAIGAQNPDF LIPPTKENFY KLVGEHEDDV VSITLAPEVE GAKELTKFLS EKGIVVSMGH TKATYEEAME GIKCGCSHAT HLFNAMTPFT HREPGVVGAV FDSEITTETI SDGIHIAYPS LRVAYKQKGT DKVLLITDAM MACCMPDGMY SLGGQDVEVK NGAARLLSGS LAGSILTLDV AVKNIFKNTN YSLNEVIKMA TYNGAKHCKV DDKKGLIKED YDADLILFDD NIDIKYVIVN GKLVHKA
|
| |