Gene Information |
Locus tag | EcSMS35_3231 |
Symbol | neuA |
ID | 6142885 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3303368 |
End bp | 3304624 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 641618061 |
Product | polysialic acid capsule biosynthesis N-acylneuraminate cytidylyltransferase NeuA |
Protein accession | YP_001745211 |
Protein GI | 170683470 |
COG category | [E] Amino acid transport and metabolism; [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1083] CMP-N-acetylneuraminic acid synthetase; [COG2755] Lysophospholipase L1 and related esterases |
TIGRFAM ID | |
| |
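The Start bp, End bp, Gene Length and Protein Length fields are related by simple arithmetic. The sketch below shows one way to cross-check them, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers for illustration, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon (assumed present) does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for EcSMS35_3231 (neuA).
length = gene_length_bp(3303368, 3304624)
print(length, protein_length_aa(length))
```

With the values in this record, the computed lengths agree with the listed Gene Length (1257 bp) and Protein Length (418 aa).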
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAACAA AAATTATTGC GATAATTCCA GCCCGTAGTG GATCTAAAGG GTTGAGAAAT AAAAATGCTT TGATGCTGAT AGATAAACCT CTTCTTGCTT ATACAATTGA AGCTGCCTTG CAGTCAGAAA TGTTTGAGAA AGTAATTGTG ACAACTGACT CCGAACAGTA TGGAGCAATA GCAGAGTCAT ATGGTGCTGA TTTTTTGCTG AGACCGGAAG AACTAGCAAC TGATAAAGCA TCATCATTTG AATTTATAAA ACATGCGTTA AGTATATATA CTGATTATGA GAACTTTGCT TTATTACAAC CAACTTCACC CTTTAGAGAT TCGACCCATA TTATTGAGGC TGTAAAGTTA TATCAAACTT TAGAAAAATA CCAATGTGTT GTTTCTGTTA CTAGAAGCAA TAAGCCATCA CAAATAATTA GACCATTAGA TGATTACTCG ACACTGTCTT TTTTTGACCT TGATTATAGT AAATATAATC GAAACTCAAT AGTAGAATAT CATCCGAATG GAGCTATATT TATAGCTAAT AAGCAGCATT ATCTTCATAC AAAGCATTTT TTTGGTCGCT ATTCACTAGC TTATATTATG GATAAGGAAA GCTCTTTAGA TATAGATGAT AGAATGGATT TCGAACTTGC AATTACCATT CAGCAAAAAA AAAATAGACA AAAAATACTT TATCAAAACA TACATAATAG AATCAATGAG AAACGAAATG AATTTGATAG TGTAAGTGAT ATAACTTTAA TTGGACACTC GCTGTTTGAT TATTGGGACG TAAAAAAAAT AAATGATATA GAAGTTAATA ACTTAGGTAT CGCTGGTATA AACTCGAAGG AGTACTATGA ATATATTATT GAGAAAGAGC GGATTGTTAA TTTCGGAGAG TTTGTTTTCA TCTTTTTTGG AACTAATGAT ATAGTTGTTA GTGATTGGAA AAAAGAAGAC ACATTGTGGT ATTTGAAGAA AACATGCCAG TATATAAAGA AGAAAAATGC TGCATCAAAA ATTTATTTAT TGTCGGTTCC TCCTGTTTTT GGGCGTATTG ATCGAGATAA TAGAATAATT AATGATTTAA ATTCTTATCT TCGAGAGAAT GTAGATTTTG CGAAGTTTAT TAGCTTGGAT CACGTTTTAA AAGACTCTTA TGGCAATCTA AATAAAATGT ATACTTATGA TGGCTTACAT TTTAATAGTA ATGGGTATAC AGTATTAGAA AACGAAATAG CGGAGATTGT TAAATGA
Protein sequence | MRTKIIAIIP ARSGSKGLRN KNALMLIDKP LLAYTIEAAL QSEMFEKVIV TTDSEQYGAI AESYGADFLL RPEELATDKA SSFEFIKHAL SIYTDYENFA LLQPTSPFRD STHIIEAVKL YQTLEKYQCV VSVTRSNKPS QIIRPLDDYS TLSFFDLDYS KYNRNSIVEY HPNGAIFIAN KQHYLHTKHF FGRYSLAYIM DKESSLDIDD RMDFELAITI QQKKNRQKIL YQNIHNRINE KRNEFDSVSD ITLIGHSLFD YWDVKKINDI EVNNLGIAGI NSKEYYEYII EKERIVNFGE FVFIFFGTND IVVSDWKKED TLWYLKKTCQ YIKKKNAASK IYLLSVPPVF GRIDRDNRII NDLNSYLREN VDFAKFISLD HVLKDSYGNL NKMYTYDGLH FNSNGYTVLE NEIAEIVK
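The gene and protein sequences above are printed in blocks of ten characters separated by spaces. The sketch below shows how such a block-formatted gene sequence could be cleaned, translated, and checked for GC content. It is a minimal illustration, not the method the database itself uses. The helper names `clean`, `translate` and `gc_fraction` are hypothetical. The codon table is the standard amino acid assignment, which translation table 11 shares with table 1 (the two differ in permitted start codons, which this sketch does not model).

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code, same assignments as table 11).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between ten-character blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three ten-base blocks of the gene sequence in this record.
prefix = "ATGAGAACAA AAATTATTGC GATAATTCCA"
print(translate(prefix))  # first ten residues of the protein sequence
```

Passing the full gene sequence from the record to `translate` and `gc_fraction` would allow the listed Protein sequence and GC content fields to be compared against recomputed values.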