Gene EcSMS35_3231 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3231 
SymbolneuA 
ID6142885 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3303368 
End bp3304624 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content31% 
IMG OID641618061 
Productpolysialic acid capsule biosynthesis N-acylneuraminate cytidylyltransferase NeuA 
Protein accessionYP_001745211 
Protein GI170683470 
COG category[E] Amino acid transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1083] CMP-N-acetylneuraminic acid synthetase
[COG2755] Lysophospholipase L1 and related esterases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAACAA AAATTATTGC GATAATTCCA GCCCGTAGTG GATCTAAAGG GTTGAGAAAT 
AAAAATGCTT TGATGCTGAT AGATAAACCT CTTCTTGCTT ATACAATTGA AGCTGCCTTG
CAGTCAGAAA TGTTTGAGAA AGTAATTGTG ACAACTGACT CCGAACAGTA TGGAGCAATA
GCAGAGTCAT ATGGTGCTGA TTTTTTGCTG AGACCGGAAG AACTAGCAAC TGATAAAGCA
TCATCATTTG AATTTATAAA ACATGCGTTA AGTATATATA CTGATTATGA GAACTTTGCT
TTATTACAAC CAACTTCACC CTTTAGAGAT TCGACCCATA TTATTGAGGC TGTAAAGTTA
TATCAAACTT TAGAAAAATA CCAATGTGTT GTTTCTGTTA CTAGAAGCAA TAAGCCATCA
CAAATAATTA GACCATTAGA TGATTACTCG ACACTGTCTT TTTTTGACCT TGATTATAGT
AAATATAATC GAAACTCAAT AGTAGAATAT CATCCGAATG GAGCTATATT TATAGCTAAT
AAGCAGCATT ATCTTCATAC AAAGCATTTT TTTGGTCGCT ATTCACTAGC TTATATTATG
GATAAGGAAA GCTCTTTAGA TATAGATGAT AGAATGGATT TCGAACTTGC AATTACCATT
CAGCAAAAAA AAAATAGACA AAAAATACTT TATCAAAACA TACATAATAG AATCAATGAG
AAACGAAATG AATTTGATAG TGTAAGTGAT ATAACTTTAA TTGGACACTC GCTGTTTGAT
TATTGGGACG TAAAAAAAAT AAATGATATA GAAGTTAATA ACTTAGGTAT CGCTGGTATA
AACTCGAAGG AGTACTATGA ATATATTATT GAGAAAGAGC GGATTGTTAA TTTCGGAGAG
TTTGTTTTCA TCTTTTTTGG AACTAATGAT ATAGTTGTTA GTGATTGGAA AAAAGAAGAC
ACATTGTGGT ATTTGAAGAA AACATGCCAG TATATAAAGA AGAAAAATGC TGCATCAAAA
ATTTATTTAT TGTCGGTTCC TCCTGTTTTT GGGCGTATTG ATCGAGATAA TAGAATAATT
AATGATTTAA ATTCTTATCT TCGAGAGAAT GTAGATTTTG CGAAGTTTAT TAGCTTGGAT
CACGTTTTAA AAGACTCTTA TGGCAATCTA AATAAAATGT ATACTTATGA TGGCTTACAT
TTTAATAGTA ATGGGTATAC AGTATTAGAA AACGAAATAG CGGAGATTGT TAAATGA
 
Protein sequence
MRTKIIAIIP ARSGSKGLRN KNALMLIDKP LLAYTIEAAL QSEMFEKVIV TTDSEQYGAI 
AESYGADFLL RPEELATDKA SSFEFIKHAL SIYTDYENFA LLQPTSPFRD STHIIEAVKL
YQTLEKYQCV VSVTRSNKPS QIIRPLDDYS TLSFFDLDYS KYNRNSIVEY HPNGAIFIAN
KQHYLHTKHF FGRYSLAYIM DKESSLDIDD RMDFELAITI QQKKNRQKIL YQNIHNRINE
KRNEFDSVSD ITLIGHSLFD YWDVKKINDI EVNNLGIAGI NSKEYYEYII EKERIVNFGE
FVFIFFGTND IVVSDWKKED TLWYLKKTCQ YIKKKNAASK IYLLSVPPVF GRIDRDNRII
NDLNSYLREN VDFAKFISLD HVLKDSYGNL NKMYTYDGLH FNSNGYTVLE NEIAEIVK