Gene EcSMS35_3232 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3232 
SymbolneuB 
ID6142759 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3304624 
End bp3305664 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content32% 
IMG OID641618062 
Productpolysialic acid capsule biosynthesis sialic acid synthase NeuB 
Protein accessionYP_001745212 
Protein GI170684121 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2089] Sialic acid synthase 
TIGRFAM ID[TIGR03569] N-acetylneuraminate synthase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAATA TATATATCGT TGCTGAAATT GGTTGCAACC ATAATGGTAG TGTTGATATT 
GCAAGAGAAA TGATATTAAA AGCCAAAGAG GCCGGTGTTA ATGCAGTAAA ATTCCAAACA
TTTAAAGCTG ATAAATTAAT TTCAGCTATT GCACCTAAGG CAGAGTATCA AATAAAAAAC
ACAGGAGAAT TAGAATCTCA GTTAGAAATG ACAAAAAAGC TTGAAATGAA GTATGACGAT
TATCTCCATC TAATGGAATA TGCAGTCAGT TTAAATTTAG ATGTTTTTTC TACCCCTTTT
GACGAAGACT CTATTGATTT TTTAGCATCT TTGAAACAAA AAATATGGAA AATCCCTTCA
GGTGAGTTAT TGAATTTACC GTATCTTGAA AAAATAGCCA AGCTTCCGAT CCCTGATAAG
AAAATAATCA TATCAACAGG AATGGCTACT ATTGATGAGA TAAAACAGTC TGTTTCTATT
TTTATAAATA ATAAAGTTCC GGTTGATAAT ATTACAATAT TACATTGCAA TACTGAATAT
CCAACGCCCT TTGAGGATGT AAACCTTAAT GCTATTAATG ATTTGAAAAA ACACTTCCCT
AAGAATAACA TAGGCTTCTC TGATCATTCT AGCGGGTTTT ATGCAGCTAT TGCGGCGGTG
CCTTATGGAA TAACTTTTAT TGAAAAACAT TTCACTTTAG ATAAATCTAT GTCTGGCCCA
GATCATTTGG CCTCAATAGA ACCTGATGAA CTGAAACATC TATGTATTGG GGTCAGGTGT
GTTGAAAAAT CTTTAGGTTC AAATAGTAAA GTGGTTACAG CTTCAGAAAG GAAGAATAAA
ATCGTAGCAA GAAAGTCTAT TATAGCTAAA ACAGAGATAA AAAAAGGTGA GGTTTTTTCA
GAAAAAAATA TAACAACAAA AAGACCTGGT AATGGTATCA GTCCGATGGA GTGGTATAAT
TTATTGGGTA AAATTGCAGA GCAAGACTTT ATTCCAGATG AATTAATAAT TCATAGCGAA
TTCAAAAATC AGGGGGAATA A
 
Protein sequence
MSNIYIVAEI GCNHNGSVDI AREMILKAKE AGVNAVKFQT FKADKLISAI APKAEYQIKN 
TGELESQLEM TKKLEMKYDD YLHLMEYAVS LNLDVFSTPF DEDSIDFLAS LKQKIWKIPS
GELLNLPYLE KIAKLPIPDK KIIISTGMAT IDEIKQSVSI FINNKVPVDN ITILHCNTEY
PTPFEDVNLN AINDLKKHFP KNNIGFSDHS SGFYAAIAAV PYGITFIEKH FTLDKSMSGP
DHLASIEPDE LKHLCIGVRC VEKSLGSNSK VVTASERKNK IVARKSIIAK TEIKKGEVFS
EKNITTKRPG NGISPMEWYN LLGKIAEQDF IPDELIIHSE FKNQGE