Gene SAG1161 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG1161 
SymbolneuB 
ID1013968 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp1165626 
End bp1166651 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content36% 
IMG OID637316346 
ProductN-acetyl neuramic acid synthetase NeuB 
Protein accessionNP_688170 
Protein GI22537319 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2089] Sialic acid synthase 
TIGRFAM ID[TIGR03569] N-acetylneuraminate synthase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTTATA TTATTGCAGA GATTGGTTGC AATCATAATG GAGATATTAA TCTTGCGAAA 
AAAATGGTAG ATGTTGCCGT GTCTTGTGGT GTTGATGCTG TTAAATTTCA GACTTTTAAA
GCTGAGAAAC TTATTTCTAA ATTTGCTCCC AAAGCTGAAT ATCAAAAAGC AACTACAGGA
ACAGCAGACA GTCAACTTGA GATGACGAAA CGTTTAGAGT TAAGCTTTGA AGAATACTTA
GAAATGCGTG ATTATGCAAT TTCAAAAGGT GTGGAGACCT TTTCAACACC TTTTGATGAA
GAGTCATTAG AGTTCTTAAT TTCTACAGAT ATGCCAATTT ACAAAATTCC ATCAGGAGAA
ATCACTAATT TACCTTACTT AGAAAAGATT GGCAAGCAAC AAAAGAAAGT TATTCTTTCG
ACGGGTATGG CGGTAATGGA AGAGATCCAT CAAGCGGTGA ATATTTTACG TCAGAATGGT
ACAACCGACA TTTCTATTTT ACATTGTACA ACAGAGTACC CAACACCTTA CCCCTCTCTA
AATTTAAACG TTATTCATAC TTTGAAAGAT GAATTTAAAG ATTTAACGAT AGGTTATTCG
GATCATTCAA TTGGATCAGA AGTACCTATC GCAGCAGCAG CAATGGGTGC AGAAGTTATT
GAAAAACACT TTACTTTAGA TACTAATATG GAAGGTCCGG ATCATAAAGC CAGTGCAACA
CCTGATATTT TAGCTGCTTT AGTTAAAGGG GTTCGCATTG TTGAACAAGC CTTAGGTAGA
TTTGAAAAAA TCCCAGATCC AGTAGAAGAA AAAAATAAGA TTGTTGCTCG TAAATCAGTC
GTTGCTTTAA AACCAATTAA AAAAGGCGAT ATTTATTCAA TAGAAAATAT TACGGTGAAG
CGCCCAGGTA ATGGTATTTC TCCTATGAAC TGGTATGACA TCTTGGGACA AGAAGCGCAA
GATGATTTCG AAGAGGATGA AGTTATTCGT GATTCACGCT TTGAAAATCA ATTGCCCGAG
TTATAA
 
Protein sequence
MVYIIAEIGC NHNGDINLAK KMVDVAVSCG VDAVKFQTFK AEKLISKFAP KAEYQKATTG 
TADSQLEMTK RLELSFEEYL EMRDYAISKG VETFSTPFDE ESLEFLISTD MPIYKIPSGE
ITNLPYLEKI GKQQKKVILS TGMAVMEEIH QAVNILRQNG TTDISILHCT TEYPTPYPSL
NLNVIHTLKD EFKDLTIGYS DHSIGSEVPI AAAAMGAEVI EKHFTLDTNM EGPDHKASAT
PDILAALVKG VRIVEQALGR FEKIPDPVEE KNKIVARKSV VALKPIKKGD IYSIENITVK
RPGNGISPMN WYDILGQEAQ DDFEEDEVIR DSRFENQLPE L