Gene SAG0032 details


Gene Information

Locus tag: SAG0032
Symbol:
ID: 1012782
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptococcus agalactiae 2603V/R
Kingdom: Bacteria
Replicon accession: NC_004116
Strand:
Start bp: 47521
End bp: 48825
Gene Length: 1305 bp
Protein Length: 434 aa
Translation table: 11
GC content: 41%
IMG OID: 637315187
Product: group B streptococcal surface immunogenic protein
Protein accession: NP_687068
Protein GI: 22536217
COG category:
COG ID:
TIGRFAM ID:
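
The coordinate and length fields in this table are related by simple arithmetic. The following is a minimal Python sketch of that relationship, assuming the Start bp and End bp values are 1-based and inclusive (an assumption, though it is consistent with the reported Gene Length) and that the stop codon is not counted in the Protein Length. The function names are illustrative and are not part of the database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 47521
END_BP = 48825


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count of an unspliced CDS: number of codons minus the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1305, matching "Gene Length"
print(protein_length(length_bp))  # 434, matching "Protein Length"
```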


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.85122
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAATGA ATAAAAAGGT ACTATTGACA TCGACAATGG CAGCTTCGCT ATTATCAGTC 
GCAAGTGTTC AAGCACAAGA AACAGATACG ACGTGGACAG CACGTACTGT TTCAGAGGTA
AAGGCTGATT TGGTAAAGCA AGACAATAAA TCATCATATA CTGTGAAATA TGGTGATACA
CTAAGCGTTA TTTCAGAAGC AATGTCAATT GATATGAATG TCTTAGCAAA AATAAATAAC
ATTGCAGATA TCAATCTTAT TTATCCTGAG ACAACACTGA CAGTAACTTA CGATCAGAAG
AGTCATACTG CCACTTCAAT GAAAATAGAA ACACCAGCAA CAAATGCTGC TGGTCAAACA
ACAGCTACTG TGGATTTGAA AACCAATCAA GTTTCTGTTG CAGACCAAAA AGTTTCTCTC
AATACAATTT CGGAAGGTAT GACACCAGAA GCAGCAACAA CGATTGTTTC GCCAATGAAG
ACATATTCTT CTGCGCCAGC TTTGAAATCA AAAGAAGTAT TAGCACAAGA GCAAGCTGTT
AGTCAAGCAG CAGCTAATGA ACAGGTATCA CCAGCTCCTG TGAAGTCGAT TACTTCAGAA
GTTCCAGCAG CTAAAGAGGA AGTTAAACCA ACTCAGACGT CAGTCAGTCA GTCAACAACA
GTATCACCAG CTTCTGTTGC CGCTGAAACA CCAGCTCCAG TAGCTAAAGT AGCACCGGTA
AGAACTGTAG CAGCCCCTAG AGTGGCAAGT GTTAAAGTAG TCACTCCTAA AGTAGAAACT
GGTGCATCAC CAGAGCATGT ATCAGCTCCA GCAGTTCCTG TGACTACGAC TTCACCAGCT
ACAGACAGTA AGTTACAAGC GACTGAAGTT AAGAGCGTTC CGGTAGCACA AAAAGCTCCA
ACAGCAACAC CGGTAGCACA ACCAGCTTCA ACAACAAATG CAGTAGCTGC ACATCCTGAA
AATGCAGGGC TCCAACCTCA TGTTGCAGCT TATAAAGAAA AAGTAGCGTC AACTTATGGA
GTTAATGAAT TCAGTACATA CCGTGCGGGA GATCCAGGTG ATCATGGTAA AGGTTTAGCA
GTTGACTTTA TTGTAGGTAC TAATCAAGCA CTTGGTAATA AAGTTGCACA GTACTCTACA
CAAAATATGG CAGCAAATAA CATTTCATAT GTTATCTGGC AACAAAAGTT TTACTCAAAT
ACAAACAGTA TTTATGGACC TGCTAATACT TGGAATGCAA TGCCAGATCG TGGTGGCGTT
ACTGCCAACC ACTATGACCA CGTTCACGTA TCATTTAACA AATAA
 
Protein sequence
MKMNKKVLLT STMAASLLSV ASVQAQETDT TWTARTVSEV KADLVKQDNK SSYTVKYGDT 
LSVISEAMSI DMNVLAKINN IADINLIYPE TTLTVTYDQK SHTATSMKIE TPATNAAGQT
TATVDLKTNQ VSVADQKVSL NTISEGMTPE AATTIVSPMK TYSSAPALKS KEVLAQEQAV
SQAAANEQVS PAPVKSITSE VPAAKEEVKP TQTSVSQSTT VSPASVAAET PAPVAKVAPV
RTVAAPRVAS VKVVTPKVET GASPEHVSAP AVPVTTTSPA TDSKLQATEV KSVPVAQKAP
TATPVAQPAS TTNAVAAHPE NAGLQPHVAA YKEKVASTYG VNEFSTYRAG DPGDHGKGLA
VDFIVGTNQA LGNKVAQYST QNMAANNISY VIWQQKFYSN TNSIYGPANT WNAMPDRGGV
TANHYDHVHV SFNK
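
The gene and protein sequences are printed in space-separated blocks of 10 characters, so they need light cleaning before any check of the GC content or the translation. The sketch below is one possible way to do that in plain Python; it is not the database's own pipeline. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is built from the standard amino-acid assignments, which translation table 11 shares with the standard code; table 11 differs in the start codons it allows, which does not matter here because this gene starts with ATG.

```python
from itertools import product

# Codon order T, C, A, G at each position, paired with the standard
# amino-acid assignments (also used by translation table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used to print the sequence in blocks."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(cds: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence shown in this record.
first_blocks = "ATGAAAATGA ATAAAAAGGT ACTATTGACA"
dna = clean(first_blocks)
print(translate(dna))  # MKMNKKVLLT, the first block of the protein sequence
```

Passing the full gene sequence through `clean` and then `gc_fraction` or `translate` allows the same comparison against the GC content and protein sequence listed in this record.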