Gene SAG1683 details


Gene Information

Locus tag: SAG1683
Symbol:
ID: 1014492
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptococcus agalactiae 2603V/R
Kingdom: Bacteria
Replicon accession: NC_004116
Strand:
Start bp: 1680408
End bp: 1681946
Gene Length: 1539 bp
Protein Length: 512 aa
Translation table: 11
GC content: 39%
IMG OID: 637316852
Product: immunogenic secreted protein, putative
Protein accession: NP_688674
Protein GI: 22537823
COG category: [R] General function prediction only
COG ID: [COG3942] Surface antigen
TIGRFAM ID:
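
The coordinate and length fields in this record are mutually consistent, which can be checked with simple arithmetic. The following is a minimal sketch in Python, not part of the database record: the function names are hypothetical, and it assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon not counted in the protein length.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive start and end coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Protein length for a CDS, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(1680408, 1681946)
print(length, protein_length_aa(length))  # prints: 1539 512
```

Both results agree with the Gene Length and Protein Length fields listed for SAG1683.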


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00328189
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCAAAAC AAAAAGTAAT GGCAACTTTG TTGTTATCCA CTTTAGTCTT ATCGCTATCA 
TCACCTTTAG TGACCTTAGC AGAAACTATT AATCCAGAAA CAAGCCTGAC AATGGCAACA
GCATCAACAG AAAGTTCTTC TGAAGCAGAG AAACAGGAAA AAACACAACC TACAGATTCA
GAAACTGCTT CACCTTCAGC CGAAGGAAGT ATCTCAACAG AAAAAACAGA GATTGGTACG
ACAGAGACAT CATCAAGCAA TGAATCATCA TCAAGTTCAT CACATCAATC TTCTTCCAAC
GAAGATGCTA AAACATCTGA TTCTGCTTCA ACAGCATCTA CTCCTAGCAC TAATACTACA
AACAGTAGTC AAGCAGACAG TAAGCCAGGT CAATCAACAA AGACTGAATT AAAACCTGAG
CCTACCTTAC CATTAGTAGA GCCTAAAATA ACTCCCGCTC CGTCTCAGAT AGAAAGTGTT
CAGACAAATC AGAATGCTTC TGTTCCTGCT TTATCCTTTG ATGATAACTT ATTATCAACA
CCGATTTCAC CAGTGACAGC AACGCCATTC TACGTAGAAC ACTGGTCTGG TCAGGATGCC
TACTCTCACT ATTTATTGTC ACATCGTTAC GGTATCAAAG CTGAACAATT AGATGGGTAC
TTAAAATCTT TAGGGATTCA ATATGATTCT AATCGTATCA ATGGTGCTAA GTTATTACAA
TGGGAAAAAG ATAGTGGTTT AGATGTCCGT GCTATTGTAG CTATTGCTGT CCTTGAAAGT
TCATTGGGAA CTCAAGGAGT GGCTAAGATG CCAGGTGCTA ATATGTTTGG TTATGGTGCC
TTTGATCATG ACTCTAGCCA TGCTAGTGCT TATAATGATG AAGAAGCAAT TATGTTGTTG
ACAAAAAATA CAATTATTAA AAACAACAAC TCTAGCTTTG AAATCCAAGA TTTGAAAGCA
CAGAAATTAT CTTCTGGACA ACTTAATACA GTTACTGAGG GTGGTGTTTA TTATACAGAT
AACTCTGGAA CTGGTAAACG TCGTGCCCAG ATTATGGAAG ATTTAGACCG CTGGATTGAT
CAACATGGAG GGACACCAGA AATTCCTGCT GCCTTGAAAG CTTTATCGAC AGCAAGTTTA
GCAGATTTAC CAAGTGGTTT TAGCTTATCA ACAGCAGTTA ACACAGCTAG CTATATTGCA
TCAACTTATC CATGGGGTGA ATGTACATGG TATGTCTTTA ACCGCGCTAA AGAGTTAGGT
TATACATTTG ATCCATTTAT GGGTAATGGT GGAGATTGGC AACATAAGGC TGGTTTTGAA
ACAACACATT CACCAAAAGT AGGCTATGCT GTATCATTTT CACCAGGACA AGCTGGTGCT
GATGGCACTT ACGGTCACGT AGCTATTGTT GAAGAAGTTA AAAAAGATGG TTCAGTTCTT
ATTTCAGAAT CTAATGCAAT GGGACGTGGT ATTGTCTCTT ACCGTACTTT TAGTTCAGCA
CAAGCTGCAC AATTAACTTA TGTTATTGGC CATAAATAA
 
Protein sequence
MSKQKVMATL LLSTLVLSLS SPLVTLAETI NPETSLTMAT ASTESSSEAE KQEKTQPTDS 
ETASPSAEGS ISTEKTEIGT TETSSSNESS SSSSHQSSSN EDAKTSDSAS TASTPSTNTT
NSSQADSKPG QSTKTELKPE PTLPLVEPKI TPAPSQIESV QTNQNASVPA LSFDDNLLST
PISPVTATPF YVEHWSGQDA YSHYLLSHRY GIKAEQLDGY LKSLGIQYDS NRINGAKLLQ
WEKDSGLDVR AIVAIAVLES SLGTQGVAKM PGANMFGYGA FDHDSSHASA YNDEEAIMLL
TKNTIIKNNN SSFEIQDLKA QKLSSGQLNT VTEGGVYYTD NSGTGKRRAQ IMEDLDRWID
QHGGTPEIPA ALKALSTASL ADLPSGFSLS TAVNTASYIA STYPWGECTW YVFNRAKELG
YTFDPFMGNG GDWQHKAGFE TTHSPKVGYA VSFSPGQAGA DGTYGHVAIV EEVKKDGSVL
ISESNAMGRG IVSYRTFSSA QAAQLTYVIG HK
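
The protein sequence corresponds to a codon-by-codon translation of the gene sequence under translation table 11. The sketch below illustrates this for the first line of the gene sequence. It is an illustration under stated assumptions, not the database's own method: the `translate` helper and the hand-built codon table are hypothetical names, and the table uses the standard codon assignments, which table 11 shares (table 11 differs in its permitted start codons, which does not matter here because the gene starts with ATG).

```python
from itertools import product

# Codon table in the conventional TCAG order; amino-acid assignments are those
# of the standard genetic code, which translation table 11 also uses.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    Characters other than A, C, G, T raise a KeyError in this simple sketch.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGTCAAAAC AAAAAGTAAT GGCAACTTTG TTGTTATCCA CTTTAGTCTT ATCGCTATCA"
print(translate(first_line))  # prints: MSKQKVMATLLLSTLVLSLS
```

The output matches the first 20 residues of the protein sequence (MSKQKVMATL LLSTLVLSLS). Passing the whole gene sequence to the same function should reproduce the full protein, since the final codon TAA is a stop.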