Gene SAG2063 details

Gene Information

Locus tag: SAG2063
Symbol:
ID: 1014874
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptococcus agalactiae 2603V/R
Kingdom: Bacteria
Replicon accession: NC_004116
Strand:
Start bp: 2043157
End bp: 2045049
Gene Length: 1893 bp
Protein Length: 630 aa
Translation table: 11
GC content: 37%
IMG OID: 637317229
Product: pathogenicity protein, putative
Protein accession: NP_689049
Protein GI: 22538198
COG category:
COG ID:
TIGRFAM ID: [TIGR01167] LPXTG-motif cell wall anchor domain
            [TIGR01168] Gram-positive signal peptide, YSIRK family
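
The coordinate and length fields above are mutually consistent, which the short sketch below checks. It assumes the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon that is not counted in the protein length; neither convention is stated in the record itself.

```python
# Values copied from the record above.
start_bp = 2043157
end_bp = 2045049

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1893 630
```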


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATAATA ACGAAAAAAA AGTAAAATAC TTTTTAAGAA AAACAGCTTA TGGTTTGGCC 
TCAATGTCAG CAGCGTTTGC TGTATGTAGT GGTATTGTAC ACGCGGATAC TAGTTCAGGA
ATATCGGCTT CAATTCCTCA TAAGAAACAA GTTAATTTAG GGGCGGTTAC TCTGAAGAAT
TTGATTTCTA AATATCGTGG TAATGACAAA GCTATTGCTA TACTTTTAAG TAGAGTAAAT
GATTTTAATA GAGCATCACA GGATACACTT CCACAATTAA TTAATAGTAC TGAAGCAGAA
ATTAGAAATA TTTTATATCA AGGACAAATT GGTAAGCAAA ATAAACCAAG TGTAACTACA
CATGCTAAAG TTAGTGATCA AGAACTAGGT AAGCAGTCAA GACGTTCTCA AGATATCATT
AAGTCATTAG GTTTCCTTTC ATCAGACCAA AAAGATATTT TAGTTAAATC TATTAGCTCT
TCAAAAGATT CGCAACTTAT TCTTAAATTT GTAACTCAAG CCACGCAACT GAATAATGCT
GAATCAACAA AAGCTAAGCA AATGGCTCAA AATGACGTGG CCTTAATAAA AAATATAAGC
CCCGAAGTCT TAGAAGAATA TAAAGAAAAA ATTCAAAGAG CTAGCACTAA GAGTCAAGTT
GATGAGTTTG TAGCAGAAGC TAAAAAAGTT GTTAATTCCA ATAAAGAAAC GTTGGTAAAT
CAGGCCAATG GTAAAAAGCA AGAAATTGCT AAGTTAGAAA ATTTATCTAA CGATGAAATG
TTGAGATATA ATACTGCAAT TGATAATGTA GTGAAACAGT ATAATGAAGG TAAGCTCAAT
ATTACTGCTG CAATGAATGC TTTAAATAGT ATTAAGCAAG CAGCACAGGA AGTTGCCCAG
AAAAACTTAC AAAAGCAGTA TGCTAAAAAA ATTGAAAGAA TAAGTTCAAA AGGATTAGCG
TTATCTAAAA AGGCTAAAGA AATTTATGAA AAGCATAAAA GTATTTTGCC TACACCTGGA
TATTATGCAG ACTCTGTGGG AACTTATTTG AATAGGTTTA GAGATAAACA AACTTTCGGA
AATAGGAGTG TTTGGACTGG TCAAAGTGGA CTTGATGAAG CAAAAAAAAT GCTTGATGAA
GTCAAAAAGC TTTTAAAAGA ACTTCAAGAC CTTACCAGAG GTACTAAAGA AGATAAAAAA
CCAGACGTTA AGCCAGAAGC CAAACCAGAG GCCAAACCAG ACGTTAAGCC AGAGGCCAAA
CCAGACGTTA AGCCAGAAGC TAAGCCAGAC GTTAAACCAG AAGCTAAGCC AGACGTTAAA
CCAGAAGCTA AGCCAGACGT TAAACCAGAA GCTAAGCCAG ACGTTAAACC AAAGGCCAAA
CCAGACGTTA AGCCAGAAGC TAAGCCAGAC GTTAAACCAG ACGTTAAACC AGACGTTAAG
CCAGAGGCCA AACCAGAGGA TAAGCCAGAC GTTAAACCAG ACGTTAAGCC AGAAGCTAAA
CCAGACGTTA AGCCAGAGGC CAAACCAGAA GCTAAGCCAG AAGCTAAGCC AGAAGCTAAG
CCAGAGGCCA AACCAGAAGC TAAGCCAGAC GTTAAGCCAG AAGCTAAACC AGACGTTAAA
CCAGAGGCTA AGCCAGAAGC TAAACCAGAG GCTAAGTCAG AAGCTAAACC AGAGGCTAAG
CTAGAAGCTA AACCAGAGGC CAAACCAGCA ACCAAAAAAT CGGTTAATAC TAGCGGAAAC
TTGGCGGCTA AAAAAGCTAT TGAAAACAAA AAGTATAGTA AAAAATTACC ATCAACGGGT
GAAGCCGCAA GTCCACTCTT AGCAATTGTA TCACTAATTG TTATGTTAAG TGCAGGTCTT
ATTACGATAG TTTTAAAGCA TAAAAAAAAT TAA
 
Protein sequence
MNNNEKKVKY FLRKTAYGLA SMSAAFAVCS GIVHADTSSG ISASIPHKKQ VNLGAVTLKN 
LISKYRGNDK AIAILLSRVN DFNRASQDTL PQLINSTEAE IRNILYQGQI GKQNKPSVTT
HAKVSDQELG KQSRRSQDII KSLGFLSSDQ KDILVKSISS SKDSQLILKF VTQATQLNNA
ESTKAKQMAQ NDVALIKNIS PEVLEEYKEK IQRASTKSQV DEFVAEAKKV VNSNKETLVN
QANGKKQEIA KLENLSNDEM LRYNTAIDNV VKQYNEGKLN ITAAMNALNS IKQAAQEVAQ
KNLQKQYAKK IERISSKGLA LSKKAKEIYE KHKSILPTPG YYADSVGTYL NRFRDKQTFG
NRSVWTGQSG LDEAKKMLDE VKKLLKELQD LTRGTKEDKK PDVKPEAKPE AKPDVKPEAK
PDVKPEAKPD VKPEAKPDVK PEAKPDVKPE AKPDVKPKAK PDVKPEAKPD VKPDVKPDVK
PEAKPEDKPD VKPDVKPEAK PDVKPEAKPE AKPEAKPEAK PEAKPEAKPD VKPEAKPDVK
PEAKPEAKPE AKSEAKPEAK LEAKPEAKPA TKKSVNTSGN LAAKKAIENK KYSKKLPSTG
EAASPLLAIV SLIVMLSAGL ITIVLKHKKN
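
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. A minimal sketch of that translation, plus a GC percentage helper, is given below. The function names (`clean`, `translate`, `gc_percent`) are illustrative and not part of any tool behind this record. The codon assignments are the standard ones, which translation table 11 shares; special handling of alternative start codons is omitted because this gene starts with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, enumerated in TCAG order ('*' marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of bases that are G or C."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence above.
print(translate("ATGAATAATA ACGAAAAAAA AGTAAAATAC"))  # MNNNEKKVKY
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_percent` on the same input can be compared with the GC content field.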