Gene SAG1407 details

Gene Information

Locus tag: SAG1407
Symbol:
ID: 1014216
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptococcus agalactiae 2603V/R
Kingdom: Bacteria
Replicon accession: NC_004116
Strand:
Start bp: 1414765
End bp: 1416882
Gene Length: 2118 bp
Protein Length: 705 aa
Translation table: 11
GC content: 37%
IMG OID: 637316583
Product: cell wall surface anchor family protein
Protein accession: NP_688405
Protein GI: 22537554
COG category:
COG ID:
TIGRFAM ID: [TIGR01167] LPXTG-motif cell wall anchor domain
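
The reported lengths can be cross-checked against the coordinates with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it is consistent with the reported gene length) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1414765
end_bp = 1416882

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be the stop codon,
# which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.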


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAGAA TCAACAAATA TTTTGCAATG TTCTCGGCAT TGTTACTGAC TTTAACGTCA 
TTGCTCTCAG TTGCACCAGC GTTTGCGGAT GAAGCAACAA CTAATACAGT GACTTTGCAC
AAGATTTTGC AAACCGAATC AAATCTTAAC AAAAGTAACT TCCCAGGAAC TACAGGTCTT
AACGGAAAAG ACTACAAAGG TGGAGCTATT TCTGACCTTG CTGGTTACTT TGGCGAGGGA
TCTAAAGAAA TCGAAGGTGC GTTCTTTGCT TTAGCTTTGA AAGAAGATAA AAGTGGTAAA
GTGCAATATG TTAAGGCAAA AGAAGGTAAC AAATTAACAC CAGCCTTAAT TAATAAAGAT
GGTACTCCTG AAATAACAGT AAATATTGAT GAGGCCGTGT CTGGATTGAC ACCAGAGGGA
GATACTGGAC TTGTTTTCAA CACCAAAGGA TTGAAAGGCG AGTTTAAAAT TGTTGAAGTT
AAATCAAAAT CTACTTACAA CAATAATGGT TCCCTCCTGG CTGCTTCAAA AGCGGTTCCA
GTTAACATCA CTCTTCCATT GGTAAATGAA GATGGTGTTG TTGCTGATGC CCATGTTTAT
CCAAAGAACA CTGAAGAAAA ACCAGAAATT GATAAAAACT TTGCTAAAAC AAACGATTTG
ACAGCATTGA CAGATGTTAA TAGACTTTTG ACAGCTGGCG CAAATTATGG TAATTATGCA
CGTGACAAAG CAACTGCTAC TGCTGAAATC GGTAAAGTTG TTCCTTATGA AGTTAAAACA
AAAATTCACA AAGGTTCTAA ATACGAAAAC TTGGTTTGGA CAGATATAAT GTCAAATGGT
TTGACAATGG GTTCAACTGT TAGCCTTAAA GCTTCAGGAA CTACAGAAAC TTTTGCTAAG
GATACAGACT ATGAACTTAG CATTGATGCC CGTGGTTTCA CATTAAAATT CACAGCTGAT
GGATTGGGCA AATTGGAAAA AGCAGCTAAA ACAGCTGATA TTGAATTTAC ATTGACTTAT
AGTGCTACTG TTAATGGTCA AGCAATTATT GATAATCCAG AATCCAATGA TATCAAATTG
TCGTATGGTA ACAAACCAGG TAAAGACTTG ACTGAACTTC CTGTTACACC TTCAAAGGGT
GAAGTAACAG TTGCTAAAAC TTGGTCTGAC GGAATTGCAC CTGATGGTGT AAACGTTGTT
TACACATTGA AAGATAAAGA TAAAACTGTT GCTTCAGTAT CATTGACAAA AACATCTAAA
GGTACAATCG ACCTTGGAAA TGGTATCAAA TTTGAAGTAT CTGGTAACTT CTCGGGTAAA
TTCACTGGTC TAGAAAACAA ATCATACATG ATCTCAGAAC GTGTTTCTGG TTACGGAAGT
GCAATAAATC TAGAAAATGG TAAAGTAACC ATTACCAATA CCAAAGATTC TGATAACCCA
ACACCATTGA ACCCAACTGA ACCAAAAGTT GAAACTCATG GTAAGAAATT TGTCAAAACT
AATGAACAAG GTGACCGTTT GGCTGGTGCA CAATTCGTTG TGAAAAACTC AGCAGGTAAA
TACCTTGCTC TTAAAGCAGA TCAATCAGAA GGTCAAAAAA CTTTAGCTGC TAAGAAAATA
GCTTTAGATG AAGCTATCGC TGCTTATAAC AAGTTGTCTG CAACAGACCA AAAAGGTGAA
AAAGGAATTA CTGCAAAAGA ACTTATCAAA ACTAAACAAG CAGATTACGA TGCAGCCTTC
ATTGAGGCTC GTACAGCTTA TGAGTGGATA ACAGATAAGG CTAGAGCCAT TACCTACACT
TCAAACGATC AAGGTCAATT TGAAGTTACA GGTCTTGCAG ACGGTACTTA CAACCTTGAA
GAAACACTTG CTCCAGCAGG ATTTGCTAAG TTGGCAGGTA ATATTAAGTT TGTAGTTAAT
CAAGGGTCAT ACATAACAGG TGGTAACATT GACTACGTTG CTAACAGCAA CCAAAAAGAT
GCGACACGTG TAGAAAATAA AAAGGTAACA ATCCCACAAA CAGGTGGTAT TGGTACAATT
CTTTTCACAA TTATTGGTTT AAGCATTATG CTTGGAGCAG TAGTTATCAT GAAAAGACGC
CAATCAAAGG AAGCTTAA
 
Protein sequence
MKRINKYFAM FSALLLTLTS LLSVAPAFAD EATTNTVTLH KILQTESNLN KSNFPGTTGL 
NGKDYKGGAI SDLAGYFGEG SKEIEGAFFA LALKEDKSGK VQYVKAKEGN KLTPALINKD
GTPEITVNID EAVSGLTPEG DTGLVFNTKG LKGEFKIVEV KSKSTYNNNG SLLAASKAVP
VNITLPLVNE DGVVADAHVY PKNTEEKPEI DKNFAKTNDL TALTDVNRLL TAGANYGNYA
RDKATATAEI GKVVPYEVKT KIHKGSKYEN LVWTDIMSNG LTMGSTVSLK ASGTTETFAK
DTDYELSIDA RGFTLKFTAD GLGKLEKAAK TADIEFTLTY SATVNGQAII DNPESNDIKL
SYGNKPGKDL TELPVTPSKG EVTVAKTWSD GIAPDGVNVV YTLKDKDKTV ASVSLTKTSK
GTIDLGNGIK FEVSGNFSGK FTGLENKSYM ISERVSGYGS AINLENGKVT ITNTKDSDNP
TPLNPTEPKV ETHGKKFVKT NEQGDRLAGA QFVVKNSAGK YLALKADQSE GQKTLAAKKI
ALDEAIAAYN KLSATDQKGE KGITAKELIK TKQADYDAAF IEARTAYEWI TDKARAITYT
SNDQGQFEVT GLADGTYNLE ETLAPAGFAK LAGNIKFVVN QGSYITGGNI DYVANSNQKD
ATRVENKKVT IPQTGGIGTI LFTIIGLSIM LGAVVIMKRR QSKEA
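
As a rough illustration of how the gene sequence relates to the protein sequence, the sketch below translates the first and last lines of the gene sequence and compares the result with the corresponding protein residues. It assumes the sequence is given 5' to 3' on the coding strand and uses the codon assignments of NCBI translation table 11, which match those of the standard code. The helper names (`clean`, `translate`) are hypothetical and not part of any tool behind this record.

```python
from itertools import product

# Codon assignments of NCBI translation table 11 (same amino acid
# assignments as the standard code), listed in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(block: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(block.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string codon by codon ('*' = stop)."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore an incomplete trailing codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First and last lines of the gene sequence, copied from the record.
first_gene_line = "ATGAAAAGAA TCAACAAATA TTTTGCAATG TTCTCGGCAT TGTTACTGAC TTTAACGTCA"
last_gene_line = "CAATCAAAGG AAGCTTAA"

# Matching residues from the start and end of the protein sequence.
protein_start = clean("MKRINKYFAM FSALLLTLTS")
protein_end = "QSKEA"

print(translate(first_gene_line) == protein_start)
print(translate(last_gene_line) == protein_end + "*")
```

The last gene line begins at base 2101, which is in frame because the preceding 2100 bases are a multiple of three, so it can be translated on its own.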