Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SAG1462 |
Symbol | |
ID | 1014271 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptococcus agalactiae 2603V/R |
Kingdom | Bacteria |
Replicon accession | NC_004116 |
Strand | - |
Start bp | 1478371 |
End bp | 1481283 |
Gene Length | 2913 bp |
Protein Length | 970 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 637316635 |
Product | cell wall surface anchor family protein |
Protein accession | NP_688457 |
Protein GI | 22537606 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG5422] RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases |
TIGRFAM ID | [TIGR01167] LPXTG-motif cell wall anchor domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCCAAA AGACTTTTGG CAAGCAGTTA ACAGTTGTAG ATACTAAGAG TAGAGTCAAG ATGCATAAAT CAGAAAAAAA CTGGGTAAGA ACAGTAATGT CGCATTTTAA TCTATTTAAA GCGATTAAAG GGAGAGCAAC TGTTGAAGCA GATGTGTGTA TTCAAGATGT TGAAAAAGAA GACCGACTAT CTTCAGGAAA TTTGACCTAT CTCAAAGGAA TACTAGCTGC TGGAGCTCTG GTAGGTGGAG CGAGTTTAAC CAGTCGTGTT TATGCAGATG AGACTCCAGT TGTTCAAGAA CAATCAAGTT CTGTACCAAC ACTGGCAGAA CAAACGGAAG TGACTGTTAA AACAACTACT GTTCAAAATC ATCAAGATGG GACAGTATCG AAAAACATTA TTGATTCTAA TAGTGTATCT ATGTCAGAGT CAGCCTCAAC AAGTACTAGT GAATCTGTAA GTATGTCTAT GTCAGGGTCA ACTTTAACAA GTGTAAGTGA ATCTGTAAGT ACATCTGCTT TAACAAGTGC TTCAGAATCG ATAAGCACGT CAGCCTCAGA AAGTGTTTCA AAATCTACAA GTATTAGTGA GGTTTCAAAT ATTCTTGAAA CTCAAGCTTC TTTAACTGAT AAAGGAAGAG AGTCGTTTTC GGCAAACCAG ATAGTAACAG AAAGTAGCTT AGTTACTGAT GCTGGTAAAA ATGCTTCAGT ATCTAGCCTA ATTGAAATTA CAAAACCAAA ATCGGAGTTA CAGACTTCCA AAATGTCAAA TGAGTCGCTT ATAACTCCAG AGAAATCCCA AGTAATGATT GCAAGCGATA AAACTGGGAA TGAGAGTCTA ACTCCGACAA TTAGATTAAA ATCAGTTATT CAGCCAAGGA GTATGAACTT GATGACTTTG AGTTCGGAGA TGGACTTGAT ACCACTAGAA GAAGTGTCTG ATACTGAAAT GTTAGGTAAA GATGTATCAA GCGAGTTGCA GAAAGTTAAT ATTGCGTTAA AAGATAACAC TCTTAGTGAG CCTGGAACAG TTAAATTAGA TAGTTCAGAA AACCTTGTTT TGAACTTTGC CTTTTCAATC GCTTCTGTTA ACGAGGGAGA TGTCTTTACT GTAAAGCTTT CTGATAACCT TGACACACAA GGGATTGGTA CTATTCTAAA AGTTCAAGAT ATAATGGATG AAACGGGGCA GTTATTAGCG ACTGGGTCAT ATAGTCCTTT AACACATAAT ATTACATACA CCTGGACAAG GTATGCTTCT ACGTTGAATA ATATTAAAGC TAGAGTCAAT ATGCCAGTTT GGCCTGACCA GAGAATAATT TCTAAAACAA CTTCAGATAA GCAGTGCTTT ACTGCAACAT TGAACAATCA AGTTGCTTCA ATTGAGGAAC GTGTTCAGTA TAATAGTCCT TCAGTGACAG AACATACTAA TGTTAAGACA AATGTAAGAT CTCGGATCAT GAAGCTTGAT GATGAAAGAC AGACAGAAAC TTATATTACT CAAATTAATC CTGAAGGTAA GGAAATGTAT TTCGCATCAG GACTTGGGAA TCTATATACT ATTATCGGTT CAGATGGAAC ATCAGGTTCA CCAGTTAATT TATTAAATGC GGAAGTAAAG ATTCTAAAAA CTAATTCAAA AAATCTTACA GATAGTATGG ATCAAAATTA TGATTCGCCT GAGTTTGAAG ATGTGACTTC CCAGTATAGT TATACTAACG ATGGTTCTAA AATTACCATA GATTGGAAAA CAAATTCTAT TTCTTCCACT ACATCTTATG TTGTTTTGGT CAAAATACCT AAACAAAGTG GTGTATTGTA TTCAACTGTT TCTGATATAA ATCAAACATA TGGTTCTAAA TATTCTTATG GGCATACGAA TATAAGTGGT GACTCAGATG CGAATGCCGA AATTAAACTT TTATCAGAAA GTGCTTCTAC GAGTGCGTCG ACGTCAGCAA GTACCAGCGC TTCCATGAGT GCCTCGACAT CAGCAAGTAC CAGCGCTTCC ATGAGTGCGT CGACGTCAGC CAGCACCAGT GCTTCAACAA GTGCAAGTAT GAGTGCTTCA ACAAGTGCAA GTACCAGCGC CTCAACCAGT GCCAGTACCA GTGCTTCCAC CTCAGCAAGT ATGAGCGCCT CAACAAGTGC AAGTACTAGC GCTTCCACAA GTGCCAGTAC CAGTGCATCC ACGTCAGCAA GTACTAGCGC CTCAATGAGT GCCTCGACGT CAGCCAGCAC CAGTGCTTCC ACAAGTGCAA GTACTAGCGC CTCAATGAGT GCCTCGACGT CAGCCAGCAC CAGTGCTTCC ACAAGTGCAA GTACTAGCGC CTCAATGAGT GCCTCGACGT CAGCCAGCAC CAGTGCTTCC ACAAGTGCAA GTACTAGCGC CTCAATGAGT GCCTCGACGT CAGCCAGCAC CAGTGCTTCC ACAAGTGCAA GTACTAGCGC CTCAATGAGT GCCTCGACGT CAGCCAGCAC CAGCGCTTCA ACGAGTGCGT CAATGTCAGC CAGCACCAGT GCTTCAACCA GTGCCTCGAT GTCAGCCAGC ACTAGCGCTT CAACGAGTGC GTCAATGTCA GCCAGCACAA GTGCTTCAAC AAGTGCCTCG ATGTCAGCCA GCACAAGTGC TTCAACAAGT GCCTCGATGT CAGCCAGCAC TAGCGCTTCA ATGAGTGCCA CGACGTCAGC CAGCACCAGC GTCTCAACGA GTGCATCGAC ATCAGCAAGT ACCAGCGCTT CCACAAGTTC TTCAAGCTCA GTGACTTCTA ATTCATCAAA AGAGAAGGTG TATTCTGCCT TACCTTCTAC GGGTGACCAA GATTATTCTG TAACTGCTAC TGCCTTAGGT TTAGGTTTAA TGACTGGTGC AACCCTTTTG GGACGAAAAA AATCTAAAAA AGATAAAGAC TAA
|
Protein sequence | MSQKTFGKQL TVVDTKSRVK MHKSEKNWVR TVMSHFNLFK AIKGRATVEA DVCIQDVEKE DRLSSGNLTY LKGILAAGAL VGGASLTSRV YADETPVVQE QSSSVPTLAE QTEVTVKTTT VQNHQDGTVS KNIIDSNSVS MSESASTSTS ESVSMSMSGS TLTSVSESVS TSALTSASES ISTSASESVS KSTSISEVSN ILETQASLTD KGRESFSANQ IVTESSLVTD AGKNASVSSL IEITKPKSEL QTSKMSNESL ITPEKSQVMI ASDKTGNESL TPTIRLKSVI QPRSMNLMTL SSEMDLIPLE EVSDTEMLGK DVSSELQKVN IALKDNTLSE PGTVKLDSSE NLVLNFAFSI ASVNEGDVFT VKLSDNLDTQ GIGTILKVQD IMDETGQLLA TGSYSPLTHN ITYTWTRYAS TLNNIKARVN MPVWPDQRII SKTTSDKQCF TATLNNQVAS IEERVQYNSP SVTEHTNVKT NVRSRIMKLD DERQTETYIT QINPEGKEMY FASGLGNLYT IIGSDGTSGS PVNLLNAEVK ILKTNSKNLT DSMDQNYDSP EFEDVTSQYS YTNDGSKITI DWKTNSISST TSYVVLVKIP KQSGVLYSTV SDINQTYGSK YSYGHTNISG DSDANAEIKL LSESASTSAS TSASTSASMS ASTSASTSAS MSASTSASTS ASTSASMSAS TSASTSASTS ASTSASTSAS MSASTSASTS ASTSASTSAS TSASTSASMS ASTSASTSAS TSASTSASMS ASTSASTSAS TSASTSASMS ASTSASTSAS TSASTSASMS ASTSASTSAS TSASTSASMS ASTSASTSAS TSASMSASTS ASTSASMSAS TSASTSASMS ASTSASTSAS MSASTSASTS ASMSASTSAS MSATTSASTS VSTSASTSAS TSASTSSSSS VTSNSSKEKV YSALPSTGDQ DYSVTATALG LGLMTGATLL GRKKSKKDKD
|
| |