Gene SAG1462 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG1462 
Symbol 
ID1014271 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp1478371 
End bp1481283 
Gene Length2913 bp 
Protein Length970 aa 
Translation table11 
GC content41% 
IMG OID637316635 
Productcell wall surface anchor family protein 
Protein accessionNP_688457 
Protein GI22537606 
COG category[T] Signal transduction mechanisms 
COG ID[COG5422] RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases 
TIGRFAM ID[TIGR01167] LPXTG-motif cell wall anchor domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCCAAA AGACTTTTGG CAAGCAGTTA ACAGTTGTAG ATACTAAGAG TAGAGTCAAG 
ATGCATAAAT CAGAAAAAAA CTGGGTAAGA ACAGTAATGT CGCATTTTAA TCTATTTAAA
GCGATTAAAG GGAGAGCAAC TGTTGAAGCA GATGTGTGTA TTCAAGATGT TGAAAAAGAA
GACCGACTAT CTTCAGGAAA TTTGACCTAT CTCAAAGGAA TACTAGCTGC TGGAGCTCTG
GTAGGTGGAG CGAGTTTAAC CAGTCGTGTT TATGCAGATG AGACTCCAGT TGTTCAAGAA
CAATCAAGTT CTGTACCAAC ACTGGCAGAA CAAACGGAAG TGACTGTTAA AACAACTACT
GTTCAAAATC ATCAAGATGG GACAGTATCG AAAAACATTA TTGATTCTAA TAGTGTATCT
ATGTCAGAGT CAGCCTCAAC AAGTACTAGT GAATCTGTAA GTATGTCTAT GTCAGGGTCA
ACTTTAACAA GTGTAAGTGA ATCTGTAAGT ACATCTGCTT TAACAAGTGC TTCAGAATCG
ATAAGCACGT CAGCCTCAGA AAGTGTTTCA AAATCTACAA GTATTAGTGA GGTTTCAAAT
ATTCTTGAAA CTCAAGCTTC TTTAACTGAT AAAGGAAGAG AGTCGTTTTC GGCAAACCAG
ATAGTAACAG AAAGTAGCTT AGTTACTGAT GCTGGTAAAA ATGCTTCAGT ATCTAGCCTA
ATTGAAATTA CAAAACCAAA ATCGGAGTTA CAGACTTCCA AAATGTCAAA TGAGTCGCTT
ATAACTCCAG AGAAATCCCA AGTAATGATT GCAAGCGATA AAACTGGGAA TGAGAGTCTA
ACTCCGACAA TTAGATTAAA ATCAGTTATT CAGCCAAGGA GTATGAACTT GATGACTTTG
AGTTCGGAGA TGGACTTGAT ACCACTAGAA GAAGTGTCTG ATACTGAAAT GTTAGGTAAA
GATGTATCAA GCGAGTTGCA GAAAGTTAAT ATTGCGTTAA AAGATAACAC TCTTAGTGAG
CCTGGAACAG TTAAATTAGA TAGTTCAGAA AACCTTGTTT TGAACTTTGC CTTTTCAATC
GCTTCTGTTA ACGAGGGAGA TGTCTTTACT GTAAAGCTTT CTGATAACCT TGACACACAA
GGGATTGGTA CTATTCTAAA AGTTCAAGAT ATAATGGATG AAACGGGGCA GTTATTAGCG
ACTGGGTCAT ATAGTCCTTT AACACATAAT ATTACATACA CCTGGACAAG GTATGCTTCT
ACGTTGAATA ATATTAAAGC TAGAGTCAAT ATGCCAGTTT GGCCTGACCA GAGAATAATT
TCTAAAACAA CTTCAGATAA GCAGTGCTTT ACTGCAACAT TGAACAATCA AGTTGCTTCA
ATTGAGGAAC GTGTTCAGTA TAATAGTCCT TCAGTGACAG AACATACTAA TGTTAAGACA
AATGTAAGAT CTCGGATCAT GAAGCTTGAT GATGAAAGAC AGACAGAAAC TTATATTACT
CAAATTAATC CTGAAGGTAA GGAAATGTAT TTCGCATCAG GACTTGGGAA TCTATATACT
ATTATCGGTT CAGATGGAAC ATCAGGTTCA CCAGTTAATT TATTAAATGC GGAAGTAAAG
ATTCTAAAAA CTAATTCAAA AAATCTTACA GATAGTATGG ATCAAAATTA TGATTCGCCT
GAGTTTGAAG ATGTGACTTC CCAGTATAGT TATACTAACG ATGGTTCTAA AATTACCATA
GATTGGAAAA CAAATTCTAT TTCTTCCACT ACATCTTATG TTGTTTTGGT CAAAATACCT
AAACAAAGTG GTGTATTGTA TTCAACTGTT TCTGATATAA ATCAAACATA TGGTTCTAAA
TATTCTTATG GGCATACGAA TATAAGTGGT GACTCAGATG CGAATGCCGA AATTAAACTT
TTATCAGAAA GTGCTTCTAC GAGTGCGTCG ACGTCAGCAA GTACCAGCGC TTCCATGAGT
GCCTCGACAT CAGCAAGTAC CAGCGCTTCC ATGAGTGCGT CGACGTCAGC CAGCACCAGT
GCTTCAACAA GTGCAAGTAT GAGTGCTTCA ACAAGTGCAA GTACCAGCGC CTCAACCAGT
GCCAGTACCA GTGCTTCCAC CTCAGCAAGT ATGAGCGCCT CAACAAGTGC AAGTACTAGC
GCTTCCACAA GTGCCAGTAC CAGTGCATCC ACGTCAGCAA GTACTAGCGC CTCAATGAGT
GCCTCGACGT CAGCCAGCAC CAGTGCTTCC ACAAGTGCAA GTACTAGCGC CTCAATGAGT
GCCTCGACGT CAGCCAGCAC CAGTGCTTCC ACAAGTGCAA GTACTAGCGC CTCAATGAGT
GCCTCGACGT CAGCCAGCAC CAGTGCTTCC ACAAGTGCAA GTACTAGCGC CTCAATGAGT
GCCTCGACGT CAGCCAGCAC CAGTGCTTCC ACAAGTGCAA GTACTAGCGC CTCAATGAGT
GCCTCGACGT CAGCCAGCAC CAGCGCTTCA ACGAGTGCGT CAATGTCAGC CAGCACCAGT
GCTTCAACCA GTGCCTCGAT GTCAGCCAGC ACTAGCGCTT CAACGAGTGC GTCAATGTCA
GCCAGCACAA GTGCTTCAAC AAGTGCCTCG ATGTCAGCCA GCACAAGTGC TTCAACAAGT
GCCTCGATGT CAGCCAGCAC TAGCGCTTCA ATGAGTGCCA CGACGTCAGC CAGCACCAGC
GTCTCAACGA GTGCATCGAC ATCAGCAAGT ACCAGCGCTT CCACAAGTTC TTCAAGCTCA
GTGACTTCTA ATTCATCAAA AGAGAAGGTG TATTCTGCCT TACCTTCTAC GGGTGACCAA
GATTATTCTG TAACTGCTAC TGCCTTAGGT TTAGGTTTAA TGACTGGTGC AACCCTTTTG
GGACGAAAAA AATCTAAAAA AGATAAAGAC TAA
 
Protein sequence
MSQKTFGKQL TVVDTKSRVK MHKSEKNWVR TVMSHFNLFK AIKGRATVEA DVCIQDVEKE 
DRLSSGNLTY LKGILAAGAL VGGASLTSRV YADETPVVQE QSSSVPTLAE QTEVTVKTTT
VQNHQDGTVS KNIIDSNSVS MSESASTSTS ESVSMSMSGS TLTSVSESVS TSALTSASES
ISTSASESVS KSTSISEVSN ILETQASLTD KGRESFSANQ IVTESSLVTD AGKNASVSSL
IEITKPKSEL QTSKMSNESL ITPEKSQVMI ASDKTGNESL TPTIRLKSVI QPRSMNLMTL
SSEMDLIPLE EVSDTEMLGK DVSSELQKVN IALKDNTLSE PGTVKLDSSE NLVLNFAFSI
ASVNEGDVFT VKLSDNLDTQ GIGTILKVQD IMDETGQLLA TGSYSPLTHN ITYTWTRYAS
TLNNIKARVN MPVWPDQRII SKTTSDKQCF TATLNNQVAS IEERVQYNSP SVTEHTNVKT
NVRSRIMKLD DERQTETYIT QINPEGKEMY FASGLGNLYT IIGSDGTSGS PVNLLNAEVK
ILKTNSKNLT DSMDQNYDSP EFEDVTSQYS YTNDGSKITI DWKTNSISST TSYVVLVKIP
KQSGVLYSTV SDINQTYGSK YSYGHTNISG DSDANAEIKL LSESASTSAS TSASTSASMS
ASTSASTSAS MSASTSASTS ASTSASMSAS TSASTSASTS ASTSASTSAS MSASTSASTS
ASTSASTSAS TSASTSASMS ASTSASTSAS TSASTSASMS ASTSASTSAS TSASTSASMS
ASTSASTSAS TSASTSASMS ASTSASTSAS TSASTSASMS ASTSASTSAS TSASMSASTS
ASTSASMSAS TSASTSASMS ASTSASTSAS MSASTSASTS ASMSASTSAS MSATTSASTS
VSTSASTSAS TSASTSSSSS VTSNSSKEKV YSALPSTGDQ DYSVTATALG LGLMTGATLL
GRKKSKKDKD