Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SAG0421 |
Symbol | |
ID | 1013223 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptococcus agalactiae 2603V/R |
Kingdom | Bacteria |
Replicon accession | NC_004116 |
Strand | + |
Start bp | 436461 |
End bp | 439628 |
Gene Length | 3168 bp |
Protein Length | 1055 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 637315626 |
Product | hypothetical protein |
Protein accession | NP_687455 |
Protein GI | 22536604 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01167] LPXTG-motif cell wall anchor domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAAAAA AACATCTTAA AACGCTTGCC TTGGCACTTA CTACAGTATC AGTAGTGACA TACAGCCAGG AGGTATATGG ATTAGAAAGA GAGGAATCGG TCAAACAAGA ACAAACCCAG TCAGCTTCAG AAGATGATTG GTTCGAAGAA GATAATGAGA GGAAAACAAA TGTTTCTAAA GAGAATTCTA CTGTTGATGA AACAGTTAGT GATTTATTTT CTGATGGAAA TAGTAATAAC TCTAGTTCTA AAACCGAGTC AGTGGTAAGT GACCCTAAAC AAGTCCCCAA AGCAAAACCA GAGGTTACAC AAGAAGCAAG CAATTCTAGT AATGATGCTA GCAAAGTAGA AGTACCAAAA CAGGATACAG CTTCAAAAAA GGAAACTCTA GAAACATCAA CTTGGGAGGC AAAAGATTTC GTAACTAGAG GGGATACTTT AGTAGGTTTT TCAAAATCTG GAATTAATAA GTTATCTCAA ACATCACACT TGGTTTTACC AAGTCATGCA GCAGATGGAA CTCAATTGAC ACAAGTAGCT AGCTTTGCTT TTACTCCAGA TAAAAAGACG GCCATTGCAG AATATACAAG TAGGCTAGGA GAAAATGGGA AACCGAGTCG TTTAGATATT GATCAGAAGG AAATTATTGA TGAGGGAGAA ATATTTAATG CTTACCAGTT GACTAAGCTT ACTATTCCAA ATGGTTATAA GTCTATTGGT CAAGATGCTT TTGTGGACAA TAAGAATATT GCTGAGGTTA ACCTTCCTGA GAGTCTCGAG ACTATTTCAG ACTATGCTTT TGCTCACATG TCTTTAAAAC AAGTAAAGTT ACCAGATAAC CTAAAGGTCA TTGGAGAATT AGCTTTTTTT GATAATCAGA TTGGTGGTAA GCTTTACTTG CCACGTCACT TGATAAAATT AGCAGAACGC GCTTTCAAAT CTAATCGTAT TCAAACAGTT GAATTTTTGG GAAGTAAGCT TAAGGTTATA GGAGAAGCAA GTTTTCAAGA TAATAATCTG AGGAATGTTA TGCTTCCGGA TGGACTTGAA AAAATAGAAT CAGAAGCTTT TACAGGAAAT CCAGGAGATG AACATTACAA CAATCAGGTT GTATTGCGCA CAAGGACAGG CCAAAATCCA CATCAACTTG CGACTGAGAA TACTTACGTC AATCCGGACA AATCATTGTG GCGTGCAACA CCTGATATGG ATTATACCAA ATGGTTAGAG GAAGATTTTA CCTATCAAAA AAATAGTGTT ACAGGTTTTT CAAATAAAGG CTTACAAAAG GTAAGACGTA ATAAAAACTT AGAAATTCCA AAACAACACA ATGGTATTAC TATTACTGAA ATTGGTGATA ACGCTTTTCG CAATGTTGAT TTTCAAAGTA AAACTTTACG TAAATATGAT TTGGAAGAAA TAAAGCTCCC CTCAACTATT CGGAAAATAG GTGCTTTTGC TTTTCAATCT AATAACTTGA AATCCTTTGA AGCAAGTGAA GATTTAGAAG AGATTAAAGA GGGAGCCTTT ATGAATAATC GTATTGGAAC TCTAGACTTG AAAGACAAAC TTATCAAAAT AGGTGATGCT GCTTTCCATA TTAATCATAT TTATGCCATT GTTCTTCCAG AATCTGTACA AGAAATAGGA CGTTCAGCTT TTCGACAAAA TGGTGCGCTT CACCTTATGT TTATCGGAAA TAAGGTTAAA ACAATTGGTG AAATGGCTTT TTTATCCAAT AAACTGGAAA GTGTAAATCT CTCTGAGCAA AAACAATTAA AGACAATTGA GGTCCAAGCT TTTTCGGATA ATGCCCTTAG TGAAGTAGTC TTACCGCCAA ATTTACAGAC TATTCGTGAA GAGGCTTTCA AAAGGAATCA TTTGAAAGAA GTGAAGGGTT CATCTACATT ATCTCAGATT ACTTTTAATG CTTTTGATCA AAATGATGGG GACAAACGCT TTGGTAAGAA AGTGGTTGTT AGGACACATA ATAATTCTCA TATGTTAGCA GATGGTGAGC GTTTTATCAT TGATCCAGAT AAGCTATCTT CTACAATGGT AGACCTTGAA AAGGTTTTAA AAATAATCGA AGGTTTAGAT TACTCTACAT TACGTCAGAC TACTCAAACT CAGTTTAGAG AAATGACTAC TGCAGGTAAA GCGTTGTTAT CAAAATCTAA CCTCCGACAA GGAGAAAAAC AAAAATTCCT TCAAGAAGCA CAATTTTTCC TTGGTCGCGT TGATTTGGAT AAAGCCATAG CTAAAGCTGA GAAGGCTTTA GTGACCAAGA AGGCAACAAA GAATGGTCAT TTGCTTGAGA GGAGTATTAA CAAAGCGGTA TTAGCTTATA ATAATAGTGC TATTAAAAAA GCTAATGTTA AGCGCTTGGA AAAAGAGTTA GACTTGCTGA CGGATTTAGT CGAGGGAAAA GGACCATTAG CGCAAGCTAC AATGGTACAA GGAGTTTATT TATTAAAGAC GCCTTTACCA TTGCCAGAAT ATTATATCGG ATTGAACGTT TATTTTGACA AGTCTGGAAA ATTGATTTAT GCACTTGATA TGAGTGATAC TATTGGCGAG GGACAAAAAG ATGCATATGG TAATCCTATA TTAAATGTTG ACGAGGATAA TGAAGGTTAT CATACCTTGG CAGTTGCCAC TTTAGCTGAT TATGAAGGTC TTTATATTAA AGATATTTTA AATAGTTCCC TTGATAAGAT TAAAGCAATA CGCCAGATTC CTTTGGCAAA ATATCATAGA TTAGGAATTT TCCAAGCTAT TCGAAATGCA GCGGCAGAAG CAGACCGATT GCTTCCTAAG ACACCTAAGG GGTACCTAAA TGAAGTCCCA AATTATCGTA AAAAACAAGT GGAGAAAAAT TTAAAACCAG TTGATTATAA AACGCCGATT TTTAATAAGG CTTTACCTAA TGAAAAGGTA GACGGTGATA GAGCGGCTAA AGGTCATAAT ATAAATGCGG AGACTAATAA TTCTGTAGCT GTAACACCAA TAAGGTCCGA GCAGCAATTA CATAAGTCAC AGTCTGATGT AAATTTACCT CAAACAAGTT CTAAAAATAA TTTTATATAC GAGATTCTAG GATACGTTAG TTTATGTTTG CTTTTCCTAG TAACTGCTGG GAAAAAAGGA AAACGAGCAA GAAAATAA
|
Protein sequence | MTKKHLKTLA LALTTVSVVT YSQEVYGLER EESVKQEQTQ SASEDDWFEE DNERKTNVSK ENSTVDETVS DLFSDGNSNN SSSKTESVVS DPKQVPKAKP EVTQEASNSS NDASKVEVPK QDTASKKETL ETSTWEAKDF VTRGDTLVGF SKSGINKLSQ TSHLVLPSHA ADGTQLTQVA SFAFTPDKKT AIAEYTSRLG ENGKPSRLDI DQKEIIDEGE IFNAYQLTKL TIPNGYKSIG QDAFVDNKNI AEVNLPESLE TISDYAFAHM SLKQVKLPDN LKVIGELAFF DNQIGGKLYL PRHLIKLAER AFKSNRIQTV EFLGSKLKVI GEASFQDNNL RNVMLPDGLE KIESEAFTGN PGDEHYNNQV VLRTRTGQNP HQLATENTYV NPDKSLWRAT PDMDYTKWLE EDFTYQKNSV TGFSNKGLQK VRRNKNLEIP KQHNGITITE IGDNAFRNVD FQSKTLRKYD LEEIKLPSTI RKIGAFAFQS NNLKSFEASE DLEEIKEGAF MNNRIGTLDL KDKLIKIGDA AFHINHIYAI VLPESVQEIG RSAFRQNGAL HLMFIGNKVK TIGEMAFLSN KLESVNLSEQ KQLKTIEVQA FSDNALSEVV LPPNLQTIRE EAFKRNHLKE VKGSSTLSQI TFNAFDQNDG DKRFGKKVVV RTHNNSHMLA DGERFIIDPD KLSSTMVDLE KVLKIIEGLD YSTLRQTTQT QFREMTTAGK ALLSKSNLRQ GEKQKFLQEA QFFLGRVDLD KAIAKAEKAL VTKKATKNGH LLERSINKAV LAYNNSAIKK ANVKRLEKEL DLLTDLVEGK GPLAQATMVQ GVYLLKTPLP LPEYYIGLNV YFDKSGKLIY ALDMSDTIGE GQKDAYGNPI LNVDEDNEGY HTLAVATLAD YEGLYIKDIL NSSLDKIKAI RQIPLAKYHR LGIFQAIRNA AAEADRLLPK TPKGYLNEVP NYRKKQVEKN LKPVDYKTPI FNKALPNEKV DGDRAAKGHN INAETNNSVA VTPIRSEQQL HKSQSDVNLP QTSSKNNFIY EILGYVSLCL LFLVTAGKKG KRARK
|
| |