Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SAG0782 |
Symbol | |
ID | 1013586 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptococcus agalactiae 2603V/R |
Kingdom | Bacteria |
Replicon accession | NC_004116 |
Strand | + |
Start bp | 779217 |
End bp | 781454 |
Gene Length | 2238 bp |
Protein Length | 745 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 637315970 |
Product | DNA internalization-related competence protein ComEC/Rec2 |
Protein accession | NP_687797 |
Protein GI | 22536946 |
COG category | [R] General function prediction only |
COG ID | [COG0658] Predicted membrane metal-binding protein [COG2333] Predicted hydrolase (metallo-beta-lactamase superfamily) |
TIGRFAM ID | [TIGR00360] ComEC/Rec2-related protein [TIGR00361] DNA internalization-related competence protein ComEC/Rec2 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTACAAT TGACTAAGTA TTTTCCTCTA AAACCTATTT ATTTAGCATT GTTGGTCTTC CAAATTTACT TACTAGTGTT TTCTTGGACA ATGCTTGGTT GTGCCTTTCT TTTATTTTCT TTTATTTTTC TGATTTATCA ATATGATCGT GAAACTATTT TTAAAACAAT AGCAATAGTA ATTTTTTTCT TATTTTATTT TTTATGGCAA AATCACAATA TGAATGTCCA ATATCAAAGA GTACCGAATC ATATTAGCCA GATTAAAGTG CGTATTGATA CTATTTCTAT CAATGGTGAT GTTTTATCAT TCCAGGCAGA TGCTTCAGGT AACACTTATC AAGCTTTTTA CACATTAAAA AATAAAAGTG AGAAAGATTA TTTTCAAAAT CTTGATAATA ATATAATGAT AATTGCAGAT ATCAAACTTG AAGAAGCAGA GGAGAGAAGG CATTTTAATG GCTTTGATTA TCGTCAGTAT TTAAAAAGAC ATGGAATTTA TCGTATCGCC AAAGTGACAA AGATAAAACA GATACGCTTA TTTCAACATA GGTCTTTCTT TGCTCTTATG TCTAAGTGGC GTAGAAGTGC AATTGTTATT AGTCAAACTT TTCCAAATCC TATGCGTCAC TATATGTCAG GGCTTTTGTT TGGATATCTA GATAAGACCT TTGATGACAT GTCCGATTTA TATAGTAGTC TAGGTATTAT ACATTTATTT GCTTTGTCAG GTATGCAAGT AGGTTTTTTT CTCGGTATTT TTCGTTATAT CTGTCTACGT ATTGGCTTAC GTCTAGACCA TGTTTGGTTA CTTCAAATAC CATTCTCGCT AATTTATGCT GGTTTAACAG GCTTTAGTAT CTCAGTCGTT AGGGCACTTA TTCAATCTTT ATTATCACAT AGCGGTGTCA AGAAAGATGA GAACTTTGCT CTCTGCTTGT TAATTTGTCT TATCTCCCTC CCCCACTCAC TTTTGACTAC GGGAGGAGTT CTTAGCTTTG CTTATGCTTT TATACTTACG ATGACCTCCT TTGATCATTT TTCGAGTATA AAAAAAGTAG CTATCGAATC TTTGACAGTC TCTGTAGGAA TTCTTCCCAT ACTAACCTAC TATTTTTCGG GTTTTCAACC AATATCAATT ATATTAACAG CACTTTTATC TTTTGCATTT GATATTATAT TTTTGCCTTT ATTAACTGTT ATATTTGTCT TATCGCCTAT CGTTAAATTA AGTTGTATTA ATAGTTTGTT TGAAATCCTA GAAGTGTTAT TAAAATGGAC TGGGCAACTG TTTCCAAGGC CACTTATTTT TGGAAAGCCC AGCCTTTTTC TTTTAATAGT CATGATTATA ATTTTGGGAT TACTTTATGA TTATTATCAT TCTAAATGTT TTCGTTATTG CTCCCTTCTT ATTATCTTTA CCTTGTTTTT TATCACTAAG AATCCAATTA CTAACGAGGT TGCGATTTTA GATGTTGGAC AGGGAGATAG TATTTTAGTG AGGGATTGGT TAGGAAAAAC AATTTTAATT GATACTGGGG GAAGGGTGAG ATTTGAACAG CCTGAAGAAT GGAAACAAAA AGTAAATCAG TCTAATGCTA AGAGAACGCT CATTCCTTAC TTGAAAAGCA GAGGTATTAG CAAGATAGAT GATTTAGTGA TAACTCATAC CGATACAGAT CATATGGGGG ATATGGAAGT TATCTCAAAG CATTTTAAAG TTGCACGTTT GATTACAAGT TCAGGTTCTT TAACGAATTC GCAGTACGTT AAGCATTTAT CAAAGATAGG TGTAGCGGTA AAATCTATAG AAGCCGGTGA TAAACTTGCT GTCATGGGAA GTTATTTACA AGTACTTTAC CCATGGCACA AGGGTGATGG AAAAAATAAT GATTCAATTG TTTTATATGG ACATTTATTA GGAAAAGGCT TCTTATTTAC CGGTGATTTG GAGGAAGAGG GAGAAAAGCA GTTATTAGAA GCTTATCCTA ATTTATCAGT AGATATCCTT AAAGCAGGAC ATCATGGTTC TAAGGGCTCA TCAAGTCTAT CCTTTCTGAA AAAGTTGTCT CCTAGTGTGG TTCTAGTTTC AGCTGGTAAA AATAATCGTT ACCAGCATCC TCATCAAGAG ACTTTACAAA GGTTCCAAAA GATTAAAAGC AAGATTTTCC GAACGGATCA ATCAGGTACA ATTAGGCTAA CAGGATGGTG GAAGTGGCAT ATTCAGACAG TTCGTTGA
|
Protein sequence | MLQLTKYFPL KPIYLALLVF QIYLLVFSWT MLGCAFLLFS FIFLIYQYDR ETIFKTIAIV IFFLFYFLWQ NHNMNVQYQR VPNHISQIKV RIDTISINGD VLSFQADASG NTYQAFYTLK NKSEKDYFQN LDNNIMIIAD IKLEEAEERR HFNGFDYRQY LKRHGIYRIA KVTKIKQIRL FQHRSFFALM SKWRRSAIVI SQTFPNPMRH YMSGLLFGYL DKTFDDMSDL YSSLGIIHLF ALSGMQVGFF LGIFRYICLR IGLRLDHVWL LQIPFSLIYA GLTGFSISVV RALIQSLLSH SGVKKDENFA LCLLICLISL PHSLLTTGGV LSFAYAFILT MTSFDHFSSI KKVAIESLTV SVGILPILTY YFSGFQPISI ILTALLSFAF DIIFLPLLTV IFVLSPIVKL SCINSLFEIL EVLLKWTGQL FPRPLIFGKP SLFLLIVMII ILGLLYDYYH SKCFRYCSLL IIFTLFFITK NPITNEVAIL DVGQGDSILV RDWLGKTILI DTGGRVRFEQ PEEWKQKVNQ SNAKRTLIPY LKSRGISKID DLVITHTDTD HMGDMEVISK HFKVARLITS SGSLTNSQYV KHLSKIGVAV KSIEAGDKLA VMGSYLQVLY PWHKGDGKNN DSIVLYGHLL GKGFLFTGDL EEEGEKQLLE AYPNLSVDIL KAGHHGSKGS SSLSFLKKLS PSVVLVSAGK NNRYQHPHQE TLQRFQKIKS KIFRTDQSGT IRLTGWWKWH IQTVR
|
| |