Gene SAG0782 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG0782 
Symbol 
ID1013586 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp779217 
End bp781454 
Gene Length2238 bp 
Protein Length745 aa 
Translation table11 
GC content33% 
IMG OID637315970 
ProductDNA internalization-related competence protein ComEC/Rec2 
Protein accessionNP_687797 
Protein GI22536946 
COG category[R] General function prediction only 
COG ID[COG0658] Predicted membrane metal-binding protein
[COG2333] Predicted hydrolase (metallo-beta-lactamase superfamily) 
TIGRFAM ID[TIGR00360] ComEC/Rec2-related protein
[TIGR00361] DNA internalization-related competence protein ComEC/Rec2 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTACAAT TGACTAAGTA TTTTCCTCTA AAACCTATTT ATTTAGCATT GTTGGTCTTC 
CAAATTTACT TACTAGTGTT TTCTTGGACA ATGCTTGGTT GTGCCTTTCT TTTATTTTCT
TTTATTTTTC TGATTTATCA ATATGATCGT GAAACTATTT TTAAAACAAT AGCAATAGTA
ATTTTTTTCT TATTTTATTT TTTATGGCAA AATCACAATA TGAATGTCCA ATATCAAAGA
GTACCGAATC ATATTAGCCA GATTAAAGTG CGTATTGATA CTATTTCTAT CAATGGTGAT
GTTTTATCAT TCCAGGCAGA TGCTTCAGGT AACACTTATC AAGCTTTTTA CACATTAAAA
AATAAAAGTG AGAAAGATTA TTTTCAAAAT CTTGATAATA ATATAATGAT AATTGCAGAT
ATCAAACTTG AAGAAGCAGA GGAGAGAAGG CATTTTAATG GCTTTGATTA TCGTCAGTAT
TTAAAAAGAC ATGGAATTTA TCGTATCGCC AAAGTGACAA AGATAAAACA GATACGCTTA
TTTCAACATA GGTCTTTCTT TGCTCTTATG TCTAAGTGGC GTAGAAGTGC AATTGTTATT
AGTCAAACTT TTCCAAATCC TATGCGTCAC TATATGTCAG GGCTTTTGTT TGGATATCTA
GATAAGACCT TTGATGACAT GTCCGATTTA TATAGTAGTC TAGGTATTAT ACATTTATTT
GCTTTGTCAG GTATGCAAGT AGGTTTTTTT CTCGGTATTT TTCGTTATAT CTGTCTACGT
ATTGGCTTAC GTCTAGACCA TGTTTGGTTA CTTCAAATAC CATTCTCGCT AATTTATGCT
GGTTTAACAG GCTTTAGTAT CTCAGTCGTT AGGGCACTTA TTCAATCTTT ATTATCACAT
AGCGGTGTCA AGAAAGATGA GAACTTTGCT CTCTGCTTGT TAATTTGTCT TATCTCCCTC
CCCCACTCAC TTTTGACTAC GGGAGGAGTT CTTAGCTTTG CTTATGCTTT TATACTTACG
ATGACCTCCT TTGATCATTT TTCGAGTATA AAAAAAGTAG CTATCGAATC TTTGACAGTC
TCTGTAGGAA TTCTTCCCAT ACTAACCTAC TATTTTTCGG GTTTTCAACC AATATCAATT
ATATTAACAG CACTTTTATC TTTTGCATTT GATATTATAT TTTTGCCTTT ATTAACTGTT
ATATTTGTCT TATCGCCTAT CGTTAAATTA AGTTGTATTA ATAGTTTGTT TGAAATCCTA
GAAGTGTTAT TAAAATGGAC TGGGCAACTG TTTCCAAGGC CACTTATTTT TGGAAAGCCC
AGCCTTTTTC TTTTAATAGT CATGATTATA ATTTTGGGAT TACTTTATGA TTATTATCAT
TCTAAATGTT TTCGTTATTG CTCCCTTCTT ATTATCTTTA CCTTGTTTTT TATCACTAAG
AATCCAATTA CTAACGAGGT TGCGATTTTA GATGTTGGAC AGGGAGATAG TATTTTAGTG
AGGGATTGGT TAGGAAAAAC AATTTTAATT GATACTGGGG GAAGGGTGAG ATTTGAACAG
CCTGAAGAAT GGAAACAAAA AGTAAATCAG TCTAATGCTA AGAGAACGCT CATTCCTTAC
TTGAAAAGCA GAGGTATTAG CAAGATAGAT GATTTAGTGA TAACTCATAC CGATACAGAT
CATATGGGGG ATATGGAAGT TATCTCAAAG CATTTTAAAG TTGCACGTTT GATTACAAGT
TCAGGTTCTT TAACGAATTC GCAGTACGTT AAGCATTTAT CAAAGATAGG TGTAGCGGTA
AAATCTATAG AAGCCGGTGA TAAACTTGCT GTCATGGGAA GTTATTTACA AGTACTTTAC
CCATGGCACA AGGGTGATGG AAAAAATAAT GATTCAATTG TTTTATATGG ACATTTATTA
GGAAAAGGCT TCTTATTTAC CGGTGATTTG GAGGAAGAGG GAGAAAAGCA GTTATTAGAA
GCTTATCCTA ATTTATCAGT AGATATCCTT AAAGCAGGAC ATCATGGTTC TAAGGGCTCA
TCAAGTCTAT CCTTTCTGAA AAAGTTGTCT CCTAGTGTGG TTCTAGTTTC AGCTGGTAAA
AATAATCGTT ACCAGCATCC TCATCAAGAG ACTTTACAAA GGTTCCAAAA GATTAAAAGC
AAGATTTTCC GAACGGATCA ATCAGGTACA ATTAGGCTAA CAGGATGGTG GAAGTGGCAT
ATTCAGACAG TTCGTTGA
 
Protein sequence
MLQLTKYFPL KPIYLALLVF QIYLLVFSWT MLGCAFLLFS FIFLIYQYDR ETIFKTIAIV 
IFFLFYFLWQ NHNMNVQYQR VPNHISQIKV RIDTISINGD VLSFQADASG NTYQAFYTLK
NKSEKDYFQN LDNNIMIIAD IKLEEAEERR HFNGFDYRQY LKRHGIYRIA KVTKIKQIRL
FQHRSFFALM SKWRRSAIVI SQTFPNPMRH YMSGLLFGYL DKTFDDMSDL YSSLGIIHLF
ALSGMQVGFF LGIFRYICLR IGLRLDHVWL LQIPFSLIYA GLTGFSISVV RALIQSLLSH
SGVKKDENFA LCLLICLISL PHSLLTTGGV LSFAYAFILT MTSFDHFSSI KKVAIESLTV
SVGILPILTY YFSGFQPISI ILTALLSFAF DIIFLPLLTV IFVLSPIVKL SCINSLFEIL
EVLLKWTGQL FPRPLIFGKP SLFLLIVMII ILGLLYDYYH SKCFRYCSLL IIFTLFFITK
NPITNEVAIL DVGQGDSILV RDWLGKTILI DTGGRVRFEQ PEEWKQKVNQ SNAKRTLIPY
LKSRGISKID DLVITHTDTD HMGDMEVISK HFKVARLITS SGSLTNSQYV KHLSKIGVAV
KSIEAGDKLA VMGSYLQVLY PWHKGDGKNN DSIVLYGHLL GKGFLFTGDL EEEGEKQLLE
AYPNLSVDIL KAGHHGSKGS SSLSFLKKLS PSVVLVSAGK NNRYQHPHQE TLQRFQKIKS
KIFRTDQSGT IRLTGWWKWH IQTVR