Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SAG1038 |
Symbol | |
ID | 1013842 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptococcus agalactiae 2603V/R |
Kingdom | Bacteria |
Replicon accession | NC_004116 |
Strand | - |
Start bp | 1044617 |
End bp | 1047628 |
Gene Length | 3012 bp |
Protein Length | 1003 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 637316221 |
Product | phage infection protein, putative |
Protein accession | NP_688048 |
Protein GI | 22537197 |
COG category | [S] Function unknown |
COG ID | [COG1511] Predicted membrane protein |
TIGRFAM ID | [TIGR03061] YhgE/Pip N-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.687309 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAGAA ATAACTTTAG AATACTTTGG TATATTATAG CTGTTGCTCT CTTTTTAGTA GCTATTGCTG GCCTTAATCT TAAGCTTCAA GGCGATCATG CAAAAGAGAA TAAAACAACA CAGTCAGCTA CTAACACTAA ACTAAATATA GCATTAGTCA ACGAAGATCA AAATGTATCT AATGGTAAAG AAAGTTATAA TTTAGGAGCT AGTTATATTA AGTCTATCGA ACGTGATAAT AGTCAAAATT GGTCAGTTGT TAGTCGAGGT ACTGCTCAAA ATGGATTAGA CAAAGGTGAT TATCAATTAA TGGTTATCAT CCCTAATAAC TTTTCTCAAA AACTTCTTGA TGTCAATAAA GCAAATGCAG AGCAAACAAC TATCTCTTAT AAGGTTAATG CCAAGGGGAA TTTAGCATTA GAGAAAAAGG CAACTGAAAA AGAGAAAGAT ATTGTTTCAG AGTTAAATAG CCACTTAGTA AATATGTACA TGGCAAGTAT TTTAAGTAAC TTATATACAG CTCAAGAAAA TGTACAGGCC ATGGTAAATG TTCAGTCAGG TAATATCTCA AATTACCAAA AAAATCTTTT AGATTCTGCA ACTAATTTCC AAAATATCTT TCCAGCCCTC GTTAACCAGT CTAGTAGTTC CATTACTGCT AACGAATCAC TGAAAAAATC TTTAGAAGCT TCTGATAACA TGTTTAATGA TTTGGTGACA ACCCAGACAA ATACTGGAAA AGATTTATCA AGCTTAATAG AACAGCGCCA TCAAGATAGC ATTTCGTATG AAGCGTTTTC GACCTCATTA CTAGAAATGA ATAACGAGTT ATTAGAGAAG CAATTATCTG ATATTATCAC ACAAGCACAA AAAGACCAAG AAACATTATC ATCACAACTC AATAGTATTA TGGGTGATGA TAACAATCAT AATCATAAAG AAAACTCATC AGCTTATCTA AATGTTGCAA GGCAAAAAAT CCAAGAACTA TCTGAAGCAC TCAAGTCACA AGATAACATT GCTAAAGATC AAAGTGAACA ACTAGATAAA ATTGTTAGAG AGGGACTAGC AAGTTACTTT GCTAAGAATA ATAAAGATAA TATTACTTTA TTAGAATTAT TGAAGAGTCA TTCTACTAAT GAGAAGACTT TGAAAGATTT TAAAGCTAAG GTAGCAGATT TCACAAATTC TCTCATTTCA AGCATTCCTT CACTAAACCT TTCGGAGTTG CACTTAACCC AAGAAGAAGA AAAAGCCATT CAGTTTACAT CTTCTGATTC GGAAATCATT AAAAAAGTAA GCAACGAAAG AACTCTGTAT TTAAATACTA ATTTATTGAA TCGTCTTTTT GAAGCTAGGA AAAATAGAGA TGAAGCTAAA AATAAAGTTA ATCAATTAAA ATTGTCATCT AGTTCTACTC GAACTGGTGA GCAGATAGTT TCTGTTGAAT CAAATAATCC AGATTACAGG GTAGATACCT GGACGGTTAA TGGTAAGCAA ACAAGAACCT TGGACCCCAG CCAAAACAAT AATATTATTA TCAATAGCAG TTATCAAAGG GAATCATCTA ACTCTAGTAT TACGAAACCA AGCTATACAA TCACTATCGG GAATCAGAAA CAAATTGTAC AAAATGATGA CGGTAAAATC CAACGGTCTT ACTTTGAGGC AGAGGCAACG TATCAACGAA CTCTACAAGA AGTAAATGAT GCATATAACA CCACTCACGG ATTAGTAGCT AAATACTACA TTATTTCTGA TGGCGAAGAA CCGCAAAATT TATTTGATCA ATTTTTAAAT CAAAGTGTTA ACGATACGAT GGTTGATCTG GTCAAAAATG GTATAACTAA GTATTTGATG GATGAAAACA CTGCTGATGC GCAACAGAAA GTTAAAGATG TCATGGAGGA AGTTGAAAAT AGTCAAGATG AATTGGCGGA TCAAATGGCA AAAGTAACTG AGACAAATGT ACGACTAACA GATGCAATCA AAAAACAGCT TGAAACCCTT CAATCAATTA ACATGAAGGT CCAAAACATC ACACAGGATC AATCTAAAGT AAACGACTCA CAGAAAACAA CTGATCAACA ATTATCTGAT TTGAAGAACC AGCTAGATGG GCTAATGACC TCTGCAGCGG GCGTAAAAGA TATGTCCAAA TCTAATAGTC AAGAGGCGGA CCAAGTTAAT CAAATATTCA CATCATTCAA TAAAGATGTT CAAGATGCTA AAAATTCGGG CAACAAACTT TCGACGGATG CAACCGATTT AATGGCTAAC TTCCAAAAAG AATTGGCCAA TAATGGTGAT TTCGTAGCTT CTTTCTCTAA AGTTTTTAAT ACTGCTTATA AAAATGGAGT GCCAAATGAT ATTCTCCTTA ATTTCTTATC ACGACCTGTC GCTGAATCAG CGTCTGCAGT TAGGGCAACT GAAAATACTT ATCGCCCATT CACTTGGATA CTATTATTAG AAGTTGTTAG CCTATTTACA GCATATATCT TTGCGACTCA AAACCTTATT AAGAAATTGA CAGATAGGTA TAATGTCAAC CGCTGGCTTC AAACAGACTT TTTAAATGTC ATTGTCATTT CAGGCCTCTC TCTTGTTATT GGTTTAGCAC TGGGAGTTAT CTCAAGCAGA AGTCTTCATG TCATGCCTGA ATATGTACCA TCTTGGTTCC TAGTTATGAC TCTGTTTAGT TTTTTATTAA TTCATAGTCA GTATTTCTTT ATTAAAAATT TTAAAGCAGT TGGTATGGGT TTAGCTTTGT TTATGATTAT TAGTTTTGTG TATCTATCAA ATGCCGTAGG CACAGTAGCG ACTGTTAGTG GACTTCCAAA ATTACTAAAA GCTATTAATC CACTATCTAT CCTTGAAAAT CAATTATCAT CTTATTTTGA TAATGTGACA ACTGGATTTA TTTTCCTTAT CTTAGTACTT CTTGTGGATG TTGCTTTTAT TATCATGAAT ATTTTTATCA CCTTAAATTT TGAAGCTAAA GTAAAAGAGT GA
|
Protein sequence | MKRNNFRILW YIIAVALFLV AIAGLNLKLQ GDHAKENKTT QSATNTKLNI ALVNEDQNVS NGKESYNLGA SYIKSIERDN SQNWSVVSRG TAQNGLDKGD YQLMVIIPNN FSQKLLDVNK ANAEQTTISY KVNAKGNLAL EKKATEKEKD IVSELNSHLV NMYMASILSN LYTAQENVQA MVNVQSGNIS NYQKNLLDSA TNFQNIFPAL VNQSSSSITA NESLKKSLEA SDNMFNDLVT TQTNTGKDLS SLIEQRHQDS ISYEAFSTSL LEMNNELLEK QLSDIITQAQ KDQETLSSQL NSIMGDDNNH NHKENSSAYL NVARQKIQEL SEALKSQDNI AKDQSEQLDK IVREGLASYF AKNNKDNITL LELLKSHSTN EKTLKDFKAK VADFTNSLIS SIPSLNLSEL HLTQEEEKAI QFTSSDSEII KKVSNERTLY LNTNLLNRLF EARKNRDEAK NKVNQLKLSS SSTRTGEQIV SVESNNPDYR VDTWTVNGKQ TRTLDPSQNN NIIINSSYQR ESSNSSITKP SYTITIGNQK QIVQNDDGKI QRSYFEAEAT YQRTLQEVND AYNTTHGLVA KYYIISDGEE PQNLFDQFLN QSVNDTMVDL VKNGITKYLM DENTADAQQK VKDVMEEVEN SQDELADQMA KVTETNVRLT DAIKKQLETL QSINMKVQNI TQDQSKVNDS QKTTDQQLSD LKNQLDGLMT SAAGVKDMSK SNSQEADQVN QIFTSFNKDV QDAKNSGNKL STDATDLMAN FQKELANNGD FVASFSKVFN TAYKNGVPND ILLNFLSRPV AESASAVRAT ENTYRPFTWI LLLEVVSLFT AYIFATQNLI KKLTDRYNVN RWLQTDFLNV IVISGLSLVI GLALGVISSR SLHVMPEYVP SWFLVMTLFS FLLIHSQYFF IKNFKAVGMG LALFMIISFV YLSNAVGTVA TVSGLPKLLK AINPLSILEN QLSSYFDNVT TGFIFLILVL LVDVAFIIMN IFITLNFEAK VKE
|
| |