Gene SAG1038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG1038 
Symbol 
ID1013842 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp1044617 
End bp1047628 
Gene Length3012 bp 
Protein Length1003 aa 
Translation table11 
GC content33% 
IMG OID637316221 
Productphage infection protein, putative 
Protein accessionNP_688048 
Protein GI22537197 
COG category[S] Function unknown 
COG ID[COG1511] Predicted membrane protein 
TIGRFAM ID[TIGR03061] YhgE/Pip N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.687309 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAGAA ATAACTTTAG AATACTTTGG TATATTATAG CTGTTGCTCT CTTTTTAGTA 
GCTATTGCTG GCCTTAATCT TAAGCTTCAA GGCGATCATG CAAAAGAGAA TAAAACAACA
CAGTCAGCTA CTAACACTAA ACTAAATATA GCATTAGTCA ACGAAGATCA AAATGTATCT
AATGGTAAAG AAAGTTATAA TTTAGGAGCT AGTTATATTA AGTCTATCGA ACGTGATAAT
AGTCAAAATT GGTCAGTTGT TAGTCGAGGT ACTGCTCAAA ATGGATTAGA CAAAGGTGAT
TATCAATTAA TGGTTATCAT CCCTAATAAC TTTTCTCAAA AACTTCTTGA TGTCAATAAA
GCAAATGCAG AGCAAACAAC TATCTCTTAT AAGGTTAATG CCAAGGGGAA TTTAGCATTA
GAGAAAAAGG CAACTGAAAA AGAGAAAGAT ATTGTTTCAG AGTTAAATAG CCACTTAGTA
AATATGTACA TGGCAAGTAT TTTAAGTAAC TTATATACAG CTCAAGAAAA TGTACAGGCC
ATGGTAAATG TTCAGTCAGG TAATATCTCA AATTACCAAA AAAATCTTTT AGATTCTGCA
ACTAATTTCC AAAATATCTT TCCAGCCCTC GTTAACCAGT CTAGTAGTTC CATTACTGCT
AACGAATCAC TGAAAAAATC TTTAGAAGCT TCTGATAACA TGTTTAATGA TTTGGTGACA
ACCCAGACAA ATACTGGAAA AGATTTATCA AGCTTAATAG AACAGCGCCA TCAAGATAGC
ATTTCGTATG AAGCGTTTTC GACCTCATTA CTAGAAATGA ATAACGAGTT ATTAGAGAAG
CAATTATCTG ATATTATCAC ACAAGCACAA AAAGACCAAG AAACATTATC ATCACAACTC
AATAGTATTA TGGGTGATGA TAACAATCAT AATCATAAAG AAAACTCATC AGCTTATCTA
AATGTTGCAA GGCAAAAAAT CCAAGAACTA TCTGAAGCAC TCAAGTCACA AGATAACATT
GCTAAAGATC AAAGTGAACA ACTAGATAAA ATTGTTAGAG AGGGACTAGC AAGTTACTTT
GCTAAGAATA ATAAAGATAA TATTACTTTA TTAGAATTAT TGAAGAGTCA TTCTACTAAT
GAGAAGACTT TGAAAGATTT TAAAGCTAAG GTAGCAGATT TCACAAATTC TCTCATTTCA
AGCATTCCTT CACTAAACCT TTCGGAGTTG CACTTAACCC AAGAAGAAGA AAAAGCCATT
CAGTTTACAT CTTCTGATTC GGAAATCATT AAAAAAGTAA GCAACGAAAG AACTCTGTAT
TTAAATACTA ATTTATTGAA TCGTCTTTTT GAAGCTAGGA AAAATAGAGA TGAAGCTAAA
AATAAAGTTA ATCAATTAAA ATTGTCATCT AGTTCTACTC GAACTGGTGA GCAGATAGTT
TCTGTTGAAT CAAATAATCC AGATTACAGG GTAGATACCT GGACGGTTAA TGGTAAGCAA
ACAAGAACCT TGGACCCCAG CCAAAACAAT AATATTATTA TCAATAGCAG TTATCAAAGG
GAATCATCTA ACTCTAGTAT TACGAAACCA AGCTATACAA TCACTATCGG GAATCAGAAA
CAAATTGTAC AAAATGATGA CGGTAAAATC CAACGGTCTT ACTTTGAGGC AGAGGCAACG
TATCAACGAA CTCTACAAGA AGTAAATGAT GCATATAACA CCACTCACGG ATTAGTAGCT
AAATACTACA TTATTTCTGA TGGCGAAGAA CCGCAAAATT TATTTGATCA ATTTTTAAAT
CAAAGTGTTA ACGATACGAT GGTTGATCTG GTCAAAAATG GTATAACTAA GTATTTGATG
GATGAAAACA CTGCTGATGC GCAACAGAAA GTTAAAGATG TCATGGAGGA AGTTGAAAAT
AGTCAAGATG AATTGGCGGA TCAAATGGCA AAAGTAACTG AGACAAATGT ACGACTAACA
GATGCAATCA AAAAACAGCT TGAAACCCTT CAATCAATTA ACATGAAGGT CCAAAACATC
ACACAGGATC AATCTAAAGT AAACGACTCA CAGAAAACAA CTGATCAACA ATTATCTGAT
TTGAAGAACC AGCTAGATGG GCTAATGACC TCTGCAGCGG GCGTAAAAGA TATGTCCAAA
TCTAATAGTC AAGAGGCGGA CCAAGTTAAT CAAATATTCA CATCATTCAA TAAAGATGTT
CAAGATGCTA AAAATTCGGG CAACAAACTT TCGACGGATG CAACCGATTT AATGGCTAAC
TTCCAAAAAG AATTGGCCAA TAATGGTGAT TTCGTAGCTT CTTTCTCTAA AGTTTTTAAT
ACTGCTTATA AAAATGGAGT GCCAAATGAT ATTCTCCTTA ATTTCTTATC ACGACCTGTC
GCTGAATCAG CGTCTGCAGT TAGGGCAACT GAAAATACTT ATCGCCCATT CACTTGGATA
CTATTATTAG AAGTTGTTAG CCTATTTACA GCATATATCT TTGCGACTCA AAACCTTATT
AAGAAATTGA CAGATAGGTA TAATGTCAAC CGCTGGCTTC AAACAGACTT TTTAAATGTC
ATTGTCATTT CAGGCCTCTC TCTTGTTATT GGTTTAGCAC TGGGAGTTAT CTCAAGCAGA
AGTCTTCATG TCATGCCTGA ATATGTACCA TCTTGGTTCC TAGTTATGAC TCTGTTTAGT
TTTTTATTAA TTCATAGTCA GTATTTCTTT ATTAAAAATT TTAAAGCAGT TGGTATGGGT
TTAGCTTTGT TTATGATTAT TAGTTTTGTG TATCTATCAA ATGCCGTAGG CACAGTAGCG
ACTGTTAGTG GACTTCCAAA ATTACTAAAA GCTATTAATC CACTATCTAT CCTTGAAAAT
CAATTATCAT CTTATTTTGA TAATGTGACA ACTGGATTTA TTTTCCTTAT CTTAGTACTT
CTTGTGGATG TTGCTTTTAT TATCATGAAT ATTTTTATCA CCTTAAATTT TGAAGCTAAA
GTAAAAGAGT GA
 
Protein sequence
MKRNNFRILW YIIAVALFLV AIAGLNLKLQ GDHAKENKTT QSATNTKLNI ALVNEDQNVS 
NGKESYNLGA SYIKSIERDN SQNWSVVSRG TAQNGLDKGD YQLMVIIPNN FSQKLLDVNK
ANAEQTTISY KVNAKGNLAL EKKATEKEKD IVSELNSHLV NMYMASILSN LYTAQENVQA
MVNVQSGNIS NYQKNLLDSA TNFQNIFPAL VNQSSSSITA NESLKKSLEA SDNMFNDLVT
TQTNTGKDLS SLIEQRHQDS ISYEAFSTSL LEMNNELLEK QLSDIITQAQ KDQETLSSQL
NSIMGDDNNH NHKENSSAYL NVARQKIQEL SEALKSQDNI AKDQSEQLDK IVREGLASYF
AKNNKDNITL LELLKSHSTN EKTLKDFKAK VADFTNSLIS SIPSLNLSEL HLTQEEEKAI
QFTSSDSEII KKVSNERTLY LNTNLLNRLF EARKNRDEAK NKVNQLKLSS SSTRTGEQIV
SVESNNPDYR VDTWTVNGKQ TRTLDPSQNN NIIINSSYQR ESSNSSITKP SYTITIGNQK
QIVQNDDGKI QRSYFEAEAT YQRTLQEVND AYNTTHGLVA KYYIISDGEE PQNLFDQFLN
QSVNDTMVDL VKNGITKYLM DENTADAQQK VKDVMEEVEN SQDELADQMA KVTETNVRLT
DAIKKQLETL QSINMKVQNI TQDQSKVNDS QKTTDQQLSD LKNQLDGLMT SAAGVKDMSK
SNSQEADQVN QIFTSFNKDV QDAKNSGNKL STDATDLMAN FQKELANNGD FVASFSKVFN
TAYKNGVPND ILLNFLSRPV AESASAVRAT ENTYRPFTWI LLLEVVSLFT AYIFATQNLI
KKLTDRYNVN RWLQTDFLNV IVISGLSLVI GLALGVISSR SLHVMPEYVP SWFLVMTLFS
FLLIHSQYFF IKNFKAVGMG LALFMIISFV YLSNAVGTVA TVSGLPKLLK AINPLSILEN
QLSSYFDNVT TGFIFLILVL LVDVAFIIMN IFITLNFEAK VKE