Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SAG1844 |
Symbol | |
ID | 1014654 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptococcus agalactiae 2603V/R |
Kingdom | Bacteria |
Replicon accession | NC_004116 |
Strand | - |
Start bp | 1841344 |
End bp | 1844079 |
Gene Length | 2736 bp |
Protein Length | 911 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 637317013 |
Product | hypothetical protein |
Protein accession | NP_688834 |
Protein GI | 22537983 |
COG category | [S] Function unknown |
COG ID | [COG5280] Phage-related minor tail protein |
TIGRFAM ID | [TIGR01760] phage tail tape measure protein, TP901 family, core region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00878784 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGAAA CGTTTGAAGG CTTATACGTC AAATTTGGTG CTAATACTGT TGAATTTGAT AGGTCTGTAA AAGGTATCAA CACTGCCTTA TCTAGTTTGA AAAAAGACTT CAATAACATC AACAGACAAT TGAAGATGGA TCCAGACAAT GTTGACTTGT TGAATCGTAA GTTGGTTAAC TTGCAAGAAC AGGCTCGTGT TGGTGCTATA AAAATTGCTG AACTCAAAAA GCAACAGAAG GCACTGGGAG AATCTGAAGT TGGGTCAGCA CAGTGGAATA AGCTTCAACT TGAAATTGCT AAGGTTGAAT CACAGATGAA GATTGTTGAT AAGGCAATGG AGTCAACAAA GAAACACATT GAAGATGTAG GAGACCCAAA GTCTATTCTG AATCTTAACA AAGAACTTGA TAATGTTGCT AAAGAGCTTG ATATTGTCAA TCAAAAGCTT GAGCTAGACC CTGACAATGT CGAACTAGCA GAGCAAAAAA TGAAACTACT TGGCAAACAG TCGGAATTGG CTGGGGATAA AGTCCAAGAA TTAAAGAAAA AACAAGCTGC CCTTGGCGAT GAGAAAATAG GTACAGAAGA ATGGCGTCAA CTTCAAAATG AAATCGGTCA AGCTGAAGTT GAAGTTCTAA AGATTGACCG TGCAATGGAC ATTCTTGGTG AGTCAAGCCG TTCTGCAACT GGAGACATCA AAGAGGCAAC CAGCTATTTA AGAGCTGATG TCATGATGGA TGTTGCAGAT AAGGCTGGTC AGATTGGCCA GAAAATGGTT GACGCTGGGA AAATGACAGT AGATGCTTGG TCTGAGATAG ATGAGGCTCT GGACACCGTC ACAACCAAAA CTGGTCTGAC TGGTGATGCC TTAGCAGAGC TTCAGGAAAT TGCTAAAGAC ATTGCTACTG GTATGCCTAC CAGCTTTCAG AATGCTGGTG ATGCCGTTGG GGAATTGAAT ACTCAGTTCG GTTTGACTGG GGAAAAGCTG AAATCAGCAT CTGAATTACT TATCAAGTAT GCTGAGATTA ACGAAACAGA CATTTCAAGC TCTGCCATTT CTGCAAAACA AGCTATTGAA GCTTACGGTT TGACAGCTGA AGACTTGGGA ATGGTCTTAG ACAATGTGAC CAAAGCCGCT CAAGATACAG GACAGTCAGT TGACACGATT GTTCAAAAAG CCATTGACGG TGCTCCTCAG ATTAAAGGTT TGGGACTTTC TTTTGAAGAA GGTGCTGCAC TGATCGGTAA GTTTGAGAAA AGCGGTGTGG ATTCATCTGC TGCTCTATCC TCTCTATCGA AAGCTGCTGT CATCTATGCT AAAGACGGTA AGACTCTGAC AGATGGATTG AATGAGACTG TTAGTGCTAT TCAAAATTCT ACTAGTGAGA CAGAGGCTTT AAGTATTGCC TCAGAAATCT TTGGTAGTAA GGCTGCTCCT AGAATGGTCG ATGCTATTCA GCGTGGTGCT TTTAGCTTTG ATGACTTAGC TGAAGCAGCT AAAAGTTCCT CTGGTACTGT CTCCACCACA TTTGATGAGA CGCTTGACCC AATAGATAAG TTGACTCAGT ATTCTAACCA AGCAAAAGAG GGAATGGCAG AACTTGGCGG TAAATTGCTT GAGACTGTCA TCCCAGCTTT AGAACCTTTG ATGGGTATGC TTGAATCTTC TGTCAATTGG TTTACTAGCC TAAACGAAAC TGATCAACAG ACTATCGTGA TTCTTGGCCT AGTTACAACT GCTGTGATGA TGTTGCTTGG TGCAATTGCA CCGCTGGTCA TCGCCATAGG GGCAATAGGT GCGCCTGTCG GAATTGTAGT GGCGGCAATA GTAGGGGCTA TTGCCGTCAT AACACTTATC ATCCAAGCAA TCATGAACTG GGGAGCCATA ACTGAATGGC TTCAGTCAAC GTGGGATTCT TGTGCTGCCT GGCTTTCTGA ATTGTGGACT AACATAGTCA CGACTGCCAC CACAGCGTGG TCAAATTTCA CTGCCTGGCT TTCTGGCCTT TGGTCTTCAG TAGTCTCAAC TGGACAGTCT TTGTGGTCTA GCTTTACTAG TTCCTTGTCC AATATTTTCT CAAGTTTGAT TACAGGTGCT CAGTCTCTGT GGTCAAGTTT CACTTCCACT CTTTCCAATT TGTGGTCTGG ACTGGTCTCA ACTGGGTCAA ATTTGTTTAA TAATTTGAGT AGCACGATTT CAGGAATTTT TAATGGGATA CTTTCAACAG CAAGCAATAT TTGGAATTCC ATAAAATCCA CTATTTCCAA TGCAATAGAT GGGGCGAAAA ATGCAGTGTC CAACGGGGTC AATGCCATCA AGAATCTGTT TAACTTCCAG ATTAAATGGC CTCATATTCC ACTACCTCAC TTCCGTGTGA GTGGTTCTGC TAACCCTCTG GATTGGCTAA AAGGTGGCTT ACCAAGTATC GGCATTGACT GGTATGCCAA GGGCGGTATC ATGACCAAAC CAACCCTATT TGGCATGAAT GGAAACCGTG CAATGGTTGG CGGTGAGGCT GGCGCTGAAG CCATCTTGCC ATTGAATAAG TCAACCCTGG GGGCAATTGG TCAAAGTATT GCTAACACGA TGAATACATC GAACAATATT AACGTTAACT TCTCTGGCGT CACTATCAGG GAAGAAGCTG ACCTTAACAG ACTAGCCAAC GTGGTTGGAA ATCGTATTGC TGAAGAATTG CAACGTAAAA CTAATTTGAG AGGAGGAATG GCATGA
|
Protein sequence | MTETFEGLYV KFGANTVEFD RSVKGINTAL SSLKKDFNNI NRQLKMDPDN VDLLNRKLVN LQEQARVGAI KIAELKKQQK ALGESEVGSA QWNKLQLEIA KVESQMKIVD KAMESTKKHI EDVGDPKSIL NLNKELDNVA KELDIVNQKL ELDPDNVELA EQKMKLLGKQ SELAGDKVQE LKKKQAALGD EKIGTEEWRQ LQNEIGQAEV EVLKIDRAMD ILGESSRSAT GDIKEATSYL RADVMMDVAD KAGQIGQKMV DAGKMTVDAW SEIDEALDTV TTKTGLTGDA LAELQEIAKD IATGMPTSFQ NAGDAVGELN TQFGLTGEKL KSASELLIKY AEINETDISS SAISAKQAIE AYGLTAEDLG MVLDNVTKAA QDTGQSVDTI VQKAIDGAPQ IKGLGLSFEE GAALIGKFEK SGVDSSAALS SLSKAAVIYA KDGKTLTDGL NETVSAIQNS TSETEALSIA SEIFGSKAAP RMVDAIQRGA FSFDDLAEAA KSSSGTVSTT FDETLDPIDK LTQYSNQAKE GMAELGGKLL ETVIPALEPL MGMLESSVNW FTSLNETDQQ TIVILGLVTT AVMMLLGAIA PLVIAIGAIG APVGIVVAAI VGAIAVITLI IQAIMNWGAI TEWLQSTWDS CAAWLSELWT NIVTTATTAW SNFTAWLSGL WSSVVSTGQS LWSSFTSSLS NIFSSLITGA QSLWSSFTST LSNLWSGLVS TGSNLFNNLS STISGIFNGI LSTASNIWNS IKSTISNAID GAKNAVSNGV NAIKNLFNFQ IKWPHIPLPH FRVSGSANPL DWLKGGLPSI GIDWYAKGGI MTKPTLFGMN GNRAMVGGEA GAEAILPLNK STLGAIGQSI ANTMNTSNNI NVNFSGVTIR EEADLNRLAN VVGNRIAEEL QRKTNLRGGM A
|
| |