Gene SAG1844 details

Gene Information

Locus tag: SAG1844
Symbol:
ID: 1014654
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptococcus agalactiae 2603V/R
Kingdom: Bacteria
Replicon accession: NC_004116
Strand:
Start bp: 1841344
End bp: 1844079
Gene Length: 2736 bp
Protein Length: 911 aa
Translation table: 11
GC content: 43%
IMG OID: 637317013
Product: hypothetical protein
Protein accession: NP_688834
Protein GI: 22537983
COG category: [S] Function unknown
COG ID: [COG5280] Phage-related minor tail protein
TIGRFAM ID: [TIGR01760] phage tail tape measure protein, TP901 family, core region
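
The coordinate and length fields above can be cross-checked against each other. The sketch below is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1841344
end_bp = 1844079

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and ends with a stop codon,
# which does not contribute an amino acid to the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, codon_count, protein_length_aa)
```

Under these assumptions the computed values agree with the listed Gene Length (2736 bp) and Protein Length (911 aa).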


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00878784
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTGAAA CGTTTGAAGG CTTATACGTC AAATTTGGTG CTAATACTGT TGAATTTGAT 
AGGTCTGTAA AAGGTATCAA CACTGCCTTA TCTAGTTTGA AAAAAGACTT CAATAACATC
AACAGACAAT TGAAGATGGA TCCAGACAAT GTTGACTTGT TGAATCGTAA GTTGGTTAAC
TTGCAAGAAC AGGCTCGTGT TGGTGCTATA AAAATTGCTG AACTCAAAAA GCAACAGAAG
GCACTGGGAG AATCTGAAGT TGGGTCAGCA CAGTGGAATA AGCTTCAACT TGAAATTGCT
AAGGTTGAAT CACAGATGAA GATTGTTGAT AAGGCAATGG AGTCAACAAA GAAACACATT
GAAGATGTAG GAGACCCAAA GTCTATTCTG AATCTTAACA AAGAACTTGA TAATGTTGCT
AAAGAGCTTG ATATTGTCAA TCAAAAGCTT GAGCTAGACC CTGACAATGT CGAACTAGCA
GAGCAAAAAA TGAAACTACT TGGCAAACAG TCGGAATTGG CTGGGGATAA AGTCCAAGAA
TTAAAGAAAA AACAAGCTGC CCTTGGCGAT GAGAAAATAG GTACAGAAGA ATGGCGTCAA
CTTCAAAATG AAATCGGTCA AGCTGAAGTT GAAGTTCTAA AGATTGACCG TGCAATGGAC
ATTCTTGGTG AGTCAAGCCG TTCTGCAACT GGAGACATCA AAGAGGCAAC CAGCTATTTA
AGAGCTGATG TCATGATGGA TGTTGCAGAT AAGGCTGGTC AGATTGGCCA GAAAATGGTT
GACGCTGGGA AAATGACAGT AGATGCTTGG TCTGAGATAG ATGAGGCTCT GGACACCGTC
ACAACCAAAA CTGGTCTGAC TGGTGATGCC TTAGCAGAGC TTCAGGAAAT TGCTAAAGAC
ATTGCTACTG GTATGCCTAC CAGCTTTCAG AATGCTGGTG ATGCCGTTGG GGAATTGAAT
ACTCAGTTCG GTTTGACTGG GGAAAAGCTG AAATCAGCAT CTGAATTACT TATCAAGTAT
GCTGAGATTA ACGAAACAGA CATTTCAAGC TCTGCCATTT CTGCAAAACA AGCTATTGAA
GCTTACGGTT TGACAGCTGA AGACTTGGGA ATGGTCTTAG ACAATGTGAC CAAAGCCGCT
CAAGATACAG GACAGTCAGT TGACACGATT GTTCAAAAAG CCATTGACGG TGCTCCTCAG
ATTAAAGGTT TGGGACTTTC TTTTGAAGAA GGTGCTGCAC TGATCGGTAA GTTTGAGAAA
AGCGGTGTGG ATTCATCTGC TGCTCTATCC TCTCTATCGA AAGCTGCTGT CATCTATGCT
AAAGACGGTA AGACTCTGAC AGATGGATTG AATGAGACTG TTAGTGCTAT TCAAAATTCT
ACTAGTGAGA CAGAGGCTTT AAGTATTGCC TCAGAAATCT TTGGTAGTAA GGCTGCTCCT
AGAATGGTCG ATGCTATTCA GCGTGGTGCT TTTAGCTTTG ATGACTTAGC TGAAGCAGCT
AAAAGTTCCT CTGGTACTGT CTCCACCACA TTTGATGAGA CGCTTGACCC AATAGATAAG
TTGACTCAGT ATTCTAACCA AGCAAAAGAG GGAATGGCAG AACTTGGCGG TAAATTGCTT
GAGACTGTCA TCCCAGCTTT AGAACCTTTG ATGGGTATGC TTGAATCTTC TGTCAATTGG
TTTACTAGCC TAAACGAAAC TGATCAACAG ACTATCGTGA TTCTTGGCCT AGTTACAACT
GCTGTGATGA TGTTGCTTGG TGCAATTGCA CCGCTGGTCA TCGCCATAGG GGCAATAGGT
GCGCCTGTCG GAATTGTAGT GGCGGCAATA GTAGGGGCTA TTGCCGTCAT AACACTTATC
ATCCAAGCAA TCATGAACTG GGGAGCCATA ACTGAATGGC TTCAGTCAAC GTGGGATTCT
TGTGCTGCCT GGCTTTCTGA ATTGTGGACT AACATAGTCA CGACTGCCAC CACAGCGTGG
TCAAATTTCA CTGCCTGGCT TTCTGGCCTT TGGTCTTCAG TAGTCTCAAC TGGACAGTCT
TTGTGGTCTA GCTTTACTAG TTCCTTGTCC AATATTTTCT CAAGTTTGAT TACAGGTGCT
CAGTCTCTGT GGTCAAGTTT CACTTCCACT CTTTCCAATT TGTGGTCTGG ACTGGTCTCA
ACTGGGTCAA ATTTGTTTAA TAATTTGAGT AGCACGATTT CAGGAATTTT TAATGGGATA
CTTTCAACAG CAAGCAATAT TTGGAATTCC ATAAAATCCA CTATTTCCAA TGCAATAGAT
GGGGCGAAAA ATGCAGTGTC CAACGGGGTC AATGCCATCA AGAATCTGTT TAACTTCCAG
ATTAAATGGC CTCATATTCC ACTACCTCAC TTCCGTGTGA GTGGTTCTGC TAACCCTCTG
GATTGGCTAA AAGGTGGCTT ACCAAGTATC GGCATTGACT GGTATGCCAA GGGCGGTATC
ATGACCAAAC CAACCCTATT TGGCATGAAT GGAAACCGTG CAATGGTTGG CGGTGAGGCT
GGCGCTGAAG CCATCTTGCC ATTGAATAAG TCAACCCTGG GGGCAATTGG TCAAAGTATT
GCTAACACGA TGAATACATC GAACAATATT AACGTTAACT TCTCTGGCGT CACTATCAGG
GAAGAAGCTG ACCTTAACAG ACTAGCCAAC GTGGTTGGAA ATCGTATTGC TGAAGAATTG
CAACGTAAAA CTAATTTGAG AGGAGGAATG GCATGA
 
Protein sequence
MTETFEGLYV KFGANTVEFD RSVKGINTAL SSLKKDFNNI NRQLKMDPDN VDLLNRKLVN 
LQEQARVGAI KIAELKKQQK ALGESEVGSA QWNKLQLEIA KVESQMKIVD KAMESTKKHI
EDVGDPKSIL NLNKELDNVA KELDIVNQKL ELDPDNVELA EQKMKLLGKQ SELAGDKVQE
LKKKQAALGD EKIGTEEWRQ LQNEIGQAEV EVLKIDRAMD ILGESSRSAT GDIKEATSYL
RADVMMDVAD KAGQIGQKMV DAGKMTVDAW SEIDEALDTV TTKTGLTGDA LAELQEIAKD
IATGMPTSFQ NAGDAVGELN TQFGLTGEKL KSASELLIKY AEINETDISS SAISAKQAIE
AYGLTAEDLG MVLDNVTKAA QDTGQSVDTI VQKAIDGAPQ IKGLGLSFEE GAALIGKFEK
SGVDSSAALS SLSKAAVIYA KDGKTLTDGL NETVSAIQNS TSETEALSIA SEIFGSKAAP
RMVDAIQRGA FSFDDLAEAA KSSSGTVSTT FDETLDPIDK LTQYSNQAKE GMAELGGKLL
ETVIPALEPL MGMLESSVNW FTSLNETDQQ TIVILGLVTT AVMMLLGAIA PLVIAIGAIG
APVGIVVAAI VGAIAVITLI IQAIMNWGAI TEWLQSTWDS CAAWLSELWT NIVTTATTAW
SNFTAWLSGL WSSVVSTGQS LWSSFTSSLS NIFSSLITGA QSLWSSFTST LSNLWSGLVS
TGSNLFNNLS STISGIFNGI LSTASNIWNS IKSTISNAID GAKNAVSNGV NAIKNLFNFQ
IKWPHIPLPH FRVSGSANPL DWLKGGLPSI GIDWYAKGGI MTKPTLFGMN GNRAMVGGEA
GAEAILPLNK STLGAIGQSI ANTMNTSNNI NVNFSGVTIR EEADLNRLAN VVGNRIAEEL
QRKTNLRGGM A
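
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the amino acid assignments of translation table 11, which match the standard genetic code (the tables differ in permitted start codons, not in codon meanings). The helper names `CODON_TABLE` and `translate` are hypothetical, and only the first and last rows of the gene sequence are used to keep the example short.

```python
# Build the codon table in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"   # codons starting with T
    "LLLLPPPPHHQQRRRR"   # codons starting with C
    "IIIMTTTTNNKKSSRR"   # codons starting with A
    "VVVVAAAADDEEGGGG"   # codons starting with G
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spacing used in the listing
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last rows of the gene sequence as listed above.
first_row = "ATGACTGAAA CGTTTGAAGG CTTATACGTC AAATTTGGTG CTAATACTGT TGAATTTGAT"
last_row = "CAACGTAAAA CTAATTTGAG AGGAGGAATG GCATGA"

print(translate(first_row))
print(translate(last_row))
```

The two printed fragments correspond to the first 20 residues and the last 11 residues of the listed protein sequence.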