Gene SAG1552 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG1552 
Symbol 
ID1014361 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp1559887 
End bp1562046 
Gene Length2160 bp 
Protein Length719 aa 
Translation table11 
GC content35% 
IMG OID637316723 
Producthypothetical protein 
Protein accessionNP_688545 
Protein GI22537694 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.253159 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAT ATCTAAGTAT TGTAGCCCTT TTAACTATTT TGTTTAGCTC TATTTATTAT 
TTGTATTACT TTGATGGTAG TTTGTATTTA CCAAAGGGCT TATTAAAAGA AAATACAAGA
ACTAACTTTG TTGTTAAAGG TGATACTGTA CTTCACAAGC CCACCAATAA ACCTTTTGTT
GTTAAAGGAG TAGACGTTGA GTCTTCCTTA GCGGGTTATC ATCACAACGA TTTTCCTATT
ACTCAAAAAA CGTATCGTGA ATGGTTCCAT TTAATTTCCA ACATGGGGGC AAATACTGTA
AGAGTCAAGG TACCGATGAA TGTTGCATTT TACGATGCCT TATATCACCA CAACAAAGCA
TCAAAGAGGC CACTGTATTT GTTGCAAGGA ATACGTATAG ATTCTTATCG CAATAATGCT
TCTATAACAG CTTTTAATGA TAATTATAGG GGGTATTTAA AACGAGAAGC AAAAGGCGTT
GTGGATATTC TCCATGGGCG TAAGCAAGTA TGGAATACTG ATTTGGGTAG CCGTCATTAT
CATTATGATC TTAGTCCTTG GGTACTTGGT TATGTCGTAG GGGATGATTG GAATAGTGGT
ACTGTCGCTT ATACTAATCA TCAAGAGAAA AAAACGCAAT ATAAAGGACG TTATTTTAAA
ACTTCTGTGG CAGCTAATCC ATTTGAGGTC ATGCTAGCTC AAGTAATGGA TGAATTGACA
CATTATGAGA CAGCTAAATA TGGTTGGCAA CATTTGATTA GTTTTTCAAA CTCACCAACA
ACAGACCCTT TTCATTATCG AAAACCATTT GAGGCACAGG CTCCTAAATA CGTACAACTA
AATGTAGAAA ATATTCAAGC TAATTCAAAT GTTAAAGCAG GTATGTTTGC AGCATATAAA
GCTATTGATT TCCATCCTCG ATACAAGGAT TATCTATTAT TTGATAAAGA GAATATCAGT
AAAGAAGATA GACAAAAGAT TAAAGAACTT TCTTTGTCAC AGGGATACGT TAAACTGCTA
AATGCTTATC ACAAAATCCC TGTTCTAGTC ACGGGTTATG GCTATTCGAC AGCGAGAGGT
ATTGCCCAAA AAGAAATTGA TAAACGTCCT CTGCCGATTA ATGAAAAAGA ACAAGGTCAG
CGTTTACTAG AAGATTATGA ATCTTTTATA TCATCCGGTA GTTTTGGAGC GACTATCAAT
GCATGGCAAG ACGATTGGAA TGCAAGGGCG TGGAATACAT CTTTCGCCAC AAATAAACAT
AGTCAATTCC TATGGGGGGA TGCACAAGTA TTTAATCAAG GTTATGGTTT ATTAGGCTTT
AAAAACGCAA AACATCATTA TCAAGTTGAT GGTAAAAGAG GCAAAGGAGA GTGGAAACAT
CCTCTGATGA CTAGTGCAAC AGGAGATGAC TTATATGCTA GCAGTGATGA AAGCTATCTC
TACCTTGCGA TTAAAACAAA ACCTGAAAAA CTAAAAGAAA AACGATTATT ACCAATAGAT
ATTACACCAA AATCTGGTAG TAGAAAAATG AATGGTAGTA AGGTCACATT TTCTAAATCT
AGTGACTTTG TATTGTCTAT TGATCCAAAT GGCAAGTCTG AATTATTTGT CCAAGAGCGC
TATAATGCCT TAAAAGCGAA CTATCTTCGA CAGCTTAACG GTAAAGATTT TTATGCTTTC
CCACCAAAGA AGAACAGTAG TAATTTTGAG CAGATAAATA TGGTATTGAG AAATACAAAG
ATTGTTGAAG ACATGGAAAA AGTAAAAGCA ACAGAGAGGT TCTTACCAAC TCATCCTACT
GGTCTTCTCA AAACAGGAAC AACTGATAGG CACCAAAAAA CATTTGATTC ACAAACAGAT
ATTTCGTTTG GAAAGGACTT TATAGAGGTC AGAATTCCGT GGCAGTTGTT GAATTTTTCT
GATCCATCAT CTCAAAAAAT TCACGATGAT TACTTTAAAC ATTATGGTGT GAAGGAGTTA
GAAATTGAGA GCATTGCTTT AGGATTAGGT GCTAATAGCA AAGAAAACAC ACTGATAAAG
ATGGCAGATT ATCGTTTGAA AAATTGGGAG AGACCCGATA CCAAAACCTT TTTAAAAGAC
TCCTATTATA GTATTAAGAA AGAATGGTCT AAAGAAAGAG AGAGAACATA TGGTACATAA
 
Protein sequence
MKKYLSIVAL LTILFSSIYY LYYFDGSLYL PKGLLKENTR TNFVVKGDTV LHKPTNKPFV 
VKGVDVESSL AGYHHNDFPI TQKTYREWFH LISNMGANTV RVKVPMNVAF YDALYHHNKA
SKRPLYLLQG IRIDSYRNNA SITAFNDNYR GYLKREAKGV VDILHGRKQV WNTDLGSRHY
HYDLSPWVLG YVVGDDWNSG TVAYTNHQEK KTQYKGRYFK TSVAANPFEV MLAQVMDELT
HYETAKYGWQ HLISFSNSPT TDPFHYRKPF EAQAPKYVQL NVENIQANSN VKAGMFAAYK
AIDFHPRYKD YLLFDKENIS KEDRQKIKEL SLSQGYVKLL NAYHKIPVLV TGYGYSTARG
IAQKEIDKRP LPINEKEQGQ RLLEDYESFI SSGSFGATIN AWQDDWNARA WNTSFATNKH
SQFLWGDAQV FNQGYGLLGF KNAKHHYQVD GKRGKGEWKH PLMTSATGDD LYASSDESYL
YLAIKTKPEK LKEKRLLPID ITPKSGSRKM NGSKVTFSKS SDFVLSIDPN GKSELFVQER
YNALKANYLR QLNGKDFYAF PPKKNSSNFE QINMVLRNTK IVEDMEKVKA TERFLPTHPT
GLLKTGTTDR HQKTFDSQTD ISFGKDFIEV RIPWQLLNFS DPSSQKIHDD YFKHYGVKEL
EIESIALGLG ANSKENTLIK MADYRLKNWE RPDTKTFLKD SYYSIKKEWS KERERTYGT