Gene SAG0512 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG0512 
Symbol 
ID1013315 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp524382 
End bp525698 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content35% 
IMG OID637315714 
ProductHD domain-containing protein 
Protein accessionNP_687542 
Protein GI22536691 
COG category[R] General function prediction only 
COG ID[COG1078] HD superfamily phosphohydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGAAA AAGTATTTCG TGACCCCGTT CACACTTATA TCCATGTAAA TAATCAAGTC 
ATATATGACT TGATTAACAC TAAAGAATTT CAACGTTTAC GACGCATTAA ACAAACCTCA
ACGACTTCTT TTACCTTCCA TGGTGCAGAG CATAGTCGTT TTTCACATTG TCTCGGAGTT
TATGAACTGG CTCGTAAAGT TACAGAAATC TTTGATGAAC ATTATTCTGA TCTTTGGAAT
AAAAATGAGT CCCTGCTAAC GATGGCTGCC GCCCTACTAC ATGATATCGG ACATGGCGCA
TATTCTCATA CATTTGAGCG TCTCTTCAAT ACTGATCATG AAGCTTACAC TCAGGAAATC
ATCACCAATC CTACCACAGA GATTAATGCT ATTTTACGAA AGGTGGCTCC TGATTTTCCT
GATAAAGTAG CTAGTGTTAT TAATCATAGC TACCCCAATA AACAAGTTGT TCAATTAATT
TCTAGTCAAA TTGATTGTGA TCGAATGGAC TATCTTCTCA GAGATTCTTA CTACACTGCT
GCTAGCTATG GACAATTTGA TTTAACACGA ATTTTACGCG TTATTAGACC AACCGATTCT
GGAATTGCTT TTGCTCGTAA CGGGATGCAC GCTGTGGAAG ATTATATTGT TAGTCGCTTC
CAAATGTATA TGCAAGTTTA TTTCCATCCA GCCAGTCGTG CTATGGAATT ACTCTTACAA
AATTTACTAA AACGTGCTCG CTTCTTATTT GATACCCATC GCGATTTTTT TGAACAAACT
TCTCCAAATC TTATTCCTTT CTTCACGGAT CAATATGATT TACAAGATTA TTTAGCCCTA
GATGATGGGG TAATGAATAC TTACTTCCAA TCTTGGATGC AAGCTGATGA TAACATTTTG
GCAGATTTAG CTAATCGTTT TATCAATCGA AAAGTCTTTA AATCAATTAC CTTTGAAGAG
TCTGATAAAG AAAATCTAGT TAAAATGAAA GAACTAGTCT CACAGGTTGG TTTTGATCCC
GATTATTACA CTGGTGTCCA TGCTAATTTT GACTTACCTT ACGATGTTTA TCGGCCTGAA
CATTCAAATC CACGAACAGA AATTCAAATT ATCCAAAAGA ATGGACAACT TGCTGAATTA
TCAAGCTTAT CACCGATTGT AAAAGCATTA ACTGGATCTA ATTATGGTGA TCAACGTTTT
TATTTTCCAA AAGAAATGTT GACATTAGAT AGCCTATTTT CAAGTACAAA AGAAGAATTT
CAATCTTATA TCACAAATGA GCATTTGACA CTCACGAAAG ATAATAGCCA TTCATGA
 
Protein sequence
MNEKVFRDPV HTYIHVNNQV IYDLINTKEF QRLRRIKQTS TTSFTFHGAE HSRFSHCLGV 
YELARKVTEI FDEHYSDLWN KNESLLTMAA ALLHDIGHGA YSHTFERLFN TDHEAYTQEI
ITNPTTEINA ILRKVAPDFP DKVASVINHS YPNKQVVQLI SSQIDCDRMD YLLRDSYYTA
ASYGQFDLTR ILRVIRPTDS GIAFARNGMH AVEDYIVSRF QMYMQVYFHP ASRAMELLLQ
NLLKRARFLF DTHRDFFEQT SPNLIPFFTD QYDLQDYLAL DDGVMNTYFQ SWMQADDNIL
ADLANRFINR KVFKSITFEE SDKENLVKMK ELVSQVGFDP DYYTGVHANF DLPYDVYRPE
HSNPRTEIQI IQKNGQLAEL SSLSPIVKAL TGSNYGDQRF YFPKEMLTLD SLFSSTKEEF
QSYITNEHLT LTKDNSHS