Gene SAG1233 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG1233 
Symbol 
ID1014040 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp1241015 
End bp1243483 
Gene Length2469 bp 
Protein Length822 aa 
Translation table11 
GC content40% 
IMG OID637316414 
Productstreptococcal histidine triad family protein 
Protein accessionNP_688238 
Protein GI22537387 
COG category 
COG ID 
TIGRFAM ID[TIGR01363] streptococcal histidine triad protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00703595 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAGAAAA CATATGGTTA TATCGGCTCA GTTGCTGCTA TTTTACTAGC TACTCATATT 
GGAAGTTACC AGCTTGGTAA GCATCATATG GGTCTAGCAA CAAAGGACAA TCAGATTGCC
TATATTGATG ATAGCAAAGG TAAGGTAAAA GCCCCTAAAA CAAACAAAAC GATGGATCAA
ATCAGTGCTG AAGAAGGCAT CTCTGCTGAA CAGATCGTAG TCAAAATTAC TGACCAAGGT
TATGTTACCT CACACGGTGA CCATTATCAT TTTTACAATG GGAAAGTTCC TTATGATGCG
ATTATTAGTG AAGAGTTGTT GATGACGGAT CCTAATTACC ATTTTAAACA ATCAGACGTT
ATCAATGAAA TCTTAGACGG TTACGTTATT AAAGTCAATG GCAACTATTA TGTTTACCTC
AAGCCAGGTA GTAAGCGCAA AAACATTCGA ACCAAACAAC AAATTGCTGA GCAAGTAGCC
AAAGGAACTA AAGAAGCTAA AGAAAAAGGT TTAGCTCAAG TGGCCCATCT CAGTAAAGAA
GAAGTTGCGG CAGTCAATGA AGCAAAAAGA CAAGGACGCT ATACTACAGA CGATGGCTAT
ATTTTTAGTC CGACAGATAT CATTGATGAT TTAGGAGATG CTTATTTAGT ACCTCATGGT
AATCACTATC ATTATATTCC TAAAAAAGAT TTGTCTCCAA GTGAGCTAGC TGCTGCACAA
GCCTACTGGA GTCAAAAACA AGGTCGAGGT GCTAGACCGT CTGATTACCG CCCGACACCA
GCCCCAGGTC GTAGGAAAGC CCCAATTCCT GATGTGACGC CTAACCCTGG ACAAGGTCAT
CAGCCAGATA ACGGTGGTTA TCATCCAGCG CCTCCTAGGC CAAATGATGC GTCACAAAAC
AAACACCAAA GAGATGAGTT TAAAGGAAAA ACCTTTAAGG AACTTTTAGA TCAACTACAC
CGTCTTGATT TGAAATACCG TCATGTGGAA GAAGATGGGT TGATTTTTGA ACCGACTCAA
GTGATCAAAT CAAACGCTTT TGGGTATGTG GTGCCTCATG GAGATCATTA TCATATTATC
CCAAGAAGTC AGTTATCACC TCTTGAAATG GAATTAGCAG ATCGATACTT AGCCGGCCAA
ACTGATGACA ACGACTCAGG TTCAGATCAC TCAAAACCAT CAGATAAAGA AGTGACACAT
ACCTTTCTTG GTCATCGCAT CAAAGCTTAC GGAAAAGGCT TAGATGGTAA ACCATATGAT
ACGAGTGATG CTTATGTTTT TAGTAAAGAA TCCATTCATT CAGTGGATAA ATCAGGAGTT
ACAGCTAAAC ACGGAGATCA TTTCCATTAT ATAGGATTTG GAGAACTTGA ACAATATGAG
TTGGATGAGG TCGCTAACTG GGTGAAAGCA AAAGGTCAAG CTGATGAGCT TGTTGCTGCT
TTGGATCAGG AACAAGGCAA AGAAAAACCA CTCTTTGACA CTAAAAAAGT GAGTCGCAAA
GTAACAAAAG ATGGTAAAGT GGGCTATATT ATGCCAAAAG ATGGCAAGGA CTATTTCTAT
GCTCGTTATC AACTTGATTT GACTCAGATT GCCTTTGCCG AACAAGAACT AATGCTTAAA
GATAAGAAGC ATTACCGTTA TGACATTGTT GATACAGGCA TTGAGCCACG ACTTGCTGTA
GATTTGTCAA GTCTGCCGAT GCATGCTGGT AATGCTACTT ACGATACTGG AAGTTCGTTT
GTTATCCCAC ATATTGATCA TATCCATGTC GTTCCGTATT CATGGTTGAC GCGCAATCAG
ATTGCAACAA TCAAGTATGT GATGCAACAC CCCGAAGTTC GTCCGGATGT ATGGTCTAAG
CCAGGGCATG AAGAGTCAGG TTCGGTCATT CCAAATGTTA CGCCTCTTGA TAAACGTGCT
GGTATGCCAA ACTGGCAAAT TATCCATTCT GCTGAAGAAG TTCAAAAAGC CCTAGCAGAA
GGTCGTTTTG CAGCACCAGA CGGCTATATT TTCGATCCAC GAGATGTTTT GGCAAAAGAA
ACTTTTGTAT GGAAAGATGG CTCCTTTAGC ATCCCAAGAG CAGATGGCAG TTCATTGAGA
ACCATTAATA AATCTGATCT ATCCCAAGCT GAGTGGCAAC AAGCTCAAGA GTTATTGGCA
AAGAAAAATG CTGGTGATGC TACTGATACG GATAAACCTG AAGAAAAGCA ACAGGCAGAT
AAGAGCAATG AAAACCAACA GCCAAGTGAA GCCAGTAAAG AAGAAAAAGA ATCAGATGAC
TTTATAGACA GTTTACCAGA CTATGGTCTA GATAGAGCAA CCCTAGAAGA TCATATCAAT
CAATTAGCAC AAAAAGCTAA TATCGATCCT AAGTATCTCA TTTTCCAACC AGAAGGTGTC
CAATTTTATA ATAAAAATGG TGAATTGGTA ACTTATGATA TCAAGACACT TCAACAAATA
AACCCTTAA
 
Protein sequence
MKKTYGYIGS VAAILLATHI GSYQLGKHHM GLATKDNQIA YIDDSKGKVK APKTNKTMDQ 
ISAEEGISAE QIVVKITDQG YVTSHGDHYH FYNGKVPYDA IISEELLMTD PNYHFKQSDV
INEILDGYVI KVNGNYYVYL KPGSKRKNIR TKQQIAEQVA KGTKEAKEKG LAQVAHLSKE
EVAAVNEAKR QGRYTTDDGY IFSPTDIIDD LGDAYLVPHG NHYHYIPKKD LSPSELAAAQ
AYWSQKQGRG ARPSDYRPTP APGRRKAPIP DVTPNPGQGH QPDNGGYHPA PPRPNDASQN
KHQRDEFKGK TFKELLDQLH RLDLKYRHVE EDGLIFEPTQ VIKSNAFGYV VPHGDHYHII
PRSQLSPLEM ELADRYLAGQ TDDNDSGSDH SKPSDKEVTH TFLGHRIKAY GKGLDGKPYD
TSDAYVFSKE SIHSVDKSGV TAKHGDHFHY IGFGELEQYE LDEVANWVKA KGQADELVAA
LDQEQGKEKP LFDTKKVSRK VTKDGKVGYI MPKDGKDYFY ARYQLDLTQI AFAEQELMLK
DKKHYRYDIV DTGIEPRLAV DLSSLPMHAG NATYDTGSSF VIPHIDHIHV VPYSWLTRNQ
IATIKYVMQH PEVRPDVWSK PGHEESGSVI PNVTPLDKRA GMPNWQIIHS AEEVQKALAE
GRFAAPDGYI FDPRDVLAKE TFVWKDGSFS IPRADGSSLR TINKSDLSQA EWQQAQELLA
KKNAGDATDT DKPEEKQQAD KSNENQQPSE ASKEEKESDD FIDSLPDYGL DRATLEDHIN
QLAQKANIDP KYLIFQPEGV QFYNKNGELV TYDIKTLQQI NP