Gene SAG0751 details


Gene Information

Locus tag: SAG0751
Symbol:
ID: 1013555
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptococcus agalactiae 2603V/R
Kingdom: Bacteria
Replicon accession: NC_004116
Strand:
Start bp: 744770
End bp: 745672
Gene Length: 903 bp
Protein Length: 300 aa
Translation table: 11
GC content: 34%
IMG OID: 637315939
Product: HAD superfamily hydrolase
Protein accession: NP_687766
Protein GI: 22536915
COG category: [R] General function prediction only
COG ID: [COG1011] Predicted hydrolase (HAD superfamily)
TIGRFAM ID: [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED
            [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E
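
The length fields in the Gene Information table can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. Neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 744770
END_BP = 745672


def gene_length(start_bp, end_bp):
    """Gene length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 903
print(protein_length(length_bp))  # 300
```

Under these assumptions both results agree with the reported Gene Length (903 bp) and Protein Length (300 aa).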


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTACTT CAATTGTTTT TGATGTTGAT GATACGATTT ACGATCAACA AGCACCTTAC 
CGTATTGCTG TGGAGAAATG CTTTCCTGAT TTTGATATGA GCGCTATTAA CCAAGCTTAT
ATTCGCTTTC GTCATTATTC AGATATTGGC TTTCCACGAG TAATGGCTGG TGAATGGACA
ACAGAATATT TTCGTTTCTG GCGATGCAAA GAAACGCTTT TAGAGTTTGG TTATCGTGAA
ATTGACGAAG CTACTGGTAT TTATTTTCAA GAAATATACG AACATGAACT TGAAAATATT
ACAATGCTTG ATGAGATGCG TATGACACTT GACTTTTTGA AATCAAAAAA TGTACCAATG
GGAATTATTA CAAATGGTCC TACGGAACAT CAATTGAAAA AAGTTAAAAA ATTGGGACTT
TATGACTATG TTGATCCAAA ACGTGTTATA GTTAGTCAAG CAACTGGTTT TCAAAAGCCT
GAAAAGGAGA TTTTTAATTT AGCAGCAGAG CAATTTGATA TGAACCCTTC AACTACACTT
TATGTGGGTG ATTCATATGA TAATGATATT ATGGGTGCAT TTAATGGTGG TTGGCATTCT
ATGTGGTTCA ACCATAGAGG ACGTTCTTTA AAACCGGGAA TTAAACCAGT TTATGATGTT
GCTATTGATA ACTTTGAGCA ATTATTCGGT GCTGTTAAAG TGTTGTTTGA CTTACCTGAT
AATAAATTTA TTTTTGATAT CAACGATAAA AGTAATCCAG TCCTTGAAAT GGGACTTAAT
AATGGTTTAA TGATGGCAGC AGAGCGTCTG CTTGAGAGTA ATATGAGCGT TGACAAAGTT
GTTATTTTAC TGCGCCTAAC TGCAAAACAA GAAAAAGTAT TACGCATGAA GTATGCTAGA
TAA
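
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any computation. The following is a minimal sketch of how the blocks could be cleaned and a GC fraction computed. The helper names are hypothetical, and only the first line of the sequence is used here as sample input.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(text):
    """Fraction of G and C bases in the sequence (0.0 for an empty sequence)."""
    seq = clean_sequence(text)
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied as printed.
first_line = "ATGATTACTT CAATTGTTTT TGATGTTGAT GATACGATTT ACGATCAACA AGCACCTTAC"

print(len(clean_sequence(first_line)))  # 60
print(gc_fraction(first_line))
```

Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field in the record.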
 
Protein sequence
MITSIVFDVD DTIYDQQAPY RIAVEKCFPD FDMSAINQAY IRFRHYSDIG FPRVMAGEWT 
TEYFRFWRCK ETLLEFGYRE IDEATGIYFQ EIYEHELENI TMLDEMRMTL DFLKSKNVPM
GIITNGPTEH QLKKVKKLGL YDYVDPKRVI VSQATGFQKP EKEIFNLAAE QFDMNPSTTL
YVGDSYDNDI MGAFNGGWHS MWFNHRGRSL KPGIKPVYDV AIDNFEQLFG AVKVLFDLPD
NKFIFDINDK SNPVLEMGLN NGLMMAAERL LESNMSVDKV VILLRLTAKQ EKVLRMKYAR
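
The protein sequence can be checked against the gene sequence by translating it. The sketch below builds a codon table by hand. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs in which start codons it permits. Since this gene begins with ATG, no special start-codon handling is included. The `translate` helper is illustrative and raises `KeyError` on ambiguous bases such as N.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with their one-letter amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding sequence from its first base, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # tolerate block-formatted input
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence, copied as printed.
print(translate("ATGATTACTT CAATTGTTTT TGATGTTGAT"))  # MITSIVFDVD
```

The output matches the first block of the protein sequence. Translating the full gene sequence should reproduce all 300 residues if the record is internally consistent.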