Gene SAG1678 details


Gene Information

Locus tag: SAG1678
Symbol:
ID: 1014487
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptococcus agalactiae 2603V/R
Kingdom: Bacteria
Replicon accession: NC_004116
Strand:
Start bp: 1673684
End bp: 1675066
Gene Length: 1383 bp
Protein Length: 460 aa
Translation table: 11
GC content: 37%
IMG OID: 637316847
Product: HAD superfamily hydrolase
Protein accession: NP_688669
Protein GI: 22537818
COG category:
    [R] General function prediction only
    [S] Function unknown
COG ID:
    [COG0561] Predicted hydrolases of the HAD superfamily
    [COG4696] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
    [TIGR00099] Cof subfamily of IIB subfamily of haloacid dehalogenase superfamily
    [TIGR01484] HAD-superfamily hydrolase, subfamily IIB
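
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(1673684, 1675066)
print(length)                     # 1383
print(protein_length_aa(length))  # 460
```

Both values agree with the Gene Length and Protein Length fields of this record.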


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000161748
Plasmid hitchhiking: No
Plasmid clonability: unclonable

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
GTGGCAATTA AGGCAGTATT TTTTGATATT GATGGTACAC TTCTTAATGA TCGTAAGAAT 
GTACAAAAGT CAACAATTAA AGCAATTCGA AATTTGAAAG ACCAAGGAAT ACTAGTCGGT
TTAGCAACGG GGCGAGGTCC TAGTTTTGTA CAACCTTTTT TAGAAAACCT TGGTTTAGAT
TTTGCTGTAA CCTATAATGG TCAATATATC TATAGTAGAA GTGAAATTAT TTATACCAAT
CAATTATCTA AGACAACTGT CTATCGTCTG ATTCGTTATG CTGGAGCAAG AAGAAGAGAA
ATTTCATTAG GAACAGCCTC AGGATTACTC GGTTCAGGTA TTATTGGTCT AGGAACTAGC
CGTTTGGGGC AGATTGTATC TAGCCTTGTT CCGAGAAAAT GGGCAAAAGC GATTGAACGA
AGCTTTAAGC ATTTTATTCG TCGGATTAAA CCTCAAAATA TTGATAGCCT CATGGTTATC
TTACGAGAAC CTATTTATCA GGTCGTTTTA GTTGCAACAG AGGGCGAATC AGAGCGAATT
CAAAAACAAT TTCCTCGTGT TAAATTAACA AGAAGCAGTC CTTACTCAAT GGATGTCATT
TCTGAAGGGC AGTCAAAAGT TAAGGGAATT GAACGTGTTG GTCAACGCTA TGGTTTTGAT
CTATCCGAAG TGATAGCATT TGGAGATTCT GATAATGATA TTGAGATGTT ATCTCAAGTT
GGCATTGGTG TTGCCATGGG GAACGCTAGT CAGCAAGTGA GAGAAAATGC ACGTTATACA
ACTGCTGACA ATAATGATGA TGGTATCTCT AAGGCATTAG CCCATTATGG ACTTATCCAA
TTTGAGATTG AAAAAACATT CAGTAGTCGT GACGAGAATT TCAATAAAGT AAAATCCTTC
CATCTATTAA TGGATGGTGA AACTATTGAA ACGCCACGCT TATATGACAG TAAGGAAGCT
GGTTTCAGGT CAGACTTTAA AGTAGAAGAA ATCGTTGAGT TCTTGTATGC TGCTAGTCAA
GGTAACCAAA AAGTATTTGA CCAATCTATC CGTAATTTAC ACTTAGCTAT TGATAAAGCA
AGAGATAAGG TTATTTCTAA AGACCATCCA GAAACACCAT TAGTGGGAGA AGTGGATGCC
TTAACAGATT TACTTTATCT GACTTATGGC TCCTTTGTTC TTATGGGAGT CGACCCAAAA
CCTCTTTTTG ATACAGTACA TGAGGCCAAT ATGGGGAAAA TCTTTCCAGA TGGCAAAGCT
CATTTTGATC CTGTTACTCA TAAAATTTTA AAACCAGACG ACTGGGAAGA ACATTTCGCT
CCTGAGCCAT CAATTCGACG TGAATTAGAT AGCCAAATTC AGAAATCCTT AAATCGAAAA
TAA
 
Protein sequence
MAIKAVFFDI DGTLLNDRKN VQKSTIKAIR NLKDQGILVG LATGRGPSFV QPFLENLGLD 
FAVTYNGQYI YSRSEIIYTN QLSKTTVYRL IRYAGARRRE ISLGTASGLL GSGIIGLGTS
RLGQIVSSLV PRKWAKAIER SFKHFIRRIK PQNIDSLMVI LREPIYQVVL VATEGESERI
QKQFPRVKLT RSSPYSMDVI SEGQSKVKGI ERVGQRYGFD LSEVIAFGDS DNDIEMLSQV
GIGVAMGNAS QQVRENARYT TADNNDDGIS KALAHYGLIQ FEIEKTFSSR DENFNKVKSF
HLLMDGETIE TPRLYDSKEA GFRSDFKVEE IVEFLYAASQ GNQKVFDQSI RNLHLAIDKA
RDKVISKDHP ETPLVGEVDA LTDLLYLTYG SFVLMGVDPK PLFDTVHEAN MGKIFPDGKA
HFDPVTHKIL KPDDWEEHFA PEPSIRRELD SQIQKSLNRK
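
The gene sequence begins with GTG while the protein sequence begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is one of the accepted alternative initiation codons and is read as methionine at the start position. The following is a minimal sketch of that behaviour applied to the first 30 bases of the gene sequence; the `translate` helper and the constant names are hypothetical, written for illustration only.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; for sense and stop codons,
# table 11 is identical to the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate a CDS, reading a valid start codon as M and stopping at '*'."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # alternative start codons still initiate with methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("GTGGCAATTA AGGCAGTATT TTTTGATATT"))  # MAIKAVFFDI
```

The output matches the first ten residues of the protein sequence in this record.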