Gene SAG0428 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG0428 
Symbol 
ID1013230 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp442266 
End bp443303 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content40% 
IMG OID637315633 
Productalcohol dehydrogenase, zinc-containing 
Protein accessionNP_687462 
Protein GI22536611 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGTAG CTACATTTAT TGAACCTGGC AAAATGGTTA TAACTGATAC ACCAAAACCA 
GTCATTGAAC AAGAGACAGA TGCTGTTATC AAAATTGTTA GAGCCTGTGT TTGTGGTTCA
GACTTGTGGT GGTATCGTGG TATTTCAAAA CGTGAAAGTG GTTCTTTTGC AGGTCATGAG
GCTATTGGTA TCGTTGAGGA AGTTGGTACT AAAGTAACTG ACGTGTCAAA AGGTGATTTT
GTTATTGTTC CCTTTACACA TGGCTGTGGC CAATGTCCGT CTTGTAAGGC TGGATTTGAT
GGAAATTGTA CAAATCATCA AGCTGCAAAA AATGTAGGAT ACCAAGGTCA ATATCTACGT
TATACTAATG CAAACTGGGC ATTGGTTAAG ATCCCAGGAC AACCTTCTGA TTATGATAAT
GAGACTCTTA ACTCTCTGCT CACCTTATCA GATGTAATGG CAACTGGTTA TCATGCTGCA
GCAACTGCAG AAGTTAAAGA AGGTGACACG GTAGTTGTCA TGGGAGATGG TGCTGTTGGA
CTCTGTGGTG TTATCGCTGC TAAAATGTTA GGTGCTAACC GTATCATTGC AATGAGTCGT
CACAAAGATA GGCAGGAACT AGCATTGACT TTTGGTGCAA CGGATATTGT CGAAGAACGG
GGTGACGAAG CCGTTAAACG TGTTTTAGAT TTAACCAATC AAGCAGGTGC CGATGCTGTT
TTAGAATGTG TTGGTACAGA GCAGTCTGTT GATACAGCTA CCCAAATTGC TAGGCCTGGT
GCTGTCATAG GACGTGTTGG TATCCCACAA AATCCAGACA TGAATACCAA TAATTTATTC
TGGAAAAATA TCGGCCTTAG AGGTGGCATT GCTTCTGTAA CAACATTTGA TAAGTCTGTC
CTTTTAGATG CTGTTCTAAC TCATAAAATT AACCCAGGCT TAGTTTTTAC AAAATCCTTT
GTATTAGATG ATATTCAAAA GGCTTATGAA GCAATGGATA AACGTGACGC TATCAAATCC
TTAGTGATTG TTGACTGA
 
Protein sequence
MKVATFIEPG KMVITDTPKP VIEQETDAVI KIVRACVCGS DLWWYRGISK RESGSFAGHE 
AIGIVEEVGT KVTDVSKGDF VIVPFTHGCG QCPSCKAGFD GNCTNHQAAK NVGYQGQYLR
YTNANWALVK IPGQPSDYDN ETLNSLLTLS DVMATGYHAA ATAEVKEGDT VVVMGDGAVG
LCGVIAAKML GANRIIAMSR HKDRQELALT FGATDIVEER GDEAVKRVLD LTNQAGADAV
LECVGTEQSV DTATQIARPG AVIGRVGIPQ NPDMNTNNLF WKNIGLRGGI ASVTTFDKSV
LLDAVLTHKI NPGLVFTKSF VLDDIQKAYE AMDKRDAIKS LVIVD