Gene SAG1686 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG1686 
Symbol 
ID1014495 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp1683520 
End bp1684527 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content35% 
IMG OID637316855 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionNP_688677 
Protein GI22537826 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.643032 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGATTTC ATCAGTTAAG TGATAAAATT AATATTGAAA TATTGAAGCA GAAAACTAGC 
TTAGATCTAG AAGTTAGCCA AAAAAAATTA GCCAAAGAAG AAGAGTTAAA AAATATCATA
AAAGGTGAGG ATCAGCGGTT TCTAGTAATA GTAGGTCCTT GTTCTGCGGA TAATCCAAAG
GCGGTGCTAA CTTATGCAAA GCGTTTAGCT AAATTGGAAG CTGCCTTTAA AGATAAAATG
TTTTTGGTTA TGAGAGTTTA TACCGCTAAG CCAAGAACGA ATGGTGATGG TTATAAAGGC
TTAGTTCACC ATTCTGACAA ACTAGGAGTT TTTTTTCAAG CACGTAAGAT GCACTATGAC
ATCATTCGAG AGACTGGACT CCTGACAGCT GATGAATTAC TTTATCCAGA GATGTTATCA
GTGATGGATG ATTTAGTTTC TTACTATGCT ATTGGTGCGC GTTCAGTGGA AGATCAGGGA
CACCGTTTTA TATCTTCAGG TATAGATGCA CCAGTAGGTA TGAAAAATCC AACGTCTGGT
AATTTAAGAG TCATGTTTAA TGCGGTCTAT GCAGCACAAA ATCAACAGGA ATTATTCTAT
CAAAATAAGC AAGTTAGAAC AGATGGTAAT TTGCTGTCAC ATGTTATTTT GAGAGGATAT
CATAATGCCG ACTATCGAAG TATCCCTAAT TATCATTATG AAAACCTTTT AGAAACTATT
ACTCATTATG AAGAGACTGA TTTGCAAAAT CCCTTTATTG TTGTTGATAC AAATCATGAC
AATTCAGGTA AGCAATTTTT AGAACAAATT CGCATTGTGA AATCAGTACT TGCAGACCGA
CAGTGGCATA CTAAAATTAG AAACTATGTT CGTGGTTTTT TGATAGAATC TTATTTAGAA
GATGGAAGAC AAGATAAACC AGATGTTTTT GGTAAGTCTA TCACAGATCC ATGTCTAGGT
TGGGATAAAA CAGAAATGCT AATTAGGGAT ATTTACAATA GTTTGTAA
 
Protein sequence
MGFHQLSDKI NIEILKQKTS LDLEVSQKKL AKEEELKNII KGEDQRFLVI VGPCSADNPK 
AVLTYAKRLA KLEAAFKDKM FLVMRVYTAK PRTNGDGYKG LVHHSDKLGV FFQARKMHYD
IIRETGLLTA DELLYPEMLS VMDDLVSYYA IGARSVEDQG HRFISSGIDA PVGMKNPTSG
NLRVMFNAVY AAQNQQELFY QNKQVRTDGN LLSHVILRGY HNADYRSIPN YHYENLLETI
THYEETDLQN PFIVVDTNHD NSGKQFLEQI RIVKSVLADR QWHTKIRNYV RGFLIESYLE
DGRQDKPDVF GKSITDPCLG WDKTEMLIRD IYNSL