Gene Sare_3950 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3950 
Symbol 
ID5708221 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4493817 
End bp4494905 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content71% 
IMG OID641273375 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_001538731 
Protein GI159039478 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.178604 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.35096 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCACCT CGGAGATGGA CCGGATCAGC GATCAGCGGA TCGATCGTGT GGTGCCGCTG 
ACCACCCCGG CCCTGTTACA CCACGAGCTG CCCCTGAACG ATCGGCTCAC CTCGGCCGTA
CTCACTGGCA GACGGGCTGT CGGCCGGGTG CTGGACCGCG CCGACGACCG CCTCCTGGTG
GTGGTCGGCC CGTGTTCGGT ACACGATCCG GCCGCCGCCC TCGACTACGC CCACCGGCTC
CGCGAGGTCG CCGGTCGGCT CGCCGACGAC CTGCTTGTGG TGATGCGGGT CTACTTCGAG
AAGCCGCGCT CGACCGTGGG CTGGAAGGGG CTCATCAACG ATCCCGGGCT GGACGGTTCC
GGTGATGTGA ACACCGGCCT GCGTCGGGCC CGCGCGCTGC TGATCGACGT GCTGCGCCTG
GGTCTCCCGG TCGGATGCGA GTTCCTGGAC CCGATCACCC CGCAGTACAT CGCCGACACG
GTGGCCTGGG GTGCGATCGG CGCCCGGACC GTGGAGAGCC AGGTGCACCG CCAGCTCGCC
TCCGGCTTGT CGATGCCGAT CGGGATGAAG AACCGCCCCG ACGGCAGCAT CTCCACCGCG
GTGGACGCGA TCCGGGCGGC CGGCGTGCCA CACGTGTTCC CCGGCATCGA CATCTCCGGC
ACCCCAGCGA TCATGCACAC CCGAGGCAAC GCGGACGGTC ACCTGGTGCT GCGCGGTGGT
GGCAACCGCC CGAACTACGA CGCGAAGTCG GTGGCGGACG CGCTCGCGCT GCTGCGCGCC
GACGGGCTGC CCGAGCGGCT GGTGATCGAC GCCAGCCATG CCAACAGCGG CAAGGACCAC
CGGAACCAGC CGCTCGTCGC CGCCGACGTG GCCGCCCAAC TCGCCGGGGG CCAGCACGGC
ATCGTCGGCA TCATGCTGGA GAGCTTCCTG CTGTCGGGTC GGCAGGGCCT GGACCCGACC
CGCGAGCTGA CGTACGGGCA GTCGATCACC GATGCCTGCA TCGGCTGGGA CACCACGGAA
GAGGTGCTGG CCGACCTGGC AGCCGCCGTG CGCACCCGCC GTCGGGCTCC GGCCGTCACC
CCCGCCTGA
 
Protein sequence
MTTSEMDRIS DQRIDRVVPL TTPALLHHEL PLNDRLTSAV LTGRRAVGRV LDRADDRLLV 
VVGPCSVHDP AAALDYAHRL REVAGRLADD LLVVMRVYFE KPRSTVGWKG LINDPGLDGS
GDVNTGLRRA RALLIDVLRL GLPVGCEFLD PITPQYIADT VAWGAIGART VESQVHRQLA
SGLSMPIGMK NRPDGSISTA VDAIRAAGVP HVFPGIDISG TPAIMHTRGN ADGHLVLRGG
GNRPNYDAKS VADALALLRA DGLPERLVID ASHANSGKDH RNQPLVAADV AAQLAGGQHG
IVGIMLESFL LSGRQGLDPT RELTYGQSIT DACIGWDTTE EVLADLAAAV RTRRRAPAVT
PA