Gene Strop_3571 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_3571 
Symbol 
ID5060046 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp4091314 
End bp4092402 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content71% 
IMG OID640475826 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_001160380 
Protein GI145596083 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.534041 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCACCT CCGAGACGAA CCGGATCAGC GACCAGCGGA TCGACCGTGT GGTGCCGCTG 
ACCACCCCGG CACTGCTACA CCACGAGCTG CCCCTGGACA GTTCGCTCAC CTCGGCCGTA
CTCACTGGCA GACGGGCCGT CGGCCGAGTC TTGGACCGCG CCGACGACCG CCTCCTGGTG
GTAGTCGGCC CCTGTTCGGT ACATGACCCG GTCGCCGCCC TCGCCTACGC TCACCGGCTC
CGCGAGCTCG CCGATCGGCT CGCCGACGAC CTACTCGTGG TGATGCGGGT CTACTTCGAG
AAGCCGCGCT CAACCGTCGG CTGGAAGGGG CTTATCAACG ACCCGGGACT GGACGGTTCC
GGTGATGTGA ACACGGGCCT GCGCCGGGCC CGTGCGCTCC TGATCGACGT GCTGCGCCTG
GGCCTCCCGG TCGGTTGCGA GTTCCTGGAC CCGATCACCC CGCAGTACAT CGCCGACACG
GTGGCCTGGG GCGCGATCGG CGCCCGGACC GTGGAGAGCC AGGTGCACCG CCAGCTCGCC
TCCGGCCTGT CGATGCCGAT CGGAATGAAG AACCGTCCCG ACGGCAGCAT CTCCACCGCG
ACGGACGCCA TCCGGGCGGC CGGCGTGCCG CACGTCTTCC CCGGCATCGA CGCCTCCGGC
ACCCCAGCGA TCATGCACAC CCGCGGTAAC ACCGACGGCC ACCTGGTGCT GCGCGGTGGC
GGCAACCAGC CGAACTACGA CGCCGAATCG GTGACGGACG CGCTGGCGCT GCTGCGCGAC
GCCGGGCTTC CCGAACGGCT GGTCATCGAC GCCAGCCACG CCAACAGCGG CAAGGACCAC
CGTAACCAGC CGCTCGTCGC CGCCGACGTG GCCGCCCAAC TCGCCGGAGG CCAGCACGGC
ATCGTCGGCG TCATGCTGGA GAGCTTCCTG CTCGCAGGTC GGCAGGACCT GGACCCGACC
CGCGAGCTGA CCTACGGGCA GTCGATCACC GACGCCTGTA TCGGCTGGGA CACCACCGAG
GAGGTCCTGG CCGACCTGGC CGCCGCCGTG CGCACCCGAC GGCGGGCTCC GGCCGTCACC
CCTGTCTGA
 
Protein sequence
MTTSETNRIS DQRIDRVVPL TTPALLHHEL PLDSSLTSAV LTGRRAVGRV LDRADDRLLV 
VVGPCSVHDP VAALAYAHRL RELADRLADD LLVVMRVYFE KPRSTVGWKG LINDPGLDGS
GDVNTGLRRA RALLIDVLRL GLPVGCEFLD PITPQYIADT VAWGAIGART VESQVHRQLA
SGLSMPIGMK NRPDGSISTA TDAIRAAGVP HVFPGIDASG TPAIMHTRGN TDGHLVLRGG
GNQPNYDAES VTDALALLRD AGLPERLVID ASHANSGKDH RNQPLVAADV AAQLAGGQHG
IVGVMLESFL LAGRQDLDPT RELTYGQSIT DACIGWDTTE EVLADLAAAV RTRRRAPAVT
PV