Gene Sare_1253 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1253 
Symbol 
ID5703481 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1449486 
End bp1450547 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content71% 
IMG OID641270768 
Product3-dehydroquinate synthase 
Protein accessionYP_001536149 
Protein GI159036896 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0337] 3-dehydroquinate synthetase 
TIGRFAM ID[TIGR01357] 3-dehydroquinate synthase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000152397 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGAGAAGAG TCGTTCCGGT GCGTCTCGGG GAACGTTCCT ACCAGGTCCT GATCGGCCCG 
GGCGTCCGCA CATCGCTGGC CGAGGTGATT CGCCGACTGG GCGCCGAACG AGCCGCCGTC
GTGTCGGCCC GCCCGCGGGA ATGGGTGCCC GACACGGGCG TGGAGACCCT GCTGCTGCCC
GCCCGCGACG GCGAGCAGAG CAAGACGCTC GCCACGGTGG AGGCGTTGTG CCACGCGTTC
GTGCGGTTCG GGCTCACCCG GTCCGACGTT GTCGTCTCGT GCGGTGGCGG GACGACAACC
GACGTCGTCG GACTGGCTGC CGCGCTCTAC CACCGGGGGG TGGACGTGAT CCATCTGCCC
ACGTCGCTGC TGGCCCAGGT GGACGCCAGC GTCGGCGGTA AAACGGCGGT GAACCTGCCC
GACGGCAAGA ACCTGGTCGG CGCGTACTGG CAGCCCCGCG CGGTGCTGTG CGACACGGAC
TACCTGTCGA CCCTGCCCCC GCGGGAGTTG CTCAACGGCC TGGGTGAGAT CGCCCGCTGT
CACTTTATCG GTGCGGGTGA CCTGCGCGGG CTACCGCTCG CGGAGCAGAT CGCCGCCAGT
GTGACCCGCA AGGCGGGCAT CGTCGAGGTC GACGAGCGGG ACGCCGGTAG GCGGCATCTG
CTCAACTACG GCCATACGCT GGGCCACGCG CTCGAGCTGG CCACCGGATT CGCGCTGCGG
CACGGCGAGG CGGTCGCAGT CGGCACCGTC TTCGCCGGCC GGCTGGCGGG CGCGCTGGGC
CGCATCAACC AGTCCAGAGT GGATGAACAT CTGGCGGTCG TGCGCCACTA CAACCTGCCC
GCCGCCCTGC CCGCCGAGGT CGATCCCAGG GCCCTGGTCC GCCAGATGCG CCGGGACAAG
AAGGCGATCA GTGGTCTCGG TTTCGTCCTG GACGGGCCCG AGGGCGCGGA GCTGGTGAGT
GACGTGCCGG AGAACGTGGT GCTCGCTGTC CTCGACGCGA TGCCGCGAGC GCCCATGGAC
GCGCTCGTCG GCGCCCTCAC GACCGGTGCG GTGCGGACAT GA
 
Protein sequence
MRRVVPVRLG ERSYQVLIGP GVRTSLAEVI RRLGAERAAV VSARPREWVP DTGVETLLLP 
ARDGEQSKTL ATVEALCHAF VRFGLTRSDV VVSCGGGTTT DVVGLAAALY HRGVDVIHLP
TSLLAQVDAS VGGKTAVNLP DGKNLVGAYW QPRAVLCDTD YLSTLPPREL LNGLGEIARC
HFIGAGDLRG LPLAEQIAAS VTRKAGIVEV DERDAGRRHL LNYGHTLGHA LELATGFALR
HGEAVAVGTV FAGRLAGALG RINQSRVDEH LAVVRHYNLP AALPAEVDPR ALVRQMRRDK
KAISGLGFVL DGPEGAELVS DVPENVVLAV LDAMPRAPMD ALVGALTTGA VRT