Gene Sare_1454 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1454 
Symbol 
ID5704165 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1679471 
End bp1681453 
Gene Length1983 bp 
Protein Length660 aa 
Translation table11 
GC content73% 
IMG OID641270963 
Product1-deoxy-D-xylulose-5-phosphate synthase 
Protein accessionYP_001536344 
Protein GI159037091 
COG category[H] Coenzyme transport and metabolism
[I] Lipid transport and metabolism 
COG ID[COG1154] Deoxyxylulose-5-phosphate synthase 
TIGRFAM ID[TIGR00204] 1-deoxy-D-xylulose-5-phosphate synthase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0111876 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGTTG AACCGGACAC GGCAAACCAC CCGGGGCTGC TGGCCGCCGT ACGCGGCCCG 
CAGGACGTCA AGCGGATGTC CACCGAGCAG TTGGGCATCC TCGCCGCCGA GATCCGTGAC
TTCCTGGTCG CCAAGGTCTC CCGCACCGGC GGGCACATCG GCCCCAACCT GGGCGTGGTG
GAGCTGACCC TCGCGCTGCA CCGGGTCTTC GACTCCCCGC GGGACCGGCT CCTGTTCGAC
ACCGGCCACC AGGCCTACGT ACACAAGATC CTCACCGGCC GGCAGGACGG CTTCGACCGG
CTGCGCCAGC GTGACGGGCT CTCCGGCTAC CCGAGCCAGG CCGAGAGCGA GCATGACCTG
ATCGAGAACT CGCACGCCTC CACCGCCCTG TCCTACGCCG ACGGGCTGGC CAAGGCGTAC
GCGCTGCGGG GTGAGTCCCG GGCGGTGGCG GCCGTGGTCG GCGACGGCGC GCTGACCGGC
GGTATGTGCT GGGAGGCGTT GAACAACATT GCGACCGCCG GCAACCCACT GGTGATCGTG
GTCAACGACA ACGGCCGGTC CTACTCGCCG ACCATCGGTG GGCTCGCCGA CCACCTGTCG
ACGCTGCGCC TGAATCCCAG CTACGAGCGG GTGCTGGACA CGGTCCGCGA GGCGCTCGGC
TCGACCCCGC TGGTCGGCCG GCCGATGTAC GAGGTGCTGC ACGCGGTCAA GCGGGGCATC
AAGGACGCGG TCGCGCCGCA GGCGATGTTC GAGGACCTCG GCATCAAGTA CGTCGGCCCG
GTCGACGGGC ACGACATCGT GGCGGTCGAG GGGGCCCTGC GCGCGGCGAA GAACTTCGGC
GGCCCCGTGA TCGTGCACGC GGTCACCCGC AAGGGCTACG GGTACCGCCC GGCCGAGGAG
GACGAGGCCG ACTGCCTGCA CGGCCCGGGC GCGTTCAACG TCGAGACCGG CCAACTGGTC
GCCGCGCCGA CGGTGAAGTG GACCCACGTC TTCGCCGACG AGTTGGTGGC GATCGCCGAC
GAGCGACCGG ATGTGGTGGG GATCACCGCC GCGATGGCCG AGCCGACCGG CATCGCCAAG
CTCGCCCGCA AGTATCCGGA GCGCACCTAC GACGTGGGTA TCGCCGAGCA GCACGCCGCC
ACCTCGGCGG CCGGCTTGGC GCTGGGCGGT CTGCACCCGG TGGTCGCGGT CTACGCGACC
TTCCTGAACC GGGCGTTCGA CCAGGTCCTG CTGGACGTGG CGATGCACAA GCTGCCGGTG
ACCTTCGTGC TCGACCGGGC CGGGATCACC GGCCCGGACG GGCCCAGCCA CTACGGCATG
TGGGACATGT CCGTCTTCGG GGTGGTGCCG GGCCTGCGGA TCGCCGCGCC CCGCGACGCC
GCCACCCTCC GCGAGGAACT GCGCGAGGCA GTCGCCGTCA ACGACGGGCC GACCATCGTC
CGGTTCCCGA CCGGCGCCGT CGCCGCCGAC CTGCCGGCGC TGCGCCGGGT CGGGCCGGTC
GACGTGCTCG CCGAGTCGGC CCGCACCGAC GTGCTGCTGG TCGCGGTCGG CTCCTTCGCC
GGCCTGGGTG TGCAGGTCGC CGGCCGGGTC GCCGAGCAGG GCTACGGTGT CACCGTCGTG
GACCCGCGCT GGGTCCGGCC GGCCCCGGCC GAACTGGTGG AACTGGCCGC CGGGCACCGG
CTCGTGGTCA CCGTGGAGGA CGGCGTCCGG GTTGGTGGGG TCGGCGACGC GCTCGCCCAG
GCGATGCGGG ACGCCGACGT CGAGGTGCCG GTGAAGGACC TCGGAGTGCC GGCCGACTGG
CACCCGCACG GCACCCGGGC GCAGATCCTC GCCGACCTCG GTCTGACCGC CCAGGACGTG
GCCCGCGACG TCACCGGCTG GATCTCCCGC CTCGACGTCG ACGCCGCCGA CACCGAGGAC
GCGCTCGCGT CCGAGCCGGT GGGGTCGGTC GTCACCCCGC GGGAGGCTCC CGCTCCGAAG
TGA
 
Protein sequence
MSVEPDTANH PGLLAAVRGP QDVKRMSTEQ LGILAAEIRD FLVAKVSRTG GHIGPNLGVV 
ELTLALHRVF DSPRDRLLFD TGHQAYVHKI LTGRQDGFDR LRQRDGLSGY PSQAESEHDL
IENSHASTAL SYADGLAKAY ALRGESRAVA AVVGDGALTG GMCWEALNNI ATAGNPLVIV
VNDNGRSYSP TIGGLADHLS TLRLNPSYER VLDTVREALG STPLVGRPMY EVLHAVKRGI
KDAVAPQAMF EDLGIKYVGP VDGHDIVAVE GALRAAKNFG GPVIVHAVTR KGYGYRPAEE
DEADCLHGPG AFNVETGQLV AAPTVKWTHV FADELVAIAD ERPDVVGITA AMAEPTGIAK
LARKYPERTY DVGIAEQHAA TSAAGLALGG LHPVVAVYAT FLNRAFDQVL LDVAMHKLPV
TFVLDRAGIT GPDGPSHYGM WDMSVFGVVP GLRIAAPRDA ATLREELREA VAVNDGPTIV
RFPTGAVAAD LPALRRVGPV DVLAESARTD VLLVAVGSFA GLGVQVAGRV AEQGYGVTVV
DPRWVRPAPA ELVELAAGHR LVVTVEDGVR VGGVGDALAQ AMRDADVEVP VKDLGVPADW
HPHGTRAQIL ADLGLTAQDV ARDVTGWISR LDVDAADTED ALASEPVGSV VTPREAPAPK