Gene Sare_1302 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1302 
Symbol 
ID5703682 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1510386 
End bp1511597 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content72% 
IMG OID641270813 
Product1-deoxy-D-xylulose 5-phosphate reductoisomerase 
Protein accessionYP_001536194 
Protein GI159036941 
COG category[I] Lipid transport and metabolism 
COG ID[COG0743] 1-deoxy-D-xylulose 5-phosphate reductoisomerase 
TIGRFAM ID[TIGR00243] 1-deoxy-D-xylulose 5-phosphate reductoisomerase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.130609 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACTTCTC CCCGCGACCT TGTGTTGCTC GGCTCCACCG GCTCGATCGG TACCCAGGCC 
ATCGATGTCG CGCGCCGTAA CCCGGACCGG TTCCGGGTGG TGGCGCTCGG CGCTGGCGGC
GGCAACGTCG GGTTGCTCGC CGCCCAGGCC CTGGACCTCG GCGTGGAGGC GGTCGGCGTG
GCGCGGGCGT CCGTCGTGCA GGACCTCCAA CTCGCCTTCT ACGCCGAGGC GAGCCGCCGA
GGCTGGGCCC GTGGCGACGT CAAGCTGCCG AAGATCATCG CCGGCCCGGA CGCGATGACC
GAGCTGGCCC AGTGGCCCGG CGACGTGGTG CTCAACGGGG TGGTCGGCTC ACTCGGGCTC
GCGCCGACCC TGGCCGCGCT GCGCGCCGGC CGCACCCTCG CCCTGGCCAA CAAGGAGTCG
CTGGTCGCCG GCGGTCCCCT GGTGCGGGCC GCGATGACCC GGCCGGAGCA GATCGTTCCG
GTCGACTCCG AGCACACGGC GCTCGCCCAG TGCCTGCGTT CCGGGTGCCG CCCCGAGGTG
CGTCGGCTGA TCGTGACCGC CAGTGGTGGC CCGTTCCGGG GGTGGCGGCG CGACGAGTTG
ACCCACGTCA CGCCGGAGCA GGCGCTCGCG CACCCGACCT GGGACATGGG GCCGGTCATC
ACGATCAACT CGGCGACGAT GGTCAACAAG GCGCTGGAGG TGATCGAGGC GCACGAGCTG
TTCGGCGTGC CGTACGCCGA CATCGCCGTG ACCGTGCACC CCCAGTCAGT GATCCACTCG
ATGGTCGAGT TCGTCGACGG CTCCACCATC GCCCAGGCCA GCCCGCCGGA CATGCGGCTG
CCGATCGCGG TGGCGCTGGG CTGGCCCGAC CGCGTGCCGG ACGCCGCTCC CGCCGTCGAC
TGGACCACCA GCCACACCTG GCAGTTCGCG CCACTGGACG ATGCGGCGTT CCCGGCGGTC
GCCCTCGCCA AGGCGGCCGG CGAGGCCGGG CGCAGCCGGC CGGCGATCTA CAACGCGGCC
AACGAGGAGT GCGTCGCGGC GTTCGTCGCC GGCCGGCTGC CGTTCCTCGG CATCGTCGAC
ACCCTGGAGC AGGTGCTGGC GGACGCTCCT GACTTCGGCG AACCAGGTAC CGTCGAGGAC
GTGCTCGCGG CCGAGTCGTG GGCGCGCGGG CACGCGCAGA CGATCATCGA GAAATCAGTG
GAAGGAGCTT GA
 
Protein sequence
MTSPRDLVLL GSTGSIGTQA IDVARRNPDR FRVVALGAGG GNVGLLAAQA LDLGVEAVGV 
ARASVVQDLQ LAFYAEASRR GWARGDVKLP KIIAGPDAMT ELAQWPGDVV LNGVVGSLGL
APTLAALRAG RTLALANKES LVAGGPLVRA AMTRPEQIVP VDSEHTALAQ CLRSGCRPEV
RRLIVTASGG PFRGWRRDEL THVTPEQALA HPTWDMGPVI TINSATMVNK ALEVIEAHEL
FGVPYADIAV TVHPQSVIHS MVEFVDGSTI AQASPPDMRL PIAVALGWPD RVPDAAPAVD
WTTSHTWQFA PLDDAAFPAV ALAKAAGEAG RSRPAIYNAA NEECVAAFVA GRLPFLGIVD
TLEQVLADAP DFGEPGTVED VLAAESWARG HAQTIIEKSV EGA