Gene Sare_1121 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1121 
Symbol 
ID5706064 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1267153 
End bp1269000 
Gene Length1848 bp 
Protein Length615 aa 
Translation table11 
GC content70% 
IMG OID641270636 
Productdihydroxy-acid dehydratase 
Protein accessionYP_001536020 
Protein GI159036767 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.836437 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000632283 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCTGAGC TGCGGTCGAG GACCTCCACC CACGGTCGGA CGATGGCGGG CGCCCGAGCC 
CTGTGGCGGG CCACCGGGAT GACCGACGAC GACTTCGGCA AGCCGATCGT TGCCATCGCG
AACAGTTTCA CCCAGTTCGT TCCGGGACAC GTCCACCTCA AGGACCTCGG TGGCCTGGTC
GCCGACGCGG TAGCCGAGGC GGGCGGGGTG GGCCGGGAGT TCAACACCAT CGCCGTGGAC
GACGGCATCG CGATGGGTCA CGGCGGAATG CTCTATTCGC TGCCCAGCCG GGAACTGATC
GCCGACGCCG TGGAATACAT GGTCAACGCG CACTGCGCCG ACGCCCTGGT CTGCATCTCC
AACTGCGACA AGATCACTCC TGGGATGCTG CTGGCCGCAC TGCGGCTGAA CATCCCGACC
GTCTTCGTCT CCGGCGGCCC GATGGAGGCC GGCAAGACGG TCGCGATCGA GGGAATCGTG
CATTCCAAGA TCGACCTGAT CGACGCGATG ATCGCCGCGT CCAACGAGGC TGTCACCGAT
GACCAGCTCG ACCAGATCGA ACGCTCGGCC TGCCCCACCT GCGGCTCCTG CTCCGGCATG
TTCACCGCCA ACTCGATGAA CTGCCTCACC GAGGCGATCG GCCTGGCCCT GCCCGGCAAC
GGGTCGACGC TGGCGACCCA CGCCGCCCGC CGGTCGCTCT TCGTCGACGC CGGCCGCACC
GTCGTGGAGA TCGCCAAGCG CTGGTACGAC GGTGACGACG CCACGGTGCT GCCCCGCGCG
GTCGCCAACC GCGCCGCCTT CGAGAACGCG GTCGCCCTCG ACGTCGCGAT GGGCGGCTCG
ACGAACACCA TCCTGCACTT GCTCGCCGCC GCTCGAGAGG CCGAGCTGGA CTTCGGGGTG
GCGGACATCG ACACCATCTC CCGGCGGGTG CCCTGCCTGG CCAAGGTCGC ACCGAACTCT
CCCCTCTACC ACATGGAGGA CGTCCATCGA GCCGGCGGCA TCCCGGCCAT CCTCGGTGAG
CTGGACCGGG CCGGGCTACT CAACCGGGAG GTGCACGCGG TGCACTCCCC CTCGCTGGCA
ACCTGGCTCG CTGACTGGGA CGTTCGCGGC GACGCGGCGA CACCGGAGGC AGTCGACCTG
TTCCACGCCG CACCAGGTGG GGTACGCACC GTCGAGCCGT TCTCCACCAC CAACCGCTGG
TCCACGCTGG ACACCGACGC GGCCGGCGGC TGCGTACGGG ACCGGGCGCA CGCGTACACC
GCTGACGGCG GCTTGGCCAT CCTGCACGGC AACCTCGCAC CGGACGGCTG TGTGGTGAAG
ACCGCCGGTG TGCCCGAGGA GTGCCTCACC TTCCGCGGCC CGGCCAGGGT CTACGAATCC
CAGGACGATG CCGTCGCCGC CATCCTGGCC AAGGAGGTGA CCGCCGGGGA CGTCGTGGTG
ATCCGCTACG AGGGCCCCCG GGGCGGCCCC GGGATGCAGG AGATGCTCTA CCCCACCTCG
TTCCTCAAGG GCCGGGGGCT GGGGCGGGCC TGCGCGCTGC TCACCGACGG ACGGTTCTCC
GGCGGTACCT CCGGACTGTC CATCGGGCAC GTCTCCCCCG AGGCTGCCGC TGGCGGGCTG
ATCGCCCTGG TCGAACCGGG CGACGAGATC GTCATCGACA TTCCGAACCG CACCATCGAG
CTCGCTGTTC CGGCCGACGT GTTGGACGCC CGTCGGGTGG CGCAGGAGAA GCGGGACCGC
CCGTACACGC CCGCGGATCG TCAGCGCCCC GTGTCCGCGG CGCTACGCGC GTACGCCTCG
ATGGCGACCT CCGCCAGCGA CGGTGCCTAC CGCCGCGTCC CCGAGTAG
 
Protein sequence
MPELRSRTST HGRTMAGARA LWRATGMTDD DFGKPIVAIA NSFTQFVPGH VHLKDLGGLV 
ADAVAEAGGV GREFNTIAVD DGIAMGHGGM LYSLPSRELI ADAVEYMVNA HCADALVCIS
NCDKITPGML LAALRLNIPT VFVSGGPMEA GKTVAIEGIV HSKIDLIDAM IAASNEAVTD
DQLDQIERSA CPTCGSCSGM FTANSMNCLT EAIGLALPGN GSTLATHAAR RSLFVDAGRT
VVEIAKRWYD GDDATVLPRA VANRAAFENA VALDVAMGGS TNTILHLLAA AREAELDFGV
ADIDTISRRV PCLAKVAPNS PLYHMEDVHR AGGIPAILGE LDRAGLLNRE VHAVHSPSLA
TWLADWDVRG DAATPEAVDL FHAAPGGVRT VEPFSTTNRW STLDTDAAGG CVRDRAHAYT
ADGGLAILHG NLAPDGCVVK TAGVPEECLT FRGPARVYES QDDAVAAILA KEVTAGDVVV
IRYEGPRGGP GMQEMLYPTS FLKGRGLGRA CALLTDGRFS GGTSGLSIGH VSPEAAAGGL
IALVEPGDEI VIDIPNRTIE LAVPADVLDA RRVAQEKRDR PYTPADRQRP VSAALRAYAS
MATSASDGAY RRVPE