Gene Strop_1017 details

Gene Information

Locus tag: Strop_1017
Symbol:
ID: 5057463
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora tropica CNB-440
Kingdom: Bacteria
Replicon accession: NC_009380
Strand:
Start bp: 1145798
End bp: 1147576
Gene Length: 1779 bp
Protein Length: 592 aa
Translation table: 11
GC content: 70%
IMG OID: 640473286
Product: dihydroxy-acid dehydratase
Protein accession: YP_001157869
Protein GI: 145593572
COG category: [E] Amino acid transport and metabolism
              [G] Carbohydrate transport and metabolism
COG ID: [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase
TIGRFAM ID: [TIGR00110] dihydroxy-acid dehydratase
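
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon; the record does not state either convention, so treat both as assumptions. The function names are illustrative only and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(1145798, 1147576)
print(length, protein_length_aa(length))  # prints: 1779 592
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.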


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.145908
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACGACCC GGCCTAACCC ACTCTTCGTC GGCATCGAGC GCGCCCCAGC GCGCGCCTCC 
ATGCGCGCCA CAGGTCTGTC GACCGAGGAC CTGAAGAAGC CGATGATCGG GGTAGCGCAC
AGTTGGATCG GCACGATGCC GTGCAACCTC AACCACCGGC GGCTCGCCCA GGAGGTGATG
GCCGGCGTGC GCGCCGCAGG TGGCACCCCG ATCGAGATCA ACACCATCGC GATCTCGGAC
GTGATCACCA TGGGCACGGA GGGAATGCGG ACCTCACTGG TGAGCCGCGA GGTGATCGCC
GACTCCATCG AGTTGGTATG CCGCGGCCAC GGTCTCGACG GCCTGGTGAC CCTGGCCGGC
TGTGACAAGA CCATACCCGG TGCGGCGCTG GCCCACGTGC GACTCGACAT CCCCGGAGCC
GTCATCTATT CCGGCACGAT GATGCCGGGC GAGCACCTCG GGCGCGACAT CACCCTGCAG
GATGTGTTTG AGGCCGTCGG CAGCGCCACC GCCACCGGCT GCACCGACGA GCTCGACAAG
CTCGAGCGCG CGGCCTGTCC CGGTATCGGG GCCTGCGCGG GCCACTACAC GGCCAATACG
ATGGCCGTGG TGCTGGAGTT CCTCGGGCTT TCCCCGTTCG GCTCGATGGA TCCGCCGGCA
GTCGACGCCC GCAAGGACAC GGTCTGCCGC CAGGCCGGCG AGCTGGTCAT GCGGGCGGTG
GCAGAAGGGC TGCGGCCCAG CCGGTTCCTG ACGCCCTCCT CGTTGCGCAA CGCCATCGCC
GCCGGGGTCG CCACCGGCGG CTCGACGAAC ATGGTGCTCC ACCTGTTGGC GATCGCTCGG
GAGGCGGGCA TCCCGCTGGA CATTGACGAG TTCGACCGGA TCAGCTCGGT GACCCCGATC
ATCGCTGACC TGCGTCCAAA CGGGACGTAC ACCGCGGTGG ACCTCGACCG GGCGGGCGGC
ACCCGGGTGA TCGCCCGCCA CATGGTCGAC GCGGGCCTGA TTGCTGGCGA CGAAAGCACC
GTCACCGGCC GCACCGTCGC ACACGAGGCC GCGGACGCGG CCGAGACACC CGGCCAACGG
GTCGTCACCA CGGTGGAGGC ACCCCTATCG CCTTCCGGTG CTCTCCTGAT CTTGCGGGGT
AACCTAGCGC CCGATGGCAG TGTGGTGAAG GCACCCGGCG CGGTGACCCT GCGGATGACC
GGCACCGCAT TGGTGTTCAA CTGCGAGGAG GAAGCGATGG CCGCGGTCCA GACTGGCCGC
GTCCGGCCAG GCCACGTCGT CGTCATCCGC TACGAGGGTC CGCGCGGGGG TCCAGGGATG
AGGGAAATGC TCGGAGTGAC CTCGGCGCTC ATCGGCCGCG GCCTGGGCAC GTCGGTCGGT
CTGGTGACCG ACGGCCGTTT CTCGGGCGCG ACCAGGGGAC TGATGGTGGG GCACGTCGCT
CCGGAGGCGG CGGAGGGCGG ACCGATCGCG GCGGTGTGCG ACGGTGACCG GATCACCATC
GACCTGCAGC GGCGCGAATG CTCTGTCGAC CTGGATCCGG GCGAACTGGC CGCGCGGATG
CGAGACTGGT CGGCTCCGCC ACCGCGCTAC ACGATCGGCG TCATGGCCAA ATACTGGTCG
ACGGTCTCGT CGGCGGCCGT GGGCGCCGTG ACGACCCCGC ACCCCACCCA GGGCCCGGCG
ACAGCGTCAG GTAAGGCCGA GGAGTGCCAG CAGGCGAGTG CGGTCGAGGG CGTGATGGCG
GTTGGCGGCG GCGATGTCGG TGCTGCCGGC GGTTCGTAG
 
Protein sequence
MTTRPNPLFV GIERAPARAS MRATGLSTED LKKPMIGVAH SWIGTMPCNL NHRRLAQEVM 
AGVRAAGGTP IEINTIAISD VITMGTEGMR TSLVSREVIA DSIELVCRGH GLDGLVTLAG
CDKTIPGAAL AHVRLDIPGA VIYSGTMMPG EHLGRDITLQ DVFEAVGSAT ATGCTDELDK
LERAACPGIG ACAGHYTANT MAVVLEFLGL SPFGSMDPPA VDARKDTVCR QAGELVMRAV
AEGLRPSRFL TPSSLRNAIA AGVATGGSTN MVLHLLAIAR EAGIPLDIDE FDRISSVTPI
IADLRPNGTY TAVDLDRAGG TRVIARHMVD AGLIAGDEST VTGRTVAHEA ADAAETPGQR
VVTTVEAPLS PSGALLILRG NLAPDGSVVK APGAVTLRMT GTALVFNCEE EAMAAVQTGR
VRPGHVVVIR YEGPRGGPGM REMLGVTSAL IGRGLGTSVG LVTDGRFSGA TRGLMVGHVA
PEAAEGGPIA AVCDGDRITI DLQRRECSVD LDPGELAARM RDWSAPPPRY TIGVMAKYWS
TVSSAAVGAV TTPHPTQGPA TASGKAEECQ QASAVEGVMA VGGGDVGAAG GS
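
The sequences above are laid out in groups of 10 characters, 60 per line, so they need to be joined before any computation. The following is a minimal sketch of that cleanup together with a GC-fraction calculation; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the sample is only the first 20 bases of the gene sequence, not the full record.

```python
def clean_sequence(block: str) -> str:
    """Join a whitespace-grouped sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First 20 bases of the gene sequence, in the page's grouped layout.
sample = "GTGACGACCC GGCCTAACCC"
seq = clean_sequence(sample)
print(len(seq), round(gc_fraction(seq), 2))  # prints: 20 0.7
```

Passing the full 30-line gene block through `clean_sequence` should yield a string whose length can be compared with the Gene Length field, and `gc_fraction` on that string can be compared with the reported GC content.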