Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Strop_1017 |
Symbol | |
ID | 5057463 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora tropica CNB-440 |
Kingdom | Bacteria |
Replicon accession | NC_009380 |
Strand | - |
Start bp | 1145798 |
End bp | 1147576 |
Gene Length | 1779 bp |
Protein Length | 592 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640473286 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_001157869 |
Protein GI | 145593572 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.145908 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGACCC GGCCTAACCC ACTCTTCGTC GGCATCGAGC GCGCCCCAGC GCGCGCCTCC ATGCGCGCCA CAGGTCTGTC GACCGAGGAC CTGAAGAAGC CGATGATCGG GGTAGCGCAC AGTTGGATCG GCACGATGCC GTGCAACCTC AACCACCGGC GGCTCGCCCA GGAGGTGATG GCCGGCGTGC GCGCCGCAGG TGGCACCCCG ATCGAGATCA ACACCATCGC GATCTCGGAC GTGATCACCA TGGGCACGGA GGGAATGCGG ACCTCACTGG TGAGCCGCGA GGTGATCGCC GACTCCATCG AGTTGGTATG CCGCGGCCAC GGTCTCGACG GCCTGGTGAC CCTGGCCGGC TGTGACAAGA CCATACCCGG TGCGGCGCTG GCCCACGTGC GACTCGACAT CCCCGGAGCC GTCATCTATT CCGGCACGAT GATGCCGGGC GAGCACCTCG GGCGCGACAT CACCCTGCAG GATGTGTTTG AGGCCGTCGG CAGCGCCACC GCCACCGGCT GCACCGACGA GCTCGACAAG CTCGAGCGCG CGGCCTGTCC CGGTATCGGG GCCTGCGCGG GCCACTACAC GGCCAATACG ATGGCCGTGG TGCTGGAGTT CCTCGGGCTT TCCCCGTTCG GCTCGATGGA TCCGCCGGCA GTCGACGCCC GCAAGGACAC GGTCTGCCGC CAGGCCGGCG AGCTGGTCAT GCGGGCGGTG GCAGAAGGGC TGCGGCCCAG CCGGTTCCTG ACGCCCTCCT CGTTGCGCAA CGCCATCGCC GCCGGGGTCG CCACCGGCGG CTCGACGAAC ATGGTGCTCC ACCTGTTGGC GATCGCTCGG GAGGCGGGCA TCCCGCTGGA CATTGACGAG TTCGACCGGA TCAGCTCGGT GACCCCGATC ATCGCTGACC TGCGTCCAAA CGGGACGTAC ACCGCGGTGG ACCTCGACCG GGCGGGCGGC ACCCGGGTGA TCGCCCGCCA CATGGTCGAC GCGGGCCTGA TTGCTGGCGA CGAAAGCACC GTCACCGGCC GCACCGTCGC ACACGAGGCC GCGGACGCGG CCGAGACACC CGGCCAACGG GTCGTCACCA CGGTGGAGGC ACCCCTATCG CCTTCCGGTG CTCTCCTGAT CTTGCGGGGT AACCTAGCGC CCGATGGCAG TGTGGTGAAG GCACCCGGCG CGGTGACCCT GCGGATGACC GGCACCGCAT TGGTGTTCAA CTGCGAGGAG GAAGCGATGG CCGCGGTCCA GACTGGCCGC GTCCGGCCAG GCCACGTCGT CGTCATCCGC TACGAGGGTC CGCGCGGGGG TCCAGGGATG AGGGAAATGC TCGGAGTGAC CTCGGCGCTC ATCGGCCGCG GCCTGGGCAC GTCGGTCGGT CTGGTGACCG ACGGCCGTTT CTCGGGCGCG ACCAGGGGAC TGATGGTGGG GCACGTCGCT CCGGAGGCGG CGGAGGGCGG ACCGATCGCG GCGGTGTGCG ACGGTGACCG GATCACCATC GACCTGCAGC GGCGCGAATG CTCTGTCGAC CTGGATCCGG GCGAACTGGC CGCGCGGATG CGAGACTGGT CGGCTCCGCC ACCGCGCTAC ACGATCGGCG TCATGGCCAA ATACTGGTCG ACGGTCTCGT CGGCGGCCGT GGGCGCCGTG ACGACCCCGC ACCCCACCCA GGGCCCGGCG ACAGCGTCAG GTAAGGCCGA GGAGTGCCAG CAGGCGAGTG CGGTCGAGGG CGTGATGGCG GTTGGCGGCG GCGATGTCGG TGCTGCCGGC GGTTCGTAG
|
Protein sequence | MTTRPNPLFV GIERAPARAS MRATGLSTED LKKPMIGVAH SWIGTMPCNL NHRRLAQEVM AGVRAAGGTP IEINTIAISD VITMGTEGMR TSLVSREVIA DSIELVCRGH GLDGLVTLAG CDKTIPGAAL AHVRLDIPGA VIYSGTMMPG EHLGRDITLQ DVFEAVGSAT ATGCTDELDK LERAACPGIG ACAGHYTANT MAVVLEFLGL SPFGSMDPPA VDARKDTVCR QAGELVMRAV AEGLRPSRFL TPSSLRNAIA AGVATGGSTN MVLHLLAIAR EAGIPLDIDE FDRISSVTPI IADLRPNGTY TAVDLDRAGG TRVIARHMVD AGLIAGDEST VTGRTVAHEA ADAAETPGQR VVTTVEAPLS PSGALLILRG NLAPDGSVVK APGAVTLRMT GTALVFNCEE EAMAAVQTGR VRPGHVVVIR YEGPRGGPGM REMLGVTSAL IGRGLGTSVG LVTDGRFSGA TRGLMVGHVA PEAAEGGPIA AVCDGDRITI DLQRRECSVD LDPGELAARM RDWSAPPPRY TIGVMAKYWS TVSSAAVGAV TTPHPTQGPA TASGKAEECQ QASAVEGVMA VGGGDVGAAG GS
|
| |