Gene Information |
Locus tag | Sare_1121 |
Symbol | |
ID | 5706064 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 1267153 |
End bp | 1269000 |
Gene Length | 1848 bp |
Protein Length | 615 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641270636 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_001536020 |
Protein GI | 159036767 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
|
|
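The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The following is a minimal Python sketch of that consistency check. It assumes the coordinates are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. Neither assumption is stated in the record, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature from 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(1267153, 1269000)
    print(length, protein_length_aa(length))
```

With the values in this record, 1269000 - 1267153 + 1 gives 1848 bp, and 1848 / 3 - 1 gives 615 aa, which agrees with the Gene Length and Protein Length fields.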
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.836437 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000632283 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCTGAGC TGCGGTCGAG GACCTCCACC CACGGTCGGA CGATGGCGGG CGCCCGAGCC CTGTGGCGGG CCACCGGGAT GACCGACGAC GACTTCGGCA AGCCGATCGT TGCCATCGCG AACAGTTTCA CCCAGTTCGT TCCGGGACAC GTCCACCTCA AGGACCTCGG TGGCCTGGTC GCCGACGCGG TAGCCGAGGC GGGCGGGGTG GGCCGGGAGT TCAACACCAT CGCCGTGGAC GACGGCATCG CGATGGGTCA CGGCGGAATG CTCTATTCGC TGCCCAGCCG GGAACTGATC GCCGACGCCG TGGAATACAT GGTCAACGCG CACTGCGCCG ACGCCCTGGT CTGCATCTCC AACTGCGACA AGATCACTCC TGGGATGCTG CTGGCCGCAC TGCGGCTGAA CATCCCGACC GTCTTCGTCT CCGGCGGCCC GATGGAGGCC GGCAAGACGG TCGCGATCGA GGGAATCGTG CATTCCAAGA TCGACCTGAT CGACGCGATG ATCGCCGCGT CCAACGAGGC TGTCACCGAT GACCAGCTCG ACCAGATCGA ACGCTCGGCC TGCCCCACCT GCGGCTCCTG CTCCGGCATG TTCACCGCCA ACTCGATGAA CTGCCTCACC GAGGCGATCG GCCTGGCCCT GCCCGGCAAC GGGTCGACGC TGGCGACCCA CGCCGCCCGC CGGTCGCTCT TCGTCGACGC CGGCCGCACC GTCGTGGAGA TCGCCAAGCG CTGGTACGAC GGTGACGACG CCACGGTGCT GCCCCGCGCG GTCGCCAACC GCGCCGCCTT CGAGAACGCG GTCGCCCTCG ACGTCGCGAT GGGCGGCTCG ACGAACACCA TCCTGCACTT GCTCGCCGCC GCTCGAGAGG CCGAGCTGGA CTTCGGGGTG GCGGACATCG ACACCATCTC CCGGCGGGTG CCCTGCCTGG CCAAGGTCGC ACCGAACTCT CCCCTCTACC ACATGGAGGA CGTCCATCGA GCCGGCGGCA TCCCGGCCAT CCTCGGTGAG CTGGACCGGG CCGGGCTACT CAACCGGGAG GTGCACGCGG TGCACTCCCC CTCGCTGGCA ACCTGGCTCG CTGACTGGGA CGTTCGCGGC GACGCGGCGA CACCGGAGGC AGTCGACCTG TTCCACGCCG CACCAGGTGG GGTACGCACC GTCGAGCCGT TCTCCACCAC CAACCGCTGG TCCACGCTGG ACACCGACGC GGCCGGCGGC TGCGTACGGG ACCGGGCGCA CGCGTACACC GCTGACGGCG GCTTGGCCAT CCTGCACGGC AACCTCGCAC CGGACGGCTG TGTGGTGAAG ACCGCCGGTG TGCCCGAGGA GTGCCTCACC TTCCGCGGCC CGGCCAGGGT CTACGAATCC CAGGACGATG CCGTCGCCGC CATCCTGGCC AAGGAGGTGA CCGCCGGGGA CGTCGTGGTG ATCCGCTACG AGGGCCCCCG GGGCGGCCCC GGGATGCAGG AGATGCTCTA CCCCACCTCG TTCCTCAAGG GCCGGGGGCT GGGGCGGGCC TGCGCGCTGC TCACCGACGG ACGGTTCTCC GGCGGTACCT CCGGACTGTC CATCGGGCAC GTCTCCCCCG AGGCTGCCGC TGGCGGGCTG ATCGCCCTGG TCGAACCGGG CGACGAGATC GTCATCGACA TTCCGAACCG CACCATCGAG CTCGCTGTTC CGGCCGACGT GTTGGACGCC CGTCGGGTGG CGCAGGAGAA GCGGGACCGC CCGTACACGC CCGCGGATCG TCAGCGCCCC GTGTCCGCGG CGCTACGCGC GTACGCCTCG 
ATGGCGACCT CCGCCAGCGA CGGTGCCTAC CGCCGCGTCC CCGAGTAG
|
Protein sequence | MPELRSRTST HGRTMAGARA LWRATGMTDD DFGKPIVAIA NSFTQFVPGH VHLKDLGGLV ADAVAEAGGV GREFNTIAVD DGIAMGHGGM LYSLPSRELI ADAVEYMVNA HCADALVCIS NCDKITPGML LAALRLNIPT VFVSGGPMEA GKTVAIEGIV HSKIDLIDAM IAASNEAVTD DQLDQIERSA CPTCGSCSGM FTANSMNCLT EAIGLALPGN GSTLATHAAR RSLFVDAGRT VVEIAKRWYD GDDATVLPRA VANRAAFENA VALDVAMGGS TNTILHLLAA AREAELDFGV ADIDTISRRV PCLAKVAPNS PLYHMEDVHR AGGIPAILGE LDRAGLLNRE VHAVHSPSLA TWLADWDVRG DAATPEAVDL FHAAPGGVRT VEPFSTTNRW STLDTDAAGG CVRDRAHAYT ADGGLAILHG NLAPDGCVVK TAGVPEECLT FRGPARVYES QDDAVAAILA KEVTAGDVVV IRYEGPRGGP GMQEMLYPTS FLKGRGLGRA CALLTDGRFS GGTSGLSIGH VSPEAAAGGL IALVEPGDEI VIDIPNRTIE LAVPADVLDA RRVAQEKRDR PYTPADRQRP VSAALRAYAS MATSASDGAY RRVPE
|
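The protein sequence above can in principle be rederived from the gene sequence using the listed translation table (11), and the GC content field can be checked the same way. The sketch below assumes that the gene sequence is already given in coding orientation (it starts with ATG and ends with TAG, even though the gene lies on the minus strand) and that the spaces are display grouping only. The helper names are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard code, so a single 64-codon table is used here.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with their amino acids.
# Table 11 shares these assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Drop the whitespace used for display grouping and upper-case the bases."""
    return "".join(raw.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    start = clean_sequence("ATGCCTGAGC TGCGG")  # first 15 bases of the gene sequence
    print(translate(start))
```

Applied to the first 15 bases of the gene sequence, `translate` returns the first five residues of the listed protein (MPELR). Running `gc_fraction` on the full cleaned sequence gives a value that can be compared against the reported GC content of 70%.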
| |