Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_7305 |
Symbol | |
ID | 5675606 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 8927975 |
End bp | 8929828 |
Gene Length | 1854 bp |
Protein Length | 617 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641246142 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_001511530 |
Protein GI | 158319022 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.150096 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGCTT TGCGTTCCCG CACCACCACC CACGGTCGAA ACATGGCCGG TGCCCGCGCC CTGTGGCGGG CGACCGGCAT GACCGACGAC GACTTCGGCA AGCCGATCGT CGCGATCGCG AACAGCTTCA CGCAGTTCGT TCCCGGCCAT GTCCACCTGC GGGACCTCGG GAAGATCGTC GCGGACGCGG TGGCCGGGTC GGGCGGGGTG GCCAAGGAGT TCAACACGAT CGCCGTCGAC GACGGCATCG CGATGGGCCA CGGCGGCATG CTGTACTCCC TGCCATCCCG GGAGATCATC GCGGACAGCG TCGAGTACAT GGTCAACGCC CACTGCGCCG ATGCGCTGGT CTGCATCTCG AACTGTGACA AGATCACCCC GGGGATGCTG ATCGCCGCGC TGCGGCTCAA CATCCCGACC GTCTTCGTCT CCGGCGGCGC GATGGAATCC GGTCACGCCG TCGTCACCGG CGGGATCGTG CGGTCGCGGC TCGACCTGAT CGACGCGATG ACCGCGGCCG TCAACCCCGA CGTCAGCGAC GCCGACCTGG ACACCATCGA GCGGTCCGCC TGCCCCACCT GCGGCTCCTG CTCCGGCATG TTCACGGCCA ACTCGATGAA CTGCCTGACC GAGGCGCTCG GCCTGGCGCT GCCGGGCAAC GGCTCGACGC TGGCCACCGC GGCGGCCCGG CGCGGCCTGT TCGTCGAGGC CGGTCGCCTG GTCGTCGACC TGGCCCGCCG GTACTACGAG AAGGACGACG AGGCCGTCCT ACCCCGCTCG ATCGCGAGCG CGGCGGCGTT CCGCAACGCG TTCGCGGTGG ACGTCGCCAT GGGCGGTTCG ACGAACACCG TCCTGCACCT GCTCGCCGCC GCCGTCGAGG CCGGGGTCGA GGTGACCCTC GACGACATCG ACCAGGTCTC CCGCTCGGTG GCCTGCCTGT GCAAGGTGGC GCCCAGCTCG ACCGACTACT ACATGGAAGA CGTCCACCGG GCCGGCGGGA TCCCGGCGAT CCTCGGTGAG CTCGACCGCG GCGGTCTGGT GGACCCGAAC GTGCACAGCG TGCACGCGGC GAGCCTGCGC GAGTTCCTCG ACCGCTGGGA CGTGCGCGGG GCGGACCCGT CCCCGGACGC GATCGAGCTG TTCCACGCCG CGCCCGGCGG CGTGCGCACC GTCGAACCGT TCGGCTCGAC GAACCGCTGG GACACCCTCG ACACCGACGC GAAGAACGGC TGCATCCGCT CGGTCGAGCA CGCCTACTCG GCCGACGGCG GCCTGGCCGT GCTGCGCGGC AACCTGGCCC CCGACGGCGC CGTGGTGAAG ACGGCCGGCG TCGACGAGAG CCAGTGGACG TTCCGCGGGC CCGCGCTGGT CGTCGAGAGC CAGGAGGCCG CGGTCGACGC GATCCTGAAC AAGGTCGTCA AGGCGGGCGA CGTGATCATC GTCCGGTATG AGGGCCCCCG CGGTGGGCCC GGCATGCAGG AGATGCTCTA CCCGACGGCG TTCCTCAAGG GCCGCGGCCT CGGGCCGAAG TGCGCGCTGA TCACCGATGG CCGCTTCTCC GGTGGCAGCT CGGGCCTGTC GATCGGCCAC GTCTCCCCGG AGGCGGCGCA CGGTGGCCCG ATCGCGCTCG TCCGGGACGG TGATCTCGTC GAGATCGACA TCCCGCGGCG GCGGATCGAC CTGCTGGTGC CGGACGCCGA GCTCGCCGCG CGGCGGGCCG AGATCGAGGC GAACGGCGGC TACCACCCGG CGAACAGGGA GCGTGTCGTG TCGGCCGCGC TGCGCGCCTA CGCGGCCATG GCGACGTCCG CCTCGACCGG TGCCGCCCGT GACGTCCGGC TCATCACGGG ATGA
|
Protein sequence | MPALRSRTTT HGRNMAGARA LWRATGMTDD DFGKPIVAIA NSFTQFVPGH VHLRDLGKIV ADAVAGSGGV AKEFNTIAVD DGIAMGHGGM LYSLPSREII ADSVEYMVNA HCADALVCIS NCDKITPGML IAALRLNIPT VFVSGGAMES GHAVVTGGIV RSRLDLIDAM TAAVNPDVSD ADLDTIERSA CPTCGSCSGM FTANSMNCLT EALGLALPGN GSTLATAAAR RGLFVEAGRL VVDLARRYYE KDDEAVLPRS IASAAAFRNA FAVDVAMGGS TNTVLHLLAA AVEAGVEVTL DDIDQVSRSV ACLCKVAPSS TDYYMEDVHR AGGIPAILGE LDRGGLVDPN VHSVHAASLR EFLDRWDVRG ADPSPDAIEL FHAAPGGVRT VEPFGSTNRW DTLDTDAKNG CIRSVEHAYS ADGGLAVLRG NLAPDGAVVK TAGVDESQWT FRGPALVVES QEAAVDAILN KVVKAGDVII VRYEGPRGGP GMQEMLYPTA FLKGRGLGPK CALITDGRFS GGSSGLSIGH VSPEAAHGGP IALVRDGDLV EIDIPRRRID LLVPDAELAA RRAEIEANGG YHPANRERVV SAALRAYAAM ATSASTGAAR DVRLITG
|
| |