Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_4515 |
Symbol | |
ID | 3907492 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 5389611 |
End bp | 5391464 |
Gene Length | 1854 bp |
Protein Length | 617 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637881848 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_483590 |
Protein GI | 86743190 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.133337 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGCCC TGCGCTCCCG CACCACCACC CACGGCCGGA ACATGGCCGG CGCCCGCGCC CTGTGGCGCG CGACCGGGAT GACCGACGAC GACTTTGGCA AGCCGATCGT CGCCGTGGCT AACAGCTTCA CCGAGTTCGT CCCGGGGCAT GTGCACTTGC GTAACCTTGG CTCGCTGGTG GCCGGGGCGG TGGCCGAGGC CGGTGGGGTG GCGCGCGAGT TCAACACCAT TGCCGTGGAC GACGGCATCG CGATGGGGCA TGGCGGGATG CTCTACTCGC TGCCGTCCCG CGAGCTCATC GCCGACAGCG TCGAGTACAT GGTGAACGCC CACTGCGCCG ACGCCCTGGT CTGCATCTCC AACTGCGACA AGATCACCCC GGGGATGCTG CTCGCGGCAC TGCGGCTGAA CATCCCGACC GTGTTCGTCT CCGGCGGAGC GATGGAGTCG GGCAACGCGG TCATCTCCGG CGGCACGGCT CGGTCCAGGC TGGACCTCAT CACCGCGATG TCGGCGGCGG TCAACCCGGA CGTCTCGGAC GGCGACCTGT CGACGATCGA GCGTTCGGCC TGCCCGACGT GTGGATCCTG CTCCGGCATG TTCACGGCGA ACTCGATGAA CTGCCTGACC GAGGCGATCG GGCTGTCCCT GCCCGGCAAC GGGTCGACCC TGGCCACCGC CGCCGCCCGC CGTGAGCTGT TCGTCGAGGC CGGGCGCCTC GTGGTCGACC TGGCGCGGCG CTATTACGAG AAGGACGACG AGGCGGTCCT GCCCCGGTCG ATCGCGACCG CCGCCGCCTT CCGTAACGCC TTCGCGGTCG ACGTGGCGAT GGGCGGCTCG ACGAACACCG TGCTGCATCT GTTGGCCGCC GCCGTCGAGG CCGGCGTTGA CGTCACCCTC GCCGACATCG ACCAGATCTC CCGCACCGTC CCCTGCCTGT GCAAGGTGGC GCCGAGCTCC ACCCGTTACT ACATGGAGGA CGTCCACCGG GCGGGCGGCA TCCCCGCGAT CCTCGGCGAG CTCGACCGGG CCGGGCTGCT CGACCCGGAC CCGCACACGG TGCACTCCGC GAGCCTGCGC GAGTTCCTCG ACCGCTGGGA CGTCCGCGGC CCGAGCCCCT CGCCGGACGC GATCGAGCTG TTCCACGCGG CGCCGGGTGG CGTGCGCACG ATCGAGCCGT TCAGCTCCAC CAATCGGTGG GACACCCTTG ACACCGACGC CAGGGACGGT TGCATCCGTT CGGTCGAGCA CGCCTACTCC GCCGAGGGTG GGCTCGCGGT GCTGTTCGGC AACCTGGCCG TCGAGGGCGC CGTCGTGAAG ACGGCCGGTG TGGACGAGGG CCAGTGGACC TTCCGCGGCC CGGCGCTCGT GGTCGAGAGC CAGGAAGAAG CGGTCGACGC CATCCTCACC GGGCGGGTCA AGGCCGGGAA TGTGATCATC GTCCGCTACG AGGGCCCTCG CGGCGGTCCG GGGATGCAGG AGATGCTCTA CCCCACCGCG TTCCTCAAGG GCCGCGGTCT CGGCCCGAAG TGCGCCCTGA TCACCGACGG CCGGTTCTCC GGCGGGAGCT CCGGACTGTC GATCGGTCAC GTCTCCCCGG AGGCGGCCCA CGGCGGGACG ATCGCCCTGG TCCGCGACGG GGACATCATC GAGATCGACA TCCCGGCCCG CCGGCTGGAG CTCGTGGTCT CCGACGAGGA GCTGGCGAGC CGACGCGCGG CGCTGGAGGC GGCCGGCGGC TACCGTCCCA CCGGGCGGGA ACGGCCGGTG TCCATGGCGC TGCGGGCCTA TGCGGCGATG GCGACCTCGG CCTCCACCGG TGCCGCGCGC GACGTCGGTC TGCTCGGCGG CTGA
|
Protein sequence | MPALRSRTTT HGRNMAGARA LWRATGMTDD DFGKPIVAVA NSFTEFVPGH VHLRNLGSLV AGAVAEAGGV AREFNTIAVD DGIAMGHGGM LYSLPSRELI ADSVEYMVNA HCADALVCIS NCDKITPGML LAALRLNIPT VFVSGGAMES GNAVISGGTA RSRLDLITAM SAAVNPDVSD GDLSTIERSA CPTCGSCSGM FTANSMNCLT EAIGLSLPGN GSTLATAAAR RELFVEAGRL VVDLARRYYE KDDEAVLPRS IATAAAFRNA FAVDVAMGGS TNTVLHLLAA AVEAGVDVTL ADIDQISRTV PCLCKVAPSS TRYYMEDVHR AGGIPAILGE LDRAGLLDPD PHTVHSASLR EFLDRWDVRG PSPSPDAIEL FHAAPGGVRT IEPFSSTNRW DTLDTDARDG CIRSVEHAYS AEGGLAVLFG NLAVEGAVVK TAGVDEGQWT FRGPALVVES QEEAVDAILT GRVKAGNVII VRYEGPRGGP GMQEMLYPTA FLKGRGLGPK CALITDGRFS GGSSGLSIGH VSPEAAHGGT IALVRDGDII EIDIPARRLE LVVSDEELAS RRAALEAAGG YRPTGRERPV SMALRAYAAM ATSASTGAAR DVGLLGG
|
| |