Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0844 |
Symbol | fumC |
ID | 5669260 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 989387 |
End bp | 990778 |
Gene Length | 1392 bp |
Protein Length | 463 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641239773 |
Product | fumarate hydratase |
Protein accession | YP_001505208 |
Protein GI | 158312700 |
COG category | [C] Energy production and conversion |
COG ID | [COG0114] Fumarase |
TIGRFAM ID | [TIGR00979] fumarate hydratase, class II |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.921062 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGAGC AGGAATACCG CATCGAGCAC GACAGCATGG GTGACGTGCG GGTGCCGGCC GCGGCGAAGT GGCGAGCGCA GACGCAGCGG GCGGTCGAGA ACTTCCCCGT CTCCGGCCAG CGCATCTCGC GCGCCCACAT CGCGGCGCTC GCGCGGATCA AGGCCGCGGC CGCGACCGTG AACGCCAAGC TCGGCGTTCT CGACGGTGAC ATCGCCGCGG CGATCGCCGA GGCCGCCGGC GAGGTGGCCC GGGGCGACTG GGACACCCAG TTCCCGCTCG ACGTCTTCCA GACCGGCAGC GGCACGTCGT CCAACATGAA CACCAACGAG GTGCTGGCGT CGCTCGCCTC GGAGCGGCTC GGCCGGGCCG TTCACCCGAA CGACCACGTG AACGCCTCGC AGTCGTCGAA CGACGTGTTC CCGTCCTCGA TCCACGTGGC GGCGACCCAG GGGATCGTGC ACGAGCTGAT CCCCGCACTC GATCACCTCG CCGTCTCACT GGAGCGGAAG GCGACCGAGT TCGCCGAGGT GGTCAAGTCG GGGCGCACCC ACCTCATGGA CGCCACGCCG GTGACCCTCG GCCAGGAGTT CGGCGGGTAC GCGGCGGCCG TCCGGCTGGG TGTCGAGCGG CTCAACGACG CGCTGCCCCG GATCGGCGAG CTGCCGCTGG GCGGCACCGC GGTCGGCACC GGCATCAACA CGCCGCCGGG GTTCTCGGCG GCGGTCATCG AGGAGCTGGC GGCGAGCACC GGGCTCCCGC TCACCGAAGC CCGCAACCAC TTCGAGGCCC AGGCCTCGCG GGACGGCCTC GTCGAGGCGT CCGGCGTGCT GCGGGTGATC GCGGTGTCGC TCTACAAGAT CTCGAACGAC CTGCGGTGGG CGTCCTCGGG CCCGCGGGCC GGGCTGGGCG AGATCAGGCT GCCGGACCTG CAGCCCGGGT CGTCGATCAT GCCCGGCAAG GTGAACCCGG TCATCCCCGA GGCGGTCTGC CAGGTGGTGG CCCAGGTGGT CGGCAACGAC GCGGCGGTGG CGTTCGGTGG CTCGGCCGGC AACTTCGAGC TCAACGTGAT GCTGCCGGTC ATCGCCCGCA ACCTGCTCGA GTCGATCCAC ATCCTGTCGA CGATCAGCCG TCTGTTCGCC GACCGCTGCA TCGACGGCAT CGTCGCGGAC GTCGAGCGCT GCCGCCGCTA CGCGGAGTCC TCGCCGTCGG TCGTGACGCC GCTCAACAAG TACATCGGCT ACGAGGAGGC GGCCAAGGTC GCCAAGCAGT CGCTGGCCGA GGAGAAGACG ATCCGCGAGG TCGTGATCGA GCGCGGGTAC GTCCAGGCCG GCAAGCTCAC CGAGGCCGAG CTGGACAGCG CGCTCGACGT CCTGTCGATG ACGCACCCGT AG
|
Protein sequence | MSEQEYRIEH DSMGDVRVPA AAKWRAQTQR AVENFPVSGQ RISRAHIAAL ARIKAAAATV NAKLGVLDGD IAAAIAEAAG EVARGDWDTQ FPLDVFQTGS GTSSNMNTNE VLASLASERL GRAVHPNDHV NASQSSNDVF PSSIHVAATQ GIVHELIPAL DHLAVSLERK ATEFAEVVKS GRTHLMDATP VTLGQEFGGY AAAVRLGVER LNDALPRIGE LPLGGTAVGT GINTPPGFSA AVIEELAAST GLPLTEARNH FEAQASRDGL VEASGVLRVI AVSLYKISND LRWASSGPRA GLGEIRLPDL QPGSSIMPGK VNPVIPEAVC QVVAQVVGND AAVAFGGSAG NFELNVMLPV IARNLLESIH ILSTISRLFA DRCIDGIVAD VERCRRYAES SPSVVTPLNK YIGYEEAAKV AKQSLAEEKT IREVVIERGY VQAGKLTEAE LDSALDVLSM THP
|
| |