Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4162 |
Symbol | |
ID | 5672517 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4944196 |
End bp | 4947072 |
Gene Length | 2877 bp |
Protein Length | 958 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641243035 |
Product | precorrin-4 C11-methyltransferase |
Protein accession | YP_001508452 |
Protein GI | 158315944 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG1010] Precorrin-3B methylase [COG2073] Cobalamin biosynthesis protein CbiG [COG2875] Precorrin-4 methylase |
TIGRFAM ID | [TIGR01465] precorrin-4 C11-methyltransferase [TIGR01466] precorrin-3B C17-methyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.329616 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.542159 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCACC CCGGCCGCGG CACCGGTCAG CCAGTGGACG GCGCCGGGCC GGGACGCGGG CGGGTCAGTT TCGTCGGTGC CGGGCCGGGT GCCGCCGATC TGATCACCGT GCGCGGCGCG GCGGTTCTCG CCGCCGCCGA CGTGGTGATC TGGGCGTCCT CGCTGGTGCA CCCCGCCATC CTCGGCCATG CCCGTGACGG CGCCGAACTG ATCGACTCCG CGGCGTTGCC GTTGGAGGGC GTCGAGGCGG TCTACCGCCG TGCCGCCGAG CAGGGTCTGC ACGTCGCCCG GGTCCACTCG GGGGATCCGG CGCTGTGGGG GGCGGTGCAG GAGCAGCTGG AGATCTGCGA CCGGCTCGGC CTGGACAGCG ACATCGTGCC CGGGGTGAGC AGCTTCACCG CGCTCGCCGC GGCGTTGCGC CGGGAGCTGA CCGTCCCCGA GGTCGCCCAG TCGGTGATCC TGACCCGGCT CGGGGGTGGG AAGACCCCGA TGCCACCCGG GGAGGAGGTC CGCGACCTCG CCCGGCACGG CACGACGATG GCGCTGTTCC TGTCCGCGGC CCGTTCCGGG CAGCTGCGGG AGGAGCTGCT CGCCGGCGGC TACCCGCCGG ACACGCCCTG TGTCGTCGCC TACCAGGTGA GCTGGCCGGA CGAGCTGATC GTGCGCGCCA CCCTCGACAC GCTCGCCGAC ACGGTCAAGG CGCACCGGCT GTGGAAGCAC ACCCTGGTGC TCGTCGGCCC GGCGCTGGTG GCCGGGGGGC GACGCTCCCA CCTGTACCAC CCGGGGCATT TCCACACCTA CCGGCGGGCG CAGTCCGGTG CGCGCACGGT GCTGCGCGAC GCCGACCGGG AGCTGGCCGC CGCCGGCGCC ACCCAGCTCC CCGCCAGCCC ACACCCCGTC GCCCCGGCCC CGGCCGTGCT GGAACGGGAG CGGGGGGCGC CGGCCGAGCC GGTGGCACCG CAGGGCGCGG CGGTGGACGG CGCGGGCCTG GTGGCGCTGG TGGCGGTGAC CGCCGCCGGG CGGGCCGCGG CCGGTGACCT CACCCGCCGC TGGCCCACCG CCCGGCTCTA CCCGGGCCGG CCCCGCGACG CGATCACCGC CGCGCTCGCG GACGGCGTCA CGGGCATTGT CTGTTTCCTG GCGACGGGGG CGACGGTCCG CCTCCTCGAC GGGCTGCTGC GCGGCAAGGA CCACGACCCG GGTGTGGTGT GTGTGGACGA GGCGCGCCGG TTCGCGGTCG CGTTGTGCGG TGGGCACGCC GGTGGCGCGA ACCGGCTCGC CGAGCAGGTC GCCGACGCCC TCGGCGCCAC ACCGGTGATC ACCACGGCCA GTGACGCCAT CGGGGTGAGC GCCCTGGACG GGTTCGGCGC CGAGCTCGGG TTCACTGTCG AACCGGGTTC GGAGCTGGCC GCCGTCGGCA CGGCCGTCCT GTCGGGTCAT CCCGTCGCCC TGCACGCCGA CGCGGTGTGG CCGCTGCCGC CGCTGCCCCC GTCCGTCACC ACCACTATCA CCGGCACCGG CGCTGTTGAT GGTGGGGCGG TGGGCGTGGA CGGGACGGCG GTGGCGTCGA TTCATGTCAC CGACCGGCTG CTGGGTGCGC CGCCGGACGG GGCCGGCCCG CGGGTGGTCT ACCGGCCCCC GAGCCTGGTC GTCGGGGTCG GGGCGAGCCG CGGCGCGCCC GCCGACGAGA TCGACGTGCT CATCGACACC GCGCTGGCCG GCGCCGGGCT GTCCCCGCTG GCGGTGACCC ATCTGGCGTC GGTGACCGCG AAAGCCGACG AGGTGGGGTT GCTGGAGGTG GCCCGCCGCC GCGGCTGGCC GCTGGTGGTG CACCCCCCGC AGGCCCTGGC CGTGGTCACC GTGCCGCACC CGTCGGAGGT GGTGCGCGCC GCGGTCGGCA CGCCGAGTGT CGCCGAAGCC GCCAGTCTCC TGCCCGCCCC GCCCGCCGCC AGCCCGCCGA CCGCCGCGAT GGGCGGCATC CCGGACGGCG CGACGGCCGG CGAGTCAACC GGCGTACCGG CGGTGACGGC GCAGCTGGTG GTCGCCAAGC AGGTCAGCGC GCACGCGACC GTCGCTGTGG CCCGGCATCG GCCCCGCGGC CGGCTCGCCG TCGTCGGGCT CGGTCCCGGG GACCGTCGGC TACTCACCCC CGCCGCCCGC GCCGAGCTGG CCCGCGCCCG GATCGTGGTC GGGTTGGACC AGTATGTGCG GTCCGTCGCC GATCTGCTGC CCGCCGGGGT CACCGTGCTC GACAGCGGGC TCGGTGACGA GCAGGCCCGC GCCGAGACCG CCGTCGCACA CGCCCGCGCC GGGCATGCCG TCGCGCTCAT CGGCAGCGGT GACGCCGGGG TGTACGCGAT GGGCAGCCCC GCGCTCGACC TGGCCGACGG GTCGTTCGAC GTGGTCGCCG TGCCGGGGGT GACCGCGGCG CTCGCCGCCG CCGCGCTGCT CGGCGCCCCG CTCGGGCACG ACCATGCGCT GATCAGCCTG TCGGACCTGC ACACCCCGTG GGAGCGGATC GTGGGCCGGG TCCGCGCGGT CGCCGAAGCC GACCTGGTCG TCGCGTTCTA CAACCCGCGC AGCCGCACCC GACGCCACCA GCTACCCGAC GCCCTCGACG TGCTCGCCGC GCACCGCCCG GCCGGCACCC CGGTGGGGAT CGTCACCGAC GCGTTTCGGC CCGCCCAGCG GATCACCGTC ACGACCCTCG GTGCCCTCAC CGACCGCACA GCCACCCAGG ACGGTGCGGG TGGGGTTGGT GAGCGGGAGC GGCTGCTGGA CCTCGTCGGG ATGACGACCA CGGTCGTCGT CGGCTCGAGC CAGACCCGGC TGCGCGCCGG CCTGGTCGTC ACCCCACGGG ACTACACGTG GCGCTGA
|
Protein sequence | MSHPGRGTGQ PVDGAGPGRG RVSFVGAGPG AADLITVRGA AVLAAADVVI WASSLVHPAI LGHARDGAEL IDSAALPLEG VEAVYRRAAE QGLHVARVHS GDPALWGAVQ EQLEICDRLG LDSDIVPGVS SFTALAAALR RELTVPEVAQ SVILTRLGGG KTPMPPGEEV RDLARHGTTM ALFLSAARSG QLREELLAGG YPPDTPCVVA YQVSWPDELI VRATLDTLAD TVKAHRLWKH TLVLVGPALV AGGRRSHLYH PGHFHTYRRA QSGARTVLRD ADRELAAAGA TQLPASPHPV APAPAVLERE RGAPAEPVAP QGAAVDGAGL VALVAVTAAG RAAAGDLTRR WPTARLYPGR PRDAITAALA DGVTGIVCFL ATGATVRLLD GLLRGKDHDP GVVCVDEARR FAVALCGGHA GGANRLAEQV ADALGATPVI TTASDAIGVS ALDGFGAELG FTVEPGSELA AVGTAVLSGH PVALHADAVW PLPPLPPSVT TTITGTGAVD GGAVGVDGTA VASIHVTDRL LGAPPDGAGP RVVYRPPSLV VGVGASRGAP ADEIDVLIDT ALAGAGLSPL AVTHLASVTA KADEVGLLEV ARRRGWPLVV HPPQALAVVT VPHPSEVVRA AVGTPSVAEA ASLLPAPPAA SPPTAAMGGI PDGATAGEST GVPAVTAQLV VAKQVSAHAT VAVARHRPRG RLAVVGLGPG DRRLLTPAAR AELARARIVV GLDQYVRSVA DLLPAGVTVL DSGLGDEQAR AETAVAHARA GHAVALIGSG DAGVYAMGSP ALDLADGSFD VVAVPGVTAA LAAAALLGAP LGHDHALISL SDLHTPWERI VGRVRAVAEA DLVVAFYNPR SRTRRHQLPD ALDVLAAHRP AGTPVGIVTD AFRPAQRITV TTLGALTDRT ATQDGAGGVG ERERLLDLVG MTTTVVVGSS QTRLRAGLVV TPRDYTWR
|
| |