Gene Franean1_4162 details

Gene Information

Locus tag: Franean1_4162
Symbol:
ID: 5672517
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4944196
End bp: 4947072
Gene Length: 2877 bp
Protein Length: 958 aa
Translation table: 11
GC content: 76%
IMG OID: 641243035
Product: precorrin-4 C11-methyltransferase
Protein accession: YP_001508452
Protein GI: 158315944
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1010] Precorrin-3B methylase
        [COG2073] Cobalamin biosynthesis protein CbiG
        [COG2875] Precorrin-4 methylase
TIGRFAM ID: [TIGR01465] precorrin-4 C11-methyltransferase
            [TIGR01466] precorrin-3B C17-methyltransferase
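
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for this illustration.

```python
# Values copied from the record above.
START_BP = 4944196
END_BP = 4947072


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Amino-acid count, assuming one trailing stop codon that is not translated."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


gene_len = gene_length(START_BP, END_BP)
prot_len = protein_length(gene_len)
print(gene_len, prot_len)  # prints: 2877 958
```

Under these assumptions the computed values agree with the listed Gene Length (2877 bp) and Protein Length (958 aa).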


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.329616
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.542159
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCACC CCGGCCGCGG CACCGGTCAG CCAGTGGACG GCGCCGGGCC GGGACGCGGG 
CGGGTCAGTT TCGTCGGTGC CGGGCCGGGT GCCGCCGATC TGATCACCGT GCGCGGCGCG
GCGGTTCTCG CCGCCGCCGA CGTGGTGATC TGGGCGTCCT CGCTGGTGCA CCCCGCCATC
CTCGGCCATG CCCGTGACGG CGCCGAACTG ATCGACTCCG CGGCGTTGCC GTTGGAGGGC
GTCGAGGCGG TCTACCGCCG TGCCGCCGAG CAGGGTCTGC ACGTCGCCCG GGTCCACTCG
GGGGATCCGG CGCTGTGGGG GGCGGTGCAG GAGCAGCTGG AGATCTGCGA CCGGCTCGGC
CTGGACAGCG ACATCGTGCC CGGGGTGAGC AGCTTCACCG CGCTCGCCGC GGCGTTGCGC
CGGGAGCTGA CCGTCCCCGA GGTCGCCCAG TCGGTGATCC TGACCCGGCT CGGGGGTGGG
AAGACCCCGA TGCCACCCGG GGAGGAGGTC CGCGACCTCG CCCGGCACGG CACGACGATG
GCGCTGTTCC TGTCCGCGGC CCGTTCCGGG CAGCTGCGGG AGGAGCTGCT CGCCGGCGGC
TACCCGCCGG ACACGCCCTG TGTCGTCGCC TACCAGGTGA GCTGGCCGGA CGAGCTGATC
GTGCGCGCCA CCCTCGACAC GCTCGCCGAC ACGGTCAAGG CGCACCGGCT GTGGAAGCAC
ACCCTGGTGC TCGTCGGCCC GGCGCTGGTG GCCGGGGGGC GACGCTCCCA CCTGTACCAC
CCGGGGCATT TCCACACCTA CCGGCGGGCG CAGTCCGGTG CGCGCACGGT GCTGCGCGAC
GCCGACCGGG AGCTGGCCGC CGCCGGCGCC ACCCAGCTCC CCGCCAGCCC ACACCCCGTC
GCCCCGGCCC CGGCCGTGCT GGAACGGGAG CGGGGGGCGC CGGCCGAGCC GGTGGCACCG
CAGGGCGCGG CGGTGGACGG CGCGGGCCTG GTGGCGCTGG TGGCGGTGAC CGCCGCCGGG
CGGGCCGCGG CCGGTGACCT CACCCGCCGC TGGCCCACCG CCCGGCTCTA CCCGGGCCGG
CCCCGCGACG CGATCACCGC CGCGCTCGCG GACGGCGTCA CGGGCATTGT CTGTTTCCTG
GCGACGGGGG CGACGGTCCG CCTCCTCGAC GGGCTGCTGC GCGGCAAGGA CCACGACCCG
GGTGTGGTGT GTGTGGACGA GGCGCGCCGG TTCGCGGTCG CGTTGTGCGG TGGGCACGCC
GGTGGCGCGA ACCGGCTCGC CGAGCAGGTC GCCGACGCCC TCGGCGCCAC ACCGGTGATC
ACCACGGCCA GTGACGCCAT CGGGGTGAGC GCCCTGGACG GGTTCGGCGC CGAGCTCGGG
TTCACTGTCG AACCGGGTTC GGAGCTGGCC GCCGTCGGCA CGGCCGTCCT GTCGGGTCAT
CCCGTCGCCC TGCACGCCGA CGCGGTGTGG CCGCTGCCGC CGCTGCCCCC GTCCGTCACC
ACCACTATCA CCGGCACCGG CGCTGTTGAT GGTGGGGCGG TGGGCGTGGA CGGGACGGCG
GTGGCGTCGA TTCATGTCAC CGACCGGCTG CTGGGTGCGC CGCCGGACGG GGCCGGCCCG
CGGGTGGTCT ACCGGCCCCC GAGCCTGGTC GTCGGGGTCG GGGCGAGCCG CGGCGCGCCC
GCCGACGAGA TCGACGTGCT CATCGACACC GCGCTGGCCG GCGCCGGGCT GTCCCCGCTG
GCGGTGACCC ATCTGGCGTC GGTGACCGCG AAAGCCGACG AGGTGGGGTT GCTGGAGGTG
GCCCGCCGCC GCGGCTGGCC GCTGGTGGTG CACCCCCCGC AGGCCCTGGC CGTGGTCACC
GTGCCGCACC CGTCGGAGGT GGTGCGCGCC GCGGTCGGCA CGCCGAGTGT CGCCGAAGCC
GCCAGTCTCC TGCCCGCCCC GCCCGCCGCC AGCCCGCCGA CCGCCGCGAT GGGCGGCATC
CCGGACGGCG CGACGGCCGG CGAGTCAACC GGCGTACCGG CGGTGACGGC GCAGCTGGTG
GTCGCCAAGC AGGTCAGCGC GCACGCGACC GTCGCTGTGG CCCGGCATCG GCCCCGCGGC
CGGCTCGCCG TCGTCGGGCT CGGTCCCGGG GACCGTCGGC TACTCACCCC CGCCGCCCGC
GCCGAGCTGG CCCGCGCCCG GATCGTGGTC GGGTTGGACC AGTATGTGCG GTCCGTCGCC
GATCTGCTGC CCGCCGGGGT CACCGTGCTC GACAGCGGGC TCGGTGACGA GCAGGCCCGC
GCCGAGACCG CCGTCGCACA CGCCCGCGCC GGGCATGCCG TCGCGCTCAT CGGCAGCGGT
GACGCCGGGG TGTACGCGAT GGGCAGCCCC GCGCTCGACC TGGCCGACGG GTCGTTCGAC
GTGGTCGCCG TGCCGGGGGT GACCGCGGCG CTCGCCGCCG CCGCGCTGCT CGGCGCCCCG
CTCGGGCACG ACCATGCGCT GATCAGCCTG TCGGACCTGC ACACCCCGTG GGAGCGGATC
GTGGGCCGGG TCCGCGCGGT CGCCGAAGCC GACCTGGTCG TCGCGTTCTA CAACCCGCGC
AGCCGCACCC GACGCCACCA GCTACCCGAC GCCCTCGACG TGCTCGCCGC GCACCGCCCG
GCCGGCACCC CGGTGGGGAT CGTCACCGAC GCGTTTCGGC CCGCCCAGCG GATCACCGTC
ACGACCCTCG GTGCCCTCAC CGACCGCACA GCCACCCAGG ACGGTGCGGG TGGGGTTGGT
GAGCGGGAGC GGCTGCTGGA CCTCGTCGGG ATGACGACCA CGGTCGTCGT CGGCTCGAGC
CAGACCCGGC TGCGCGCCGG CCTGGTCGTC ACCCCACGGG ACTACACGTG GCGCTGA
 
Protein sequence
MSHPGRGTGQ PVDGAGPGRG RVSFVGAGPG AADLITVRGA AVLAAADVVI WASSLVHPAI 
LGHARDGAEL IDSAALPLEG VEAVYRRAAE QGLHVARVHS GDPALWGAVQ EQLEICDRLG
LDSDIVPGVS SFTALAAALR RELTVPEVAQ SVILTRLGGG KTPMPPGEEV RDLARHGTTM
ALFLSAARSG QLREELLAGG YPPDTPCVVA YQVSWPDELI VRATLDTLAD TVKAHRLWKH
TLVLVGPALV AGGRRSHLYH PGHFHTYRRA QSGARTVLRD ADRELAAAGA TQLPASPHPV
APAPAVLERE RGAPAEPVAP QGAAVDGAGL VALVAVTAAG RAAAGDLTRR WPTARLYPGR
PRDAITAALA DGVTGIVCFL ATGATVRLLD GLLRGKDHDP GVVCVDEARR FAVALCGGHA
GGANRLAEQV ADALGATPVI TTASDAIGVS ALDGFGAELG FTVEPGSELA AVGTAVLSGH
PVALHADAVW PLPPLPPSVT TTITGTGAVD GGAVGVDGTA VASIHVTDRL LGAPPDGAGP
RVVYRPPSLV VGVGASRGAP ADEIDVLIDT ALAGAGLSPL AVTHLASVTA KADEVGLLEV
ARRRGWPLVV HPPQALAVVT VPHPSEVVRA AVGTPSVAEA ASLLPAPPAA SPPTAAMGGI
PDGATAGEST GVPAVTAQLV VAKQVSAHAT VAVARHRPRG RLAVVGLGPG DRRLLTPAAR
AELARARIVV GLDQYVRSVA DLLPAGVTVL DSGLGDEQAR AETAVAHARA GHAVALIGSG
DAGVYAMGSP ALDLADGSFD VVAVPGVTAA LAAAALLGAP LGHDHALISL SDLHTPWERI
VGRVRAVAEA DLVVAFYNPR SRTRRHQLPD ALDVLAAHRP AGTPVGIVTD AFRPAQRITV
TTLGALTDRT ATQDGAGGVG ERERLLDLVG MTTTVVVGSS QTRLRAGLVV TPRDYTWR
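
The sequences above are printed in space-separated blocks of ten, so they need light cleaning before use. The sketch below shows one way to strip that layout, compute a GC fraction, and translate a nucleotide string. It is a sketch under stated assumptions, not the database's own method. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard assignment in NCBI "TCAG" order, which translation table 11 shares for internal codons. Table 11 differs in the start codons it permits, which does not matter here because this gene begins with ATG.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGAGCCACC CCGGCCGCGG CACCGGTCAG"
print(translate(first_blocks))  # prints: MSHPGRGTGQ
```

The translated fragment matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the reported GC content.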