Gene Francci3_1521 details

Gene Information

Locus tag: Francci3_1521
Symbol:
ID: 3904987
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 1821425
End bp: 1823005
Gene Length: 1581 bp
Protein Length: 526 aa
Translation table: 11
GC content: 74%
IMG OID: 637878858
Product: precorrin-2 C20-methyltransferase / cobalt-factor II C20-methyltransferase / precorrin-3 methyltransferase
Protein accession: YP_480626
Protein GI: 86740226
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1010] Precorrin-3B methylase; [COG2243] Precorrin-2 methylase
TIGRFAM ID: [TIGR01466] precorrin-3B C17-methyltransferase; [TIGR01467] precorrin-2 C20-methyltransferase
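
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
START_BP = 1821425
END_BP = 1823005


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints "1581 526"
```

Both results match the listed Gene Length (1581 bp) and Protein Length (526 aa).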


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0252228
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGCCGT CGAAGGGCCG GCTGTGGGGA GTCGGGGTTG GACCGGGGGA TCCCGAGCTC 
GTCACCCTCA AGGCGGCCCG CCTGATCCGT GACGCCGACG TGATCGCCTA CCACAGCGCC
CGGCACGGCC GCAGCATCGC CCGGTCGGTC GCCGCGAGCC AGCTGCGCGG CGACCAGATC
GAGGAGGCGC TGGTCTACCC GGTCACCACC GAGACGACCT CCCACCCCGG TGGCTACCGC
GGCGCGATCG ACGAGTTCTA CGAGGACTGC GCCAAGCGGC TGGCCGTCCA CCTCGACGCC
GGCCGGGACG TCGTGGTCCT CAGCGAGGGT GACCCGTTCT TCTACGGCTC GTTCATCCAC
CTGCACCGGC GCCTCGCCGA CCGGTACCCG ACCGAGGTCG TGCCGGGGGT GACGTCCCTG
TCGGCGGGGT GCGCGGTGCT CGGCCGGCCG CTGGTCGAGG GCAACGAAGT CCTCACCGTG
CTGCCGGGCA CGCTGCCGCC GACGGTCCTC GCCGAGCGTA TCGCCGGCAC GGACACCGCC
GTCGTGCTCA AGATGGGACG GACCTTCCCG GGGGTCCGGG ACGCCTTCAC CGCCGCGGGA
CGTCTCGCGG ACACCTGGTA CGTGGAACGG GCCACCACCT CCGGCCAGCG CATCGCCCCG
CTCGGTGCGG TAGACCCGGC CACGGTGCCG TACTTCTCGC TCGCGGTACT GCCGAGCCCG
GTCCGGGGGC CGGACGATCC GGCGCCGCTG CGCGCGTCGC AGGCGGGCTG GGTACCAACG
GCGCCCACCG TTGGTAAGGC TGGCGCCGTT GGTACCGGCG GCACGCCCGG CGCGGGGGAG
GTCGTCGTGG TGGGCCTCGG ACCGGGCGCC GCGGGCTGGT TGACGCCGCA GGCCGCCGAG
GCGCTGGCCG CCGCCGACGA CCTCATCGGC TACGGCCCCT ACCTCGACCG GGTGCCGGTC
GATCAACGCC AGCGCCGGCA CGCCTCGGGG AACACCGTCG AGGCCGAGCG CGCCGAGCTC
GCCCTCGAAC TGGCGGCCGG CGGCGCGAAC GTCGCCGTGG TCTCCTCCGG TGATCCCGGG
GTCTTCGCGA TGGCCACGGC CGTCGTCGAG GCCGCGGCGG CGGAGCGGTT CGCGGGCGTC
GAGGTGCGGG TCGTGCCCGG GCTGACCGCC GCGCAGGCGG TGGCGAGCCG GGTCGGGGCG
CCACTCGGCC ACGACTTCTG CGTGCTGTCG CTGTCCGACC GACTCAAGCC GTGGGAGGTC
ATCGAGCGGC GGCTGCGGGC GGCGGCCGCG GGCGACTTCG TCCTGGCGCT GTACAACCCG
GCTTCCCGTA CCCGTCGCCA TCAGCTGGAG CGGGCCCACG AGGTGCTGCT CGAACACCGC
CCCTCGGACA CGGTGGTGGT TATCGGCCGG GATGTCGGCG GTCCCACCGA GAGCATCACC
GTGACGAGCC TCGGTGCGTT CGACCCGGCC GAGGTCGACA TGCGCTGCCT GCTGCTCATC
GGCTCGTCGA CGACCCGGGT GGTGCGCCGT GGCCCCGGCC GGGACCTCGT GTTCACTCCA
CGTCGTTACC CGGTCACCTA G
 
Protein sequence
MVPSKGRLWG VGVGPGDPEL VTLKAARLIR DADVIAYHSA RHGRSIARSV AASQLRGDQI 
EEALVYPVTT ETTSHPGGYR GAIDEFYEDC AKRLAVHLDA GRDVVVLSEG DPFFYGSFIH
LHRRLADRYP TEVVPGVTSL SAGCAVLGRP LVEGNEVLTV LPGTLPPTVL AERIAGTDTA
VVLKMGRTFP GVRDAFTAAG RLADTWYVER ATTSGQRIAP LGAVDPATVP YFSLAVLPSP
VRGPDDPAPL RASQAGWVPT APTVGKAGAV GTGGTPGAGE VVVVGLGPGA AGWLTPQAAE
ALAAADDLIG YGPYLDRVPV DQRQRRHASG NTVEAERAEL ALELAAGGAN VAVVSSGDPG
VFAMATAVVE AAAAERFAGV EVRVVPGLTA AQAVASRVGA PLGHDFCVLS LSDRLKPWEV
IERRLRAAAA GDFVLALYNP ASRTRRHQLE RAHEVLLEHR PSDTVVVIGR DVGGPTESIT
VTSLGAFDPA EVDMRCLLLI GSSTTRVVRR GPGRDLVFTP RRYPVT