Gene Franean1_4862 details

Gene Information

Locus tag: Franean1_4862
Symbol: cobN
ID: 5673202
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5831366
End bp: 5835166
Gene Length: 3801 bp
Protein Length: 1266 aa
Translation table: 11
GC content: 75%
IMG OID: 641243717
Product: cobaltochelatase subunit CobN
Protein accession: YP_001509133
Protein GI: 158316625
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1429] Cobalamin biosynthesis protein CobN and related Mg-chelatases
TIGRFAM ID: [TIGR02257] cobaltochelatase, CobN subunit
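
The coordinates and lengths in the table above can be cross-checked with simple arithmetic. The sketch below is illustrative only and not part of the database record. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 5831366
END_BP = 5835166
GENE_LENGTH_BP = 3801
PROTEIN_LENGTH_AA = 1266


def span_length(start: int, end: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for the values listed in the table.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```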


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.114113
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.358132
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGAACA CACCGCCGGG CGGCGTCCCG GCCGAGCCGG TGCTGGCCGC TGACCCGGTG 
CTGGCCCTGG TGTCCACCGC CGACACCGAG CTGCTCGCCG CGCGGGCCAG CGGCGTGCCC
TGGCGGGTCG CCAACCCGGC CCGGCTCGAC GCGGCGGAGG TGCCGGCGTT CCTCGCCGGG
GCCGACGTGG TGGTGGTGCG CCTGCTCGGC GGTGAACGGG CCTGGGCCGA GGGCCTCGCC
GCGACGGTGG CGAGCGGGCT GCCGGTCGTC GCCCTCGGCG GCGAGTCGAC ACCGGACGCG
GCGCTGATGG CGCGCGGCAC CGTCCCGGCC GGCGTCGCCA CCGAGGCGCT GGCCTACCTG
ACCGAAGGCG GCGTCGGAAA CCTCGCCGAG CTCTACCGCT TCCTCTCCGA CACGCTGCTG
CTCACCGGGC TGGGGTTCGC CCCGCCGGTG GAGCTGCCCG CGCACGGGCC GCACGGCGAC
CGGGTGGCCA AGCCGGACCG CCCGACCGTC GGCGTCGTGT TCTACCGGGC GCACGCCACC
TCCGGGAACA CGGCGTTCGT CGACGCGCTG TGCGACGCGC TCGAAGGAGC CGGGGCCAAC
GCCCGTCCGG TGTTCTGCTC GACGCTGCGG GGCGCCGCGG CCGGCGGAGT GATCGACGAC
CTGGCCGGCG TCGACGCGCT CGTGGTGACC GTGCTCGCCG CCGGCGGCAC CCGGGCGTCG
GACGCGTCCG CCGGCGGCGA CGAGGACGCC TGGGATGTCG GCGCGCTCGC CGCGCTCGAC
GTCCCGGTGG TGCAGGGGCT GTGCCTGACC TCGTCGCGCG CCGTCTGGGC GGACAGCGAC
GCCGGTCTGT CCCCCATGGA CGCGGCGATG CAGGTCGCCA TCCCCGAGTT CGACGGCCGG
ATCATCACCG TCCCGTTCTC GTTCAAGGAG CAGGGGGTGG ACGGCGTCCC GGTGTACGTG
GCCGATCCGG AGCGCACCGC CCGGGTCGCC GGGATCGCGT CCCGACTGGC CGGGCTGCGG
CACACCCCGG CCGCCGAGAA GAAGATCGCG ATCGTGCTCT CGTCGTACCC GACGAAGCAC
TCGCGGGTCG GTAACGCCGT CGGGCTCGAC ACCCCGGCGT CGGCTGTGCT GCTGCTCGAC
GCGCTGCGCG CCGCCGGCTA CGACCTCGGC GACGGCTACC CGGGCGAGGA CCTCCGCTCC
GCCGACGGCG GGCCGGGCGA GATCAGGCCG GACGCCGCGG CCGGAGACGC CCTGATCCAC
GCCCTGATCG CCGCCGGCGG GCACGACGTC GAGTGGCTGA CCGCGGAGCA GCTCGCCGCG
GCGCCGGCGC GGGTGCCGGC GTCAACCTAC GAGGGATGGT TCGCCCGCCT GCCGGAGTCG
CTGCGCGGCG CCATGACCGA GCACTGGGGC CCACCGCCGG GCTCCCTCTA CGTCGACGAC
GGCGCTGCCG GCGACGGGCG CGGGCGCGGG CCGGCGGCGG AGCCGGCGAT CGTGCTGGCC
GCGCTGCGCT TCGGCAACGT CGTCGTGATG ATCCAGCCGC CGCGGGGGTT CGGGGAGAAC
CCGGTGGCGA TCTACCACGA CCCGGACCTG CCGCCGTCCC ACCACTACCT GGCGGCCTAC
CGCTGGCTGG ACGAGGCCTT CGGCGCGGAC GCCGTCATCC ACCTCGGCAA GCACGGGACG
CTGGAGTGGC TGCCCGGCAA GGGCCTGGGG CTGTCCGCGG GGTGCGCGCC GGACGCCGTC
CTCGGCGACC TGCCGTTCGT CTACCCGTTC CTGGTCAACG ACCCGGGGGA GGGGACGCAG
GCCAAGCGCC GCGCCCACGC CGTGATCGTC GATCACCTCG TACCGCCGAT GGCCCGCGCC
GACTCCTACG GCGACATGGC CAAGCTGGAG CAGCTCCTCG ACGAGTACGC GACGCTCGCC
GCGCTCGACC CGGCCAAGCT GCCCGCCGTA CGCGCCCAGA TCTGGACGCT CATCCAGTCC
GCCCAGCTGC ACCACGACCT CGGGCAGGCC GACCGCCCCA GCGACGCCGA GTTCGACGAC
TTCCTGCTGC ACGTCGACGG CTACCTCTGC GAGGTCAAGG ACACCCAGAT CCGCGACGGC
CTGCACATCC TCGGCGTGGC CCCGGCGGGG GAGGCGCGGA CGAACCTGGT CACCGCGATC
CTGCGCGCCA ACCAGATGTG GGCGGGCCAG GCCGGGGCGG TACCCGGGCT GCGCGCGGCG
CTGGGCCACA CCGAAGGCGA CACCGAGGCG TCCCGGACGC AGGTCGACGC GTTCGAGCAG
ACCGCCCGGT CGCTGGTGGC CGCCCTCGAG GACGCCGGCT GGTCGCCGGC CGCCGTCAGC
TCCGTGGTGG CCGGGCAGTC GGCCGAGGCG GTGCCGGCCG CCGCGCGGGC CGAGGTCGAG
CGGGTCCTCA CGTTCGCCGC CACCGAGGTC GTCCCCCGGC TCGCCCGCAC CCCGGACGAG
ATCGTTAACA CCGTGCGGGC CCTCGACGGG CGGTTCGTCC CGCCCGGCCC GTCCGGCTCG
CCGACCCGTG GGCTCGTCAA CGTGCTGCCC ACCGGGCGCA ACTTCGCCTC CGTCGACCCG
AAGGCCATCC CGTCCCGGCT CGCCTGGGAG ACCGGTCGCG CGCTCGCCGA CTCGCTGATC
GAGCGCTACC TCGCCGACAC CGGCGGCTAC CCGGCCTCGG TCGGGCTGAC CGTCTGGGGG
ACGTCGGCGA TGCGCACCCA GGGCGACGAC ATCGCCGAGG TGCTCTGGCT GTTGGGCTGC
CGTCCCACCT GGGACGACGC GTCCCGGCGG GTCACCGGCT TCGAGGTCGT CCCGGGCGCG
GAGCTCGGGC GTCCGCGGGT GGACGTGGTG GTGCGGATCT CCGGATTCTT CCGGGACGCG
TTCCCGCACG TCGTGGCCCT GCTCGACGAC GCGGTGCGGG CGGTGGCGGT GCTCGACGAG
CCCGACGACG TCAACGCGCT GGCCGCTCAC GTCCGGGCCG ACCTGGCGGC GCTCGGGGCC
GTCCCGACGG TCGGCGGGCG CGGCGGCACC GCCGACCCGG CCGCGCTGCG CCGCGCGACG
ACGCGGATCT TCGGTTCGAA GCCGGGCGCG TACGGAGCCG GCCTGTTGCC GCTGATCGAC
TCCCGGCAGT GGCGCTCGGA TGCCGACCTG GCCGAGGTGT ACGCGGTCTG GGGCGGCTAC
GCCTACGGGC GCGGCCTGGA CGGCGCCGAG GCCCGCGCCG ACATGGAGAC CGCGTTCCGC
CGCATCCAGA TCGCGGTGAA GAACCAGGAC ACCCGCGAGC ACGACCTCGT CGACTCCGAC
GACTACTTCC AGTACCACGG CGGGATGGTG GCCGCCGTCC GCGCGCTCAC CGGCACGTCG
CCAGCCGCCT ACGTCGGTGA CAGCGCGCTG CCGGACGCCG TCCGCACCCG CACCCTGCAG
GAGGAGACCC ACCGGGTGTT CCGCGCCCGG GTGGTCAACC CGCGCTGGAT CGCGGCGATG
CGCCGTCACG GCTACAAGGG CGCGTTCGAG CTGGCCGCGA CGGTCGACTA CCTCTTCGGC
TACGACGCGA CCGCCGGTGT CGTGCAGGAC TGGATGTACG AGTCGCTGGC CGCCTCCTAC
GTCTTCGACG CCGAGACCCG CGAGTTCCTG CGGACGTCCA ACCCGTGGGC ACTGCGCGGC
ATGACCGAGC GCCTGCTGGA GGCCGCGGAG CGTGGCCTGT GGGAGGCTCC TCAGGAAGCC
ACGCTGGCCG GCCTGCAGTC CACTTACCTT GAACTGGAAG GTGATCTGGA GGCCGCGGGC
GACGGAGAGA GCGCCGGATA A
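
A GC content figure such as the one listed under Gene Information is the percentage of G and C bases in the sequence. As a minimal sketch (the function name is hypothetical, and the database may compute or round its value differently), the helper below strips the 10-base spacing used in the listing and reports the percentage. It is demonstrated on the first row of the gene sequence only, so its result describes that row and not the whole gene.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide string.

    Whitespace (for example the 10-base spacing used in the listing) is ignored.
    """
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in cleaned)
    return 100.0 * gc_count / len(cleaned)


# First row of the gene sequence, copied from the listing.
first_row = "ATGCCGAACA CACCGCCGGG CGGCGTCCCG GCCGAGCCGG TGCTGGCCGC TGACCCGGTG"
print(round(gc_percent(first_row), 1))
```

Passing the full 3801 bp sequence to the same function would give the whole-gene value for comparison with the table.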
 
Protein sequence
MPNTPPGGVP AEPVLAADPV LALVSTADTE LLAARASGVP WRVANPARLD AAEVPAFLAG 
ADVVVVRLLG GERAWAEGLA ATVASGLPVV ALGGESTPDA ALMARGTVPA GVATEALAYL
TEGGVGNLAE LYRFLSDTLL LTGLGFAPPV ELPAHGPHGD RVAKPDRPTV GVVFYRAHAT
SGNTAFVDAL CDALEGAGAN ARPVFCSTLR GAAAGGVIDD LAGVDALVVT VLAAGGTRAS
DASAGGDEDA WDVGALAALD VPVVQGLCLT SSRAVWADSD AGLSPMDAAM QVAIPEFDGR
IITVPFSFKE QGVDGVPVYV ADPERTARVA GIASRLAGLR HTPAAEKKIA IVLSSYPTKH
SRVGNAVGLD TPASAVLLLD ALRAAGYDLG DGYPGEDLRS ADGGPGEIRP DAAAGDALIH
ALIAAGGHDV EWLTAEQLAA APARVPASTY EGWFARLPES LRGAMTEHWG PPPGSLYVDD
GAAGDGRGRG PAAEPAIVLA ALRFGNVVVM IQPPRGFGEN PVAIYHDPDL PPSHHYLAAY
RWLDEAFGAD AVIHLGKHGT LEWLPGKGLG LSAGCAPDAV LGDLPFVYPF LVNDPGEGTQ
AKRRAHAVIV DHLVPPMARA DSYGDMAKLE QLLDEYATLA ALDPAKLPAV RAQIWTLIQS
AQLHHDLGQA DRPSDAEFDD FLLHVDGYLC EVKDTQIRDG LHILGVAPAG EARTNLVTAI
LRANQMWAGQ AGAVPGLRAA LGHTEGDTEA SRTQVDAFEQ TARSLVAALE DAGWSPAAVS
SVVAGQSAEA VPAAARAEVE RVLTFAATEV VPRLARTPDE IVNTVRALDG RFVPPGPSGS
PTRGLVNVLP TGRNFASVDP KAIPSRLAWE TGRALADSLI ERYLADTGGY PASVGLTVWG
TSAMRTQGDD IAEVLWLLGC RPTWDDASRR VTGFEVVPGA ELGRPRVDVV VRISGFFRDA
FPHVVALLDD AVRAVAVLDE PDDVNALAAH VRADLAALGA VPTVGGRGGT ADPAALRRAT
TRIFGSKPGA YGAGLLPLID SRQWRSDADL AEVYAVWGGY AYGRGLDGAE ARADMETAFR
RIQIAVKNQD TREHDLVDSD DYFQYHGGMV AAVRALTGTS PAAYVGDSAL PDAVRTRTLQ
EETHRVFRAR VVNPRWIAAM RRHGYKGAFE LAATVDYLFG YDATAGVVQD WMYESLAASY
VFDAETREFL RTSNPWALRG MTERLLEAAE RGLWEAPQEA TLAGLQSTYL ELEGDLEAAG
DGESAG
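
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below is a minimal illustration, not the database's own tooling. It builds the codon-to-amino-acid map from the usual compact ordering (bases T, C, A, G), whose assignments are shared by the standard table and table 11, and translates two short excerpts copied from the listing. It does not handle the alternative start codons that table 11 permits, which does not matter here because the gene begins with ATG. All names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate an in-frame nucleotide string, stopping at the first stop codon."""
    cleaned = "".join(seq.split()).upper()
    residues = []
    for pos in range(0, len(cleaned) - 2, 3):
        amino_acid = CODON_TABLE[cleaned[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First and last rows of the gene sequence, copied from the listing.
first_row = "ATGCCGAACA CACCGCCGGG CGGCGTCCCG GCCGAGCCGG TGCTGGCCGC TGACCCGGTG"
last_row = "GACGGAGAGA GCGCCGGATA A"

print(translate(first_row))  # MPNTPPGGVPAEPVLAADPV
print(translate(last_row))   # DGESAG
```

The two outputs match the first 20 and the last 6 residues of the protein sequence shown above. The last row is in frame because the 63 full rows before it contain 3780 bases, a multiple of 3.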