Gene Franean1_1235 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1235 
Symbol 
ID5669648 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1478003 
End bp1481038 
Gene Length3036 bp 
Protein Length1011 aa 
Translation table11 
GC content71% 
IMG OID641240167 
Productvitamin B12-dependent ribonucleotide reductase 
Protein accessionYP_001505595 
Protein GI158313087 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0209] Ribonucleotide reductase, alpha subunit 
TIGRFAM ID[TIGR02504] ribonucleoside-diphosphate reductase, adenosylcobalamin-dependent 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00432808 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACCGAGA CCCGCGGCAA GGGGAACAGA TCGGCCGCGA AGGTGGCCGG GCTGAAGATC 
CGGCGGGTTC GCACCACGCC CGGAGTCCAC CCCTACGACG AGGTGGAGTG GGAGTCCCGC
GACGTCGTCA TGACGAACTG GCGGGACGGA TCGGTCAACT TCGAGCAACG TGGGGTGGAG
TTCCCGACCG CCTGGTCGGT CAACGCCTCC AACATCGTCG CCAGCAAGTA CTTCCGCGGC
GCGCTCGGCT CGCCCGATCG GGAGTCGTCG CTGCGCCAGC TCATCGACCG GGTCGTGCTC
CGCTACGGCG CGGCCGGCCG CGAGCACGGC TACTTCGCGT CGGAGAGCGA CGCCGAGATC
TTCGAGCACG AGCTGACCTG GATGCTGCTG CACCAGGTCT TCAGCTTCAA CTCCCCGGTC
TGGTTCAACG TCGGGACGGC CGCGCCGCAG CAGGTGTCGG CGTGCTTCAT CCTCGCGGTG
GACGACACCA TGGAGTCGAT CCTCAACTGG TACCGGGAGG AGGGCCTGAT CTTCAAGGGC
GGCTCCGGTG CCGGTCTCAA CCTCTCCCGA ATCCGCTCCT CGAAGGAGCT CCTCTCCTCC
GGCGGCACCG CGTCCGGCCC GGTCAGTTTC ATGCGCGGCG CCGACGCCTC GGCGGGCACC
ATCAAGTCGG GCGGGGCGAC CCGGCGCGCG GCGAAGATGG TCGTCCTCGA CGTCGACCAT
CCCGACGTCG TCGAGTTCGT CGAAACCAAG GCGCGTGAGG AGGACAAGAT CCGCGCGCTG
CGCGACGCCG GGTTCGACAT GGACCTCGGC GGGCGCGACA TCGCCTCGGT GCAATACCAG
AACGCCAACA ACTCGGTGCG GGTCAACGAC GAGTTCATGC GCGCCGTGGT CGACGGGACG
TCCTTCGACC TGCGGGCGCG TCTCGACAAC CGGGTGATCG AGTCGGTGGA CGCCCGCGGG
CTGTTCCGCA CGATCTCCCA GGCCGCCTGG GAGTGCGCCG ACCCGGGCAT CCAGTACGAC
GGCACGATCA ACGACTGGCA CACCTGCCCG GAGTCCGGCC GGATCTCCGC GTCGAACCCG
TGCTCGGAGT ACGTCCACCT GGACAACTCG AGCTGCAACC TGGCCTCGCT GAACCTGATG
AAGTTCCTGC GGCCGGACCG CACCTTCGAC ACCGAGACGT TCGTCGCGTC CGTCGAGCTG
ATCATCACGG CGATGGACAT CTCGATCTGC TTCGCGGACT TCCCGACCCC GGAGATCACC
AAGGTGACCC GGGCGTACCG GCAGCTCGGT ATCGGGTACG CCAACCTGGG CGCGCTGCTC
ATGGCCAGCG CCCGTGCGTA CGACTCCGAC GGGGGCCGGG CGCTCGCCGC CGCGATCACC
TCGCTGATGA CTGCCACCGC CTACCGCCGC TCGGCGGAGC TCGCCGGCAC CGTCGGGGCC
TACGACGGCT TCGCCCGCAA CGCCGACGCC CACCGGCGGG TCGTGCGCAA GCACTCGGCG
GCGAACGACG CCGTCCGCAC CGTCCACGCG GAGGAGGCCC GCCTGATCAC CGCGGCCTCG
AAGGAGTGGG AGCGCGCGCT GTCGGTGGGG GAGAAGCACG GCTGGCGCAA CGCCCAGGTC
AGCCTGCTGG CCCCGACCGG GACGATCGGC CTGGCGATGG ACTGCGACAC CACCGGGATC
GAGCCGGACC TGGCCCTGGT GAAGATGAAG AAGCTGGTCG GCGGGGCCAG CATGAAGATC
GTGAACCAGA CGGTGCCCGC GGCGCTGCGC GCGCTGGGAT ACGCCGAGGA GACGATCGAG
GCGATCGTCG AGTACATCGC CGAGCACGGC CACGTCGTCG ACGCCCCCGG CCTGCGCACC
GAGCACTACG AGGTGTTCGA CTGCGCCATC GGGGACCGCG CGATCAGCCC GATGGGCCAC
GTCCGGATGA TGGCCGCCGT CCAGCCGTTC CTCTCCGGCG CCATCTCCAA GACGGTGAAC
ATGCCGGCGA CGGCCACCGT CGCCGAGGTC GAGGAGATCT ACCTCGAGGG ATGGCGGCTC
GGCCTCAAGG CCCTCGCGAT CTACCGGGAC AACTGTAAGG TCGGCCAGCC ACTGACGGAT
GTGAAGGGCG CGAAGCGGGA CGCCGCCGCG GCGGCCGAGG TGGCCTCCGA CGTGGCCGCC
CGGGCCGCCG CCGGAGCGGC CGCGGCCGTT GCAGCGGCGC CGTCCGCCGT GACGGCGCAG
CCCGCGGTCG GCCCCGGCGC TCCCGGCCCG CACTGGCCGG CCGGGCCCGG TGAGCACCGG
CCCGTCCGCC GCAGGCTGCC GAAGACGCGT CCGTCGCAGA CCGTCTCGTT CAGCGTCGGC
GGTGCCGAGG GGTACATGAC CGCCGGCTCC TACCCGGACG ACGGCCTCGG CGAGGTCTTC
ATCAAGATGT CCAAGCAGGG CTCCACCCTC GCCGGTGTGA TGGACGCGTT CTCGATCGCG
GTGTCCATCG CGCTGCAGTA CGGCGTGCCG CTGGAGGCCT ACGTCAGCAA GTTCATCAAC
ATGCGGTTCG AGCCGGCCGG TATGACCGAC GACCCGGACG TGCGGATCGC CCAGTCGATC
ATGGACTACC TGTTCCGCCG GCTGGCCCTG GACTACCTGA CCGAGGAGCA GCGCGCGGAG
CTGGGCATCC TGTCCGCCTC GGAGCGGACG CGGCGGTTGT CCGACCAGGC CGGCGGCGGC
CCGGCCGCGG CGGCGGAGCC GGAGATCGAC ATCGAGTCGC TGCGGGCGTC CGCCCCGGCC
GAGGGTGACT CGGACCCGGC GGCCACCGCC GGTGCGGCGC CGGCCCGGTC TGCGCCGGCC
CGGTCTGCGC CGGGCGGGTC CGCGCCGGCC GGGCCGGGCG TGGGCGGCGG GCCGGAGCTG
GTCGCGCAGG AGCTGCGACT GGCCATGCCG GGCGGTTCGC TGACCGCCCA GACGGTGGTC
GACGCGCCGC TCTGCTTCAC CTGCGGTGTG AACATGCGCC CGGCCGGCAG CTGCTACGTC
TGCGAGCAGT GCGGCTCCAC CAGCGGCTGC AGCTGA
 
Protein sequence
MTETRGKGNR SAAKVAGLKI RRVRTTPGVH PYDEVEWESR DVVMTNWRDG SVNFEQRGVE 
FPTAWSVNAS NIVASKYFRG ALGSPDRESS LRQLIDRVVL RYGAAGREHG YFASESDAEI
FEHELTWMLL HQVFSFNSPV WFNVGTAAPQ QVSACFILAV DDTMESILNW YREEGLIFKG
GSGAGLNLSR IRSSKELLSS GGTASGPVSF MRGADASAGT IKSGGATRRA AKMVVLDVDH
PDVVEFVETK AREEDKIRAL RDAGFDMDLG GRDIASVQYQ NANNSVRVND EFMRAVVDGT
SFDLRARLDN RVIESVDARG LFRTISQAAW ECADPGIQYD GTINDWHTCP ESGRISASNP
CSEYVHLDNS SCNLASLNLM KFLRPDRTFD TETFVASVEL IITAMDISIC FADFPTPEIT
KVTRAYRQLG IGYANLGALL MASARAYDSD GGRALAAAIT SLMTATAYRR SAELAGTVGA
YDGFARNADA HRRVVRKHSA ANDAVRTVHA EEARLITAAS KEWERALSVG EKHGWRNAQV
SLLAPTGTIG LAMDCDTTGI EPDLALVKMK KLVGGASMKI VNQTVPAALR ALGYAEETIE
AIVEYIAEHG HVVDAPGLRT EHYEVFDCAI GDRAISPMGH VRMMAAVQPF LSGAISKTVN
MPATATVAEV EEIYLEGWRL GLKALAIYRD NCKVGQPLTD VKGAKRDAAA AAEVASDVAA
RAAAGAAAAV AAAPSAVTAQ PAVGPGAPGP HWPAGPGEHR PVRRRLPKTR PSQTVSFSVG
GAEGYMTAGS YPDDGLGEVF IKMSKQGSTL AGVMDAFSIA VSIALQYGVP LEAYVSKFIN
MRFEPAGMTD DPDVRIAQSI MDYLFRRLAL DYLTEEQRAE LGILSASERT RRLSDQAGGG
PAAAAEPEID IESLRASAPA EGDSDPAATA GAAPARSAPA RSAPGGSAPA GPGVGGGPEL
VAQELRLAMP GGSLTAQTVV DAPLCFTCGV NMRPAGSCYV CEQCGSTSGC S