Gene Information |
Locus tag | Francci3_1258 |
Symbol | |
ID | 3906104 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 1501855 |
End bp | 1504473 |
Gene Length | 2619 bp |
Protein Length | 872 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 637878592 |
Product | ComEC/Rec2-related protein |
Protein accession | YP_480365 |
Protein GI | 86739965 |
COG category | [R] General function prediction only |
COG ID | [COG0658] Predicted membrane metal-binding protein [COG2333] Predicted hydrolase (metallo-beta-lactamase superfamily) |
TIGRFAM ID | [TIGR00360] ComEC/Rec2-related protein [TIGR00361] DNA internalization-related competence protein ComEC/Rec2 |
|
|
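Note: the coordinate and length fields in the table above can be cross-checked against each other. The short sketch below does so, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon. The record does not state either assumption, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1501855
end_bp = 1504473
gene_length_bp = 2619
protein_length_aa = 872

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(span_bp, 3)

# Assumption: the final codon is a stop codon and is not counted in the protein.
expected_protein_aa = codons - 1

print("span matches Gene Length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("codons minus stop matches Protein Length:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks agree with the table values.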
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0672087 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGACCG GACTCGCGCT GCCGGAACGG CCGTCCGGGG CTTCCCGTCC CATGGACGCC CGGCTGGTGG GCCCGGCGGT CATGGCCTGG GCTGGCGCGG CGGGTGCCGG CACCTGGCCG CCGTCCGTTC CGTTGCTGGG CGCCGCCGGT GCGGTGCTCG TCGCGGCTCC GATGTTCCTG CTGCTCCAGG TCCGTCCGCG GGGTGGTCGG CCGACTCGCC CGGGACCGGG CGGCGCTCGG GCCGAGGCGG CTACGGGCGC GTCTCGTCCG TCGCAGTCGC AGTCGCAGCC GCGCAGCGCG CCGCTCACCC GGGGCGTTGG GCGGTGCTCC GGCGGCACTG TCGTCGTGCT CACGGCCCTC GTCTTCCTGG CCGCCGGGTT TCTCGCGGGT GGACTGGCGG CTCGTCCGCG TGACAGCGGC CCGCTCGCCG ATCTGGTCCG GCATGGCCGT CCGGTCACCG CCGAGGTCGT CGTCACCGAC GATCCCCGCG CCACCTCCTC GTCCACCGCG GTCCGGACGG GAACCGGGCC CGGAGCCGGA GAGGGAAGAG GAGAGGGCGG CGGCGCCGAT CGCGTGACCT TCGTGATCTC CGTCCGGGCG GAACGCCTTA CCGCGTCACC GGCGGATCTG CGGGTCCGGG CCCCGATGGT CCTGCTCGCG CGGGGTGGCG GCTGGGGTGC TCTGCTGCCC AGCCAGCACC TGGTCGTCTC CGGCAGGCTC GCCGAACCCC GGGTCGGTGA CACCGTGGCC GCCGTCCTGT TCGCCGACGG CCCACCGCGG ACCCGCGGCG GACCCGCAAT GATCCAACGA ATCGCCGGGG GGCTGCGGGC CGGGCTGCGT CGAGCCGCTG GGGGACTGCC TGAACCGCGT CGCGGGCTGC TGCCCGGCCT CGTGGTCGGC GACGTCTCCG GGCTCGACGA TGCGGTGCGC GCGGACTTCC GCGCCGCCGG AATGTCCCAT CTAACGGCCG TGTCCGGTAG CAACGTCGCC ATCGTGACGG CCTCGGCGCT CTACCTCATG GGATGGACCG GGCGTGGGTC ACGGTCGCGC GCCGCGGTGG GAGCCCTCGC TCTCGTCGGG TTCGTCGTGC TCGCCCGGCC CTCGGCCAGC GTGCTGCGTG CCGGGGCGAT GGGTCTGGTC GGCCTGCTCG GGCTCGCCGT TGGCCGTCCC AGGGCGGTGC TGCCCTCCCT GGCGGCGAGC GTGATCATCC TGATCCTGGC GGATCCCGCG CTCGCCCTCT CGGTCGGCTT CGCGCTGTCC GTCCTCGCCA CGGCCGGAAT GATCGTTCTC GCGCCGGGCT GGCGGGACGC GCTCGCTCGC CGGCTGCCGG CGCGAATCGC CGAGGTGCTC TCCGTCGCGG CCGCCGCCCA GCTGGCGTGC ACCCCGGTGT TGGCCTGGAC CGGCGGGGGG CTGAGCCTCG TGGCGGTGCC CGCGAACGTC CTGGCCGTGC CCGCCGTCGC ACCGGCGACC GTGCTGGGAG TACTGACGCT GGTCGTGGCG GCGGTGTCGC CGTCAGCAGC CGGGCTGCTC GCGCACCTCG CCGATCTGCC GTGCTGGTGG CTGGTGACGG TCGCCGGCCG GTGCGCCGAT CTTCCTGCCG CGACGCTGCC GTGGCCCACG GGGGTGATGG GAGCGGGCGT GGCCGCCGGT GTCGCCTGGC TCGTGGTGGC GACGCTGCGG CGGCGGGTGC CGCGGCGACT GGTCGCTGCG GCCCTGGTCG GCCTGCTCCT GGCGCGATGC GCCGTCGCCG GGCGGCTGGC GCCGTGGCCG CCGCCCGGCT GGCGCCTGGT CGCCTGCGAC 
GTCGGTCAGG GCGACGCCCT CGTGCTGTCG GCGGGACCGG GCACCGCGGT GCTGGTCGAC GCCGGCCCCG ATCCGGCCCT GCTGACCCGC TGCCTCTCCG ACCTCGGCGT GCGACGGATA CCGGTGGTCA TTCTCAGCCA CTTCCACGCC GATCACGTCG AGGGGCTGCC GGCCGTGCTC GGCCGGCTTC CGGTGGGGGA GGTACTCGGC AGTCCGCTGG GGGAACCCGT TCTCCAGTGG CACCGGGTCC AACAGTGGAC CCGGCGGGCC GGTGTGCCGC TGCGGACGGC CGTCATCGGC TCCCGGGCAC AGGTGGGGGC GGTCTCTTGG ACGGTCCTGG CTCCGCGCAC CGTGTTGCAC GGCACCGAGA GCGATCCCAA CAACGCGAGC CTGGTGCTCT CCGCACGGGT GGGGGAGGTG ACCATCCTGC TCACCGGGGA TGTGGAGCCG CCGGCCCAGC GGGTGTTGAC CGGCAGCCCG GAGGATCGGA CGGCGCTACG AGCCGACGTG CTCAAGGTGC CCCATCACGG CGCCGCTGAT CAGGACGCCA CCTTCCTCGC GGCGACCGGG GCCCGATTCG CCCTGATCAG CGTCGGCACC GGGAACAGCT ATGGTCATCC CGCGCCGTCC ACCTTGCGGA CGCTGCGCCG GTCCGGCATG GCCGTCGCCC GGACCGACCG CGACGGCGCG GTGGCGGTGG TCGCCACGGC GGCACCCGCC GGGTCCGCCG AGTCCGGGTC GCTGGCGGGA GCCGCCTCCG GGTCGGGGGC GGGAGTCCGC GTCTCGGTGG TGCTCCGCCG GCCCGGGGGC GGATCGTGA
|
Protein sequence | METGLALPER PSGASRPMDA RLVGPAVMAW AGAAGAGTWP PSVPLLGAAG AVLVAAPMFL LLQVRPRGGR PTRPGPGGAR AEAATGASRP SQSQSQPRSA PLTRGVGRCS GGTVVVLTAL VFLAAGFLAG GLAARPRDSG PLADLVRHGR PVTAEVVVTD DPRATSSSTA VRTGTGPGAG EGRGEGGGAD RVTFVISVRA ERLTASPADL RVRAPMVLLA RGGGWGALLP SQHLVVSGRL AEPRVGDTVA AVLFADGPPR TRGGPAMIQR IAGGLRAGLR RAAGGLPEPR RGLLPGLVVG DVSGLDDAVR ADFRAAGMSH LTAVSGSNVA IVTASALYLM GWTGRGSRSR AAVGALALVG FVVLARPSAS VLRAGAMGLV GLLGLAVGRP RAVLPSLAAS VIILILADPA LALSVGFALS VLATAGMIVL APGWRDALAR RLPARIAEVL SVAAAAQLAC TPVLAWTGGG LSLVAVPANV LAVPAVAPAT VLGVLTLVVA AVSPSAAGLL AHLADLPCWW LVTVAGRCAD LPAATLPWPT GVMGAGVAAG VAWLVVATLR RRVPRRLVAA ALVGLLLARC AVAGRLAPWP PPGWRLVACD VGQGDALVLS AGPGTAVLVD AGPDPALLTR CLSDLGVRRI PVVILSHFHA DHVEGLPAVL GRLPVGEVLG SPLGEPVLQW HRVQQWTRRA GVPLRTAVIG SRAQVGAVSW TVLAPRTVLH GTESDPNNAS LVLSARVGEV TILLTGDVEP PAQRVLTGSP EDRTALRADV LKVPHHGAAD QDATFLAATG ARFALISVGT GNSYGHPAPS TLRTLRRSGM AVARTDRDGA VAVVATAAPA GSAESGSLAG AASGSGAGVR VSVVLRRPGG GS
|
| |
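Note: the gene and protein sequences above can be checked against each other by translating the gene sequence. The sketch below is a minimal illustration, not the database's own method. It builds the codon table from the usual TCAG ordering and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons, which this sketch ignores). The function name `translate` and the variable names are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence from the record.
first_groups = "ATGGAGACCG GACTCGCGCT GCCGGAACGG"
# Last nine bases of the gene sequence, which are in frame with the start.
last_nine = "GGATCGTGA"

print(translate(first_groups))
print(translate(last_nine))
```

Passing the full gene sequence (both lines joined) to `translate` should reproduce the full protein sequence if the record is internally consistent; the two fragments here only check the beginning and the end.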