Gene Information |
Locus tag | Franean1_2103 |
Symbol | |
ID | 5670503 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2527264 |
End bp | 2529702 |
Gene Length | 2439 bp |
Protein Length | 812 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 641241024 |
Product | DNA internalization-related competence protein ComEC/Rec2 |
Protein accession | YP_001506445 |
Protein GI | 158313937 |
COG category | [R] General function prediction only |
COG ID | [COG2333] Predicted hydrolase (metallo-beta-lactamase superfamily) |
TIGRFAM ID | [TIGR00360] ComEC/Rec2-related protein; [TIGR00361] DNA internalization-related competence protein ComEC/Rec2 |
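The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon not counted in the protein length. The function names are hypothetical and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(2527264, 2529702)
print(length)                  # 2439
print(protein_length(length))  # 812
```

Both results agree with the Gene Length (2439 bp) and Protein Length (812 aa) fields.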
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.858083 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCGCCG TCGCGATGGA GACCGGGCTT GGCCTGCCGA GCGGCTCCGA TGCGGCGAGT GGCGGTCGGC TGGACCTCCG CCTGGTCGGT CCTGCCGTCG CGGTCTGGGG CGGGGCGGCC GCGGCGGGGT ACTGGGGCTT CGGGTCGGTG CTTCTCGGCG CCGCCGGCAT GGTGCTGTGT GCGTGCCCGG TGGGGGTGCT GATCGCGCTG CATCAGGGTC CCGGCGGCCG GCTCGGCCGG CCGGGAGCGC TGGTGGCGTT CGTCGCCCTG GCGTTCCTCG CGGCCGGGGT TCTCGTGGGC GGGGCGGCCG CGCGGCCCCG GTTCACCGGC CCGCTCGCCG AGATGGCCCG GGCCCACCGG ACCGTCAACG CCGAGGTCGT CCTGAGCGAC GACCCGAAGA TCTCGGCGGC GGCGGCGCCG GCTGCCACTT CGGGCCCGCG GGAGGGGCCG ACGGTCACCG CCCGGGCTCG ACTCGAGGCC GTGACCGGAC CCGGCTGCCG GCTGCGGGCC TCTGTGCCCG TGCTCCTGGT GGGGCGCGGC TCGGATCTCG CGAACTACCT GCCGGGGCAG CGGCTCGGCG TCCGCGCCGG GCTGGCGCCG GCGGGTCCGG GCGACACGAT CGCGACGGTT CTGTTCGTCC GGGCACCGCC ACGGGCACAG GGGCGCCCGG ACGCCGTGCA ACGGGCCGCC GGGTGGCTGC GGGCCGGCCT ACGGGACGCC GCCGACGTGG TGCCCCAGCC CGCCGGCGGG CTGCTGCCGG CACTGGTCGT CGGTGACACC TCGGGCCTTG ATCCCGGGCT GAAGGACGAC TTTCGCACCG CCGGGATGAG CCATCTGACC GCGGTGTCCG GCGCCAATCT GGCGATTACC GCGGGGACGG TGCTGTTCCT GCTCGGGCGG CTCCGGCTCG GGGCCCGCTC CAGGGCGGTC GCCGCGGCCC TGGTCCTCGT CGGGTTCGTG ATCCTGGCCC GGCCGTCCGC CAGCGTGGTC CGGGCCGGAG CCATGGGACT GGTCGGCCTG GTCGCGCTCG CCGCGGGTCG GCCCCGCGCG GTGCTGGCGG CGCTGGCCAC CGCGGTCATC GCCGTCGTGA TGGCGGATCC GGCGTTCGCG CTGTCGGCCG GGTTCGCCCT GTCGGTGCTC GCGACGACCG GGATGATCGT GTGGGGGCCC GGCTGGAGTG ACGTGCTGGA GCGCGGCCGC GCGGCCGGCC GGCTCGGCGA GGTCGTGGCG GTGGCCGCCG CCGCCCAGCT TGCCTGTACG CCGGTGCTGG CCTGGCTCGG TGGGGGCATC AGCATTGTCG CGATCCCGGC CAACGTGGTG GCGGCGCCGG CGGTCGCGCC CGCCACCGTG CTCGGCGTGT CGGCGATGGC CGTCGCCGCC GTCAACGACC CGGTCGCCGC GCTGCTCGCG CGGCTTGCGG GGCTGGCCTG CCACTGGCTC GTGCTGGTCG CGGACGTGGC CGCCGGTGTC CCCGCGGCCA CGATCGGCTG GCCGGCGGGG CTCGCGGGGG CGGCCACCGC GCTTGGCTGC GTCGTACTGG TGGTGGCTCT GGCGCGCCGG CGTCCGACCC GCTGGCTGCT GGTGTCGGCG GTGGTGGGCC TGCTGGCGGC CCGGGTGCTC CTGCTGCCGA GGCTGGCGGG CTGGCCGCCG CCCGGCTGGC GGCTCGTTGC CTGCGATGTG GGCCAGGGCG ACGGCCTGGT CCTGCGCGCC GGGCCGGCCT CGGCGGTCGT GGTCGACGTG GGTCCCGACC CGGCGCTGAT CGCCGCCTGC CTGGATGACC TCGGAGTACG CGAGGTGCCG 
CTGCTCATGC TCACCCATCT GCACGCGGAT CACGCCGCCG GCCTCAGCGG GGTGGTGGGC CGGCTGCCGG TGGGGGAACT GGTGGTGAGC CCGCTGCCCG AGCCGGCCGA CCAGTGGGAC GCGGTCGAGC GCGCGGCGCG GGCGGCGGGC GTGCCGGTGC GGGCCGTCAC CGCCGGCGCC GCGGGGGAGA CCGGAGCGGT CCGCTGGCGG GTGATCGGCC CGGAACGGGT CCTGCGGGGC ACGGCCAGTG ATCCGAACAA CGCCAGTCTC GTGGTCCTGG CGGCGGTCGG CGGCGTGACG ATACTGCTCA CCGGGGATGC CGAGCCGCCG GAGCAGCGGC AGGTGGCGCG GCGTGGTCTC GGGCCCGTCG ACGTGCTCAA GGTGGCTCAC CACGGCTCCG AGGACCAGCT GCCGGAGTTC CTCACCCGGA CCGGCGCCGA GGTGGCGCTG ATCAGCGTCG GCGTCGACAA CACCTACGGG CATCCCGCGC TGAGCACGCT GGCCGGCCTG CGCGCGGCCG GGATGGCGGT GGCCCGCACC GACCTGCACG GCACGGTGGC CGTCGTCGAG ACGGCCGGCG GTGGCGTCCG GGCGGTGGCT CGCCGGCCCG GCCCGCGAGG GGGCGGGGCC GGCTCGTGA
Protein sequence | MVAVAMETGL GLPSGSDAAS GGRLDLRLVG PAVAVWGGAA AAGYWGFGSV LLGAAGMVLC ACPVGVLIAL HQGPGGRLGR PGALVAFVAL AFLAAGVLVG GAAARPRFTG PLAEMARAHR TVNAEVVLSD DPKISAAAAP AATSGPREGP TVTARARLEA VTGPGCRLRA SVPVLLVGRG SDLANYLPGQ RLGVRAGLAP AGPGDTIATV LFVRAPPRAQ GRPDAVQRAA GWLRAGLRDA ADVVPQPAGG LLPALVVGDT SGLDPGLKDD FRTAGMSHLT AVSGANLAIT AGTVLFLLGR LRLGARSRAV AAALVLVGFV ILARPSASVV RAGAMGLVGL VALAAGRPRA VLAALATAVI AVVMADPAFA LSAGFALSVL ATTGMIVWGP GWSDVLERGR AAGRLGEVVA VAAAAQLACT PVLAWLGGGI SIVAIPANVV AAPAVAPATV LGVSAMAVAA VNDPVAALLA RLAGLACHWL VLVADVAAGV PAATIGWPAG LAGAATALGC VVLVVALARR RPTRWLLVSA VVGLLAARVL LLPRLAGWPP PGWRLVACDV GQGDGLVLRA GPASAVVVDV GPDPALIAAC LDDLGVREVP LLMLTHLHAD HAAGLSGVVG RLPVGELVVS PLPEPADQWD AVERAARAAG VPVRAVTAGA AGETGAVRWR VIGPERVLRG TASDPNNASL VVLAAVGGVT ILLTGDAEPP EQRQVARRGL GPVDVLKVAH HGSEDQLPEF LTRTGAEVAL ISVGVDNTYG HPALSTLAGL RAAGMAVART DLHGTVAVVE TAGGGVRAVA RRPGPRGGGA GS
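The sequences above are displayed in blocks of ten characters separated by spaces, so they need cleaning before any analysis. The following is a minimal sketch of stripping that display formatting and computing a GC fraction. The helper names are hypothetical, and the example uses only the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two 10-base blocks of the gene sequence, as displayed in the record.
prefix = clean_sequence("ATGGTCGCCG TCGCGATGGA")
print(len(prefix))          # 20
print(gc_fraction(prefix))  # 0.65
```

Applying the same two helpers to the full gene sequence gives a value that can be compared against the GC content field of the record.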