Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1251 |
Symbol | |
ID | 5669664 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 1506549 |
End bp | 1508816 |
Gene Length | 2268 bp |
Protein Length | 755 aa |
Translation table | 11 |
GC content | 79% |
IMG OID | 641240183 |
Product | putative DNA-binding protein |
Protein accession | YP_001505611 |
Protein GI | 158313103 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00288406 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0497189 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGAGC TGCGGGCGGT CGCACTCAGC GAGGACGGCG GTTATCTCGT GCTCGCCGAC GCCGCCGGGC GCGCGGACGC GGAACAGTTC CGCGTCGCGG TGGACGATCG GCTGCGCGCG GCGCTGCGCG GAGCCCGACG TAGTGAAGTA CGCGCGGAAA GCGCGCTGAC CCCCCGTGAG ATCCAGGCCC GACTTCGGGC CGGTGAGACC GCCGCCGACG TCGCGCGGGC CGCGGGCATC CCGGTGGAGC GGGTCGAGCG CTACGAGGGG CCGGTGCTGG CCGAACGCGC CCGGGTGGTC CAGGAGGCGC GCGCCGCGCT GCTGCCCAAG GATCCGGGCG GGGTGCCCGG CCGTCCCCTC GGCGAGGTGG TCGACGCACG GCTGCTCAGC GTGCAGGACG ACCCGGACAC CGCCCAGTGG GACGCCTGGC GCCGGGTGGA CGGGATCTGG CTGGTCCAGC TCACCTCGGA CAGCAGGTGC GCCCGGTGGA CGTGGGACCC GGTGGTGCGC CGGGTGCGCC CGCACGACGA CGCGGCCCGG GCGCTGGTCG CCCCCGAATC AGCGGAGCCG CCCGCCCCGC AGCCGGCGCA GCCGCCCGTG CGCGCCGCCG GCCCGGCGCT GACCCTGGTG CACGACCAGG GCGCGGCGGC GGCCTACCCG ACGCAGTCCG CGGCCGGTGG ACCCGGCCAG CAGCTTCCGG GAACGCAGCC CTCGACGGCT CCCCCCTCGG CCTTCCCCGC TCCGGCGGCC GTCAACGGGA CGGGCTACCT CCCCAGCGCG GCTGCGCCGG GGTACGGGCC GTCGCCGGCC CAGGGGCCCG CGGGCGAGTC TCCGGACCGC CGCCCGGCCG AGCGGGCCGG CGACGACTAC GCCACGGACC ACACCACTCC TGACCACGGG GCGCCCGACT ACGCGACCCC CGGCTACGAG AACACCAGTT ACGAAGCCTC CGGCTATGGA GGCTCCGCCC AGGAAGCCGC CGGCCAGGAC GCCGCGACGC GGGCGAACAC GGGGCAGGGA GCCGGCGGCC ACGGCACCTC GCCGGACCAG GCTGGTCCGG ACGAGTACCA GGGCCCGGTG GCCGCCCCCG CCGCGAGCCG TACGGCCGGC GCCGCGCGGC GCAGCATGCC CGGTGCGTGG CCGGCGGCCC AGGGCGGGGC CCCAGGCCTG TCCGGGCGGC ACAGCGGCGC GGGAGGCCGG CGCACCACGC CCCCGAGCCG GTTCGGCTCC CGGGAGCGGG CCGTTCCGCC GCCGACACCG GCGGCCGAGC CGAGGTGGCT GCCGAACCCG GATACCGAGT ACTCCGAGGT GACCGCCACC GCCGTCGACG CGGCGGCGGC CAACGAGGCG GAGTTCGGCC ACGCCGGCGC GGCTGACATG GAGTCCACCA AGACAGACCT GGTCGCGGTG GCCGAGGCGG CCCTCGCCCC GGCGCCGGAA TCGGCCGAGC CCGGCATCCC GAGCGAGGAC GCCCAGGACA CCCCGGGCGG CTCGGCCGAC CCGGACGAGA CCGTTCAGGC CGCGCACGCC GCCGGTCTGG ACGAGGACGA CGAAGACGAC GATGAGGACA CCGAGGACGG CGGCGACCGG TCGGTGAGCG AGGCGGCGCC CGGCGGGGCC GGGCGGCCCG CGGCTCCCGC GCCGACAGGG CAGCCGGTGC TCCGACCGGC CGCGATCGTG CCGCCTGCGG CCGCGGAGCC GGCGCGGGAA CCCACGACGC CGGTGGAGCG GGTGCCGCGC CGTCCGGCGG CCGCCGCCGC CCGCCGGCCA GCTGTCGCCC GCACCGCTGG CGGTGCCCGT TCACTCGGGG CGCTCGCGGC AGCCGCTGAG GACCTGGACG GCGGCACTGC GGCCTCGGCG GCTCCGGCTG GTGGCGGCCG AGGCGCCGCT GCCCCGGGGC GCGGCGGAGC ACCGCGCCGG CCGGCAGCGG GCCGGGCCGT GGACGGCCCG GCGGGTCGCC CCGCGCGCCC CGGCTCCACC GCGGCGGAGC ACCCCGCCGA GCCGTCCACC GCGGCCGACA GCGAGTTCGA GACCACGGCG GAGACCACCA CTGCCGCCAC GGTCGGCGAC ACCGCCTCCG AAGCCGCCAC AGAGGCCGCC ACAGAGGCGG CCCAGCAGGC CGCACGGGCC GCCCAGCCGG CCGCCGCGGG CAACCGCGGA CGGCAGCCGG CGGCAGGTCG TAGCGGCGAA CGTCCCGCAG GCGGTCGACG CGGACGCAAG TCGGTGCCAG CATGGGACGA CATCGTGTTC GGCGCCCGCC GGCCCTAG
|
Protein sequence | MRELRAVALS EDGGYLVLAD AAGRADAEQF RVAVDDRLRA ALRGARRSEV RAESALTPRE IQARLRAGET AADVARAAGI PVERVERYEG PVLAERARVV QEARAALLPK DPGGVPGRPL GEVVDARLLS VQDDPDTAQW DAWRRVDGIW LVQLTSDSRC ARWTWDPVVR RVRPHDDAAR ALVAPESAEP PAPQPAQPPV RAAGPALTLV HDQGAAAAYP TQSAAGGPGQ QLPGTQPSTA PPSAFPAPAA VNGTGYLPSA AAPGYGPSPA QGPAGESPDR RPAERAGDDY ATDHTTPDHG APDYATPGYE NTSYEASGYG GSAQEAAGQD AATRANTGQG AGGHGTSPDQ AGPDEYQGPV AAPAASRTAG AARRSMPGAW PAAQGGAPGL SGRHSGAGGR RTTPPSRFGS RERAVPPPTP AAEPRWLPNP DTEYSEVTAT AVDAAAANEA EFGHAGAADM ESTKTDLVAV AEAALAPAPE SAEPGIPSED AQDTPGGSAD PDETVQAAHA AGLDEDDEDD DEDTEDGGDR SVSEAAPGGA GRPAAPAPTG QPVLRPAAIV PPAAAEPARE PTTPVERVPR RPAAAAARRP AVARTAGGAR SLGALAAAAE DLDGGTAASA APAGGGRGAA APGRGGAPRR PAAGRAVDGP AGRPARPGST AAEHPAEPST AADSEFETTA ETTTAATVGD TASEAATEAA TEAAQQAARA AQPAAAGNRG RQPAAGRSGE RPAGGRRGRK SVPAWDDIVF GARRP
|
| |