Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2292 |
Symbol | |
ID | 5670691 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2738443 |
End bp | 2741040 |
Gene Length | 2598 bp |
Protein Length | 865 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641241212 |
Product | hypothetical protein |
Protein accession | YP_001506633 |
Protein GI | 158314125 |
COG category | [S] Function unknown |
COG ID | [COG2170] Uncharacterized conserved protein [COG2308] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR02050] uncharacterized enzyme |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.382745 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.649548 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGACG CGCGGATCGC GGCCGTCGGT GTTGAAGAAG AATTCCACAT TCTCGATCTC GCTACCAGGC AGCTGGTACC CCGCGCGGAG GAGATCCTGC GCGGCCTCCC CGACGACCAG TTCTCCCCGG AGCTGCTGCG CTCCGTGGTC GAAACCAACA GCCGGCCCTG TACCGACCTC TCCGACCTCC GGGCGGACCT GCTCGACCTG CGCCGCCGCC TCGCCGCGGT CGCCGAGCCG CTGGGCCTCG GCCCCGCCGC GGCGGGAACG GTGCCGATCG TCGACATGTC CGTCCTCGAC GTCTCGCGGG ACGCGCGCTA CATCCAGATG ACCGAGGAGT ACCAGCTCCT CGCGCGCGAG CAGCTCATCT GCGGCGCCCA GGTCCATGTC GACGTCGCCG ACCGCGACCT GGCGATGGCC GTCACCGCCT GGGTGGCCCC CTGGCTGCCG ATGCTGCTGG CGCTGTCGGC GAGCTCGCCG TTCTGGCGCG GTGCCGACAG CGGCTACGCG AGCATGCGGA CGATGGTCTG GCAGCGCTGG CCGACCGCCG GCGTGGCCGG GCCGTTCCGC ACCGCCGCCG AGTACGACCA GCTGGTGGCG GACCTGGTGA AGTCGGGCGT CATCAGCGAC CCGGGCATGG TCTACTTCGA CGTCCGGCCG TCCGCGCACC TGCCCACCGT CGAGCTGCGC ATCTGCGACG CCTGCCCGGA CGTCGACAAC GTCATCCTGA TCGCCGGCCT GTTCCGGGCG CTGGTGTGCC GGGCGATCGA GGCGGTCGAG GCGGGCGCAC CGGCTCCGCC GCCGCGCGCC GAGCTGCTGC GCGCGGCGAC CTGGCGCGCG GCCCGCTCCG GCATCGAGAG CGACCTCGTG GACCTGTCGG GCACCGGCTG CATGCCGGCC GAGGAGCTGC TGCGCCGGCT GCTCACCGAG GTCCGTCCCG ACCTGGAGAA GGTCGGGGAC TGGGACCTGG TGCGTGACCT GGCCGAGGCG GCGGTGGGGC GGGGGAGCGC CGCGTCCCGC CAGCGCCGGG CTTTCGCCCG CCGCGGCCTG CTGACCGACG TCGCGGACCT CGTGCTGGCC GAGACGCGCG AGTCGCCGGC GTCGGCGGTC CACCCGCCGG GCACCGTCCC GTCGGTGGGC GCGGTGGCAG CGCTGCCGCC GCTGCTGGAC CGCTACCAGC CGTCCGGGTT CGACGAGGTC ATCGCCGAGG GCGGTGGTGT CCGGCCGCAC TACCGCGGCG TGGTGCGCAC CCTGGACCGC CTCGGCCCGG CGGTGCTCGC CGAGCGCGGC GAGGCCATGC AGGCCGAGCA GGTCGAGCGC GGCGTGGTGT TCCGGGTGAA CGGCGAGACC GAGCAGCGCC CGTTCCCGTT CGACCTGGTT CCGCGGGTCG TCACCGCGGG CGACTGGGAG CGCCTGCAGT CCGGGCTCAC CCAGCGCGTG CGGGCGCTGG AGGCGTTCCT GCGCGACACC TACTCCGAGC GCGCCGCGGT CGCCGACGGG GTGATCCCGG CCTGGCTGGT CAACGACTCG CCGGGGCTGC GCCACTCCGG CCGGGTCCTC TCCGGCGACG GGTCGCCGCG CTCCGGCGGT GTGCGGGTGA CGGTGGCGGG CATCGACCTC GTCCGCGGCG CCGACGGCAA GTGGCTCGTG CTGGAGGACA ACCTGCGGGT GCCGTCCGGG ATCGCCTACG CGATCGAGGG CCGGCGGCTG ACCCGGTCGG CGCTGCCCGA GCTCAACCCG CCCGGCGCCA TCCTGGGGGT GGACGCGGTG CCGGCGCTGC TACACGAGGC GCTGGTCGCG GCCGCCCCGC CGGCGGTGCG CGGCGAGCCC GCCGTCGCCG TCCTCACCGT CGGCGAGGAG GACTCCGCCT ACTACGAGCA CACCTTCCTC GCCGAGGAGA TGGGGGTGCC GCTGCTGACC CCGGCCGACA TCCTGGTCGA CGACGACGTG CTCTACGCCG TCGACGGCGG GCGGCGGCGG CGGATCGACG TCCTCTACCG GAGGGTGGAC GAGGACGAGC TGACGGGGCT GCCGGGCGCT GACGGGCTGC CACTGGGCCC CGGGCTGCTG CGCGCGGTGC GGGCTGGCTC GCTCGCGCTG GCGAACGCGC TGGGCAACGG GGTCGCCGAC GACAAGGTCG TCTACGCCTA CGTCTCCCGG ATGATCACCT ACTACCTGGG TGAGCAGCCG CTGCTCGACG ACGTCCCGAC CTATGTGTGC GGTGATCAGG AGCAGTGCTC GCACGTCCTG GAGCACCTCG AGCAGCTCGT CGTGAAGCCG GTGGACGGCT ACGGGGGCTC CGGGGTGGTG ATCGGCCCGC AGGCCGAGCC GTTCGAGCTG ACCGAGGTCC GCGAGCGGAT CCTGGCCGAC CCGCGTGGCT GGATCGGCCA GGAGATGGTC GCCCTGTCGA CCCATCCCAC CTGGGTCGAC GGTGAGCTCC AGCCGTGCGC GGTCGATCTG CGGGCCTTCG TCTACGCCGG CCGCGAGACG GCGGTGGTCG CGCCGGCCGC CCTGAGCCGG GTCGCGCCGC CGGGCAGCCT GATCGTCAAC TCGTCCCGGG GCGGCGGGTC GAAGGACACC TGGCTGCTGC GTCCCTGA
|
Protein sequence | MSDARIAAVG VEEEFHILDL ATRQLVPRAE EILRGLPDDQ FSPELLRSVV ETNSRPCTDL SDLRADLLDL RRRLAAVAEP LGLGPAAAGT VPIVDMSVLD VSRDARYIQM TEEYQLLARE QLICGAQVHV DVADRDLAMA VTAWVAPWLP MLLALSASSP FWRGADSGYA SMRTMVWQRW PTAGVAGPFR TAAEYDQLVA DLVKSGVISD PGMVYFDVRP SAHLPTVELR ICDACPDVDN VILIAGLFRA LVCRAIEAVE AGAPAPPPRA ELLRAATWRA ARSGIESDLV DLSGTGCMPA EELLRRLLTE VRPDLEKVGD WDLVRDLAEA AVGRGSAASR QRRAFARRGL LTDVADLVLA ETRESPASAV HPPGTVPSVG AVAALPPLLD RYQPSGFDEV IAEGGGVRPH YRGVVRTLDR LGPAVLAERG EAMQAEQVER GVVFRVNGET EQRPFPFDLV PRVVTAGDWE RLQSGLTQRV RALEAFLRDT YSERAAVADG VIPAWLVNDS PGLRHSGRVL SGDGSPRSGG VRVTVAGIDL VRGADGKWLV LEDNLRVPSG IAYAIEGRRL TRSALPELNP PGAILGVDAV PALLHEALVA AAPPAVRGEP AVAVLTVGEE DSAYYEHTFL AEEMGVPLLT PADILVDDDV LYAVDGGRRR RIDVLYRRVD EDELTGLPGA DGLPLGPGLL RAVRAGSLAL ANALGNGVAD DKVVYAYVSR MITYYLGEQP LLDDVPTYVC GDQEQCSHVL EHLEQLVVKP VDGYGGSGVV IGPQAEPFEL TEVRERILAD PRGWIGQEMV ALSTHPTWVD GELQPCAVDL RAFVYAGRET AVVAPAALSR VAPPGSLIVN SSRGGGSKDT WLLRP
|
| |