Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1489 |
Symbol | |
ID | 5669893 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 1783019 |
End bp | 1786075 |
Gene Length | 3057 bp |
Protein Length | 1018 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641240409 |
Product | hypothetical protein |
Protein accession | YP_001505835 |
Protein GI | 158313327 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1112] Superfamily I DNA and RNA helicases and helicase subunits |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.132129 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.443999 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTGACAG CGGGCGGCGC GCGGCGGCCT GACGGCGACC GGCCGGCCGG CGCGACGATC GTGACGCGAG CCGACGTCAT GCTGCCGGGC GCGCTGGCGG TCGTCCCGTC GGCCAAGGCG ACTGAGCAGC TTCAGTGGGC GCCGACGGCA CTGGTGACGG AATTCAGCGA ACTGGCGGCC CGAGGGGCCC TACCGGCCCG GATGGACCTC GATCGCGGCC AGCTCATCCT CTACACCAGG CGGTATGTCG CCTTCCTCTA TCCGACGAAG TTCGGGGACG GGTACAGCCT CGGCCACGTC TCCCGCCGGT CGTTCCGTGA CGAGGACAAG CTCAGCCGGG GAGCGCTGAT ACTGCACTGC CGCGCTCCCT GGGGGGCGTT CAACGACGTG CGGGACATAC CTCGTCCGCA GGGCACGTTG AGCCCTCGAT GGGCGTCCCA CTGGGACCAG CTGAGCCTCG AGTGGACCTC CCTGGGCCGG GTTGTGGACG GCGCTTCGGA GCTGCCGCCG CACCACCGTG ACTACCTCGC ACTGCTCGAC CACGTCGTCG AGGCGAGCCG GGACATCGAG CTCGCGAAAC TGAGCGGCAT CCCCGAAGCG CCGTACCGGG AGGTGGAACC GACCCGCGAA GAACGCTACT CAGGCCGCGG CGTCTACTCG TTCTCCCTGG CCCGGCCGCC CCGTGGGCTG CCCGTCGGTG CCCTGGTCGA GGTGGCGGAG TGCGGCGACC CGCCGATGCG GGGCAAGGTC GTCCGCAGCG GCGGCCGGAC CCTCGTCGTC CGCTTCGAGC GGACGGTCGA CTTCCAGCGG ATTCCGCCCC AGGGCCGGCT CAGGGCGTCG GTCAGCGATC GAGTCTTCCA GGCTCAACGG GACGCGATCG AGACCCTGGC CAAGGGCGAG GCAGCCAACC CGGCGTTGCT GTCGGTGCTC GTGGACGGCC GCTACGCGCG GTACCAGCCG GATCTCTCCC GTCAGCCGGT ACGGCAGCTC GACGAACACC AGGCGGACGC GTTCCGGCGG GCGCTGGCGG TGCCGGACGT CCTGCTTGTC CTCGGCCCGC CAGGCACCGG CAAGACGACC ACGATCGTCG AGATCGTCAC CGCGTTGGTG GCCCTCGGCC AGCGGGTCCT GGTCACCTCG CACACGAACC GGGCGGTCGA CAACGTCGTC GAGAACCTGC CACCCCAGAT CAACAGCGTC CGGGTCGGCG CGGAGGACTC GATGACCTCA GCCGCCCGCC GCCGCGGCCT CGACACGCTG ACCGAACAGG TCCGGGCCGG AATCCGGGAC GCGGCTGAAC CGCGCGGTCA GCTGACCGAG TTCCGGCAGA GCCGCGCGGT CTTCGACCAG TGGCTCGGCC ACCTGAGGAA CAGTCTGGCG GCGGTGGACG CCGCGGAGGC CGCGCTGGTG TCTGTCGAGC AGGCGGTCGA CCACGCCGTA CAGGCACTCT CGCCGCAGCT CGCGGCCGAC CGCCGGTCGG CCGCGGACCG GCGGACCCGG CAGCTCGGCT CGGTGATCCG GCTGCGGGAG GAGCTGGCCC GGCGGCGGGA GGTGCTCGCC CGCCGCGCGA AGACCACCCC GTCGGACAGC CCCGAGGAGT TCTGGCGGCT GGTCACCCGG TTCCGGCGCT GGTTGCTGGC CCGGGCCCGG CGGCGGGAGC AGGGCGCACG GGAGGCGCTC GTGACGGCGC AGCAGGATCT GACCGAGGCC GAGGCGGCGG TGGAGGCGTT GTCGGCGCAG GCCGTCGCGC TGATCGCCGG CGACCCGGAG CACCGTCCGC TGCTCGCCGA GCGGGAACGT CAGATTGCTG CGCGTGACGG TGGCCACGGC GGCCTCCGGC GGACGGCCGA GGTCGTGCGG GGTGGCCTGC GCGAGGTGCT GCCGGCCCTG CCGACGTTGG CGGCACTGGA CGACGAGCCC GCTGACAGCG CCGGGTGGGT GCGGTTCCAC GACTGGGCGG TCGAGTCGTT CGCGACCGTC GGCCAGCGCG CGGACCTGCT CGGGCATTGG CACGACCAGC TGGCCAGCGC GGAGACCGAG CTCCAGCGCG AACTCGTCCG GTACGCCGAC GTGGTGGCCG CCACCTGCAT CGGCACCGCC ACCAGCAAGG TGCTCAGCGA CACCGTTTTC GACGTGGCCC TGATCGACGA GGCGGGCCAG ATCTCCACGC CGAATTTGTT GGTGCCGATG GTTCGGGCTC GCCGCGCGGT GCTGGTCGGT GACCAGCACC AGCTCCCGCC CTTCCTGGAC GAGGAGGTCC GCGGCTGGGC CGCCGGTCTG AAGAAGGACG GCCGGTACAC ACCGGAGGCG GTCACGAGGG TCAACGGGCT GCTGCGCGCG AGCGGCTTCG AGCTGCTGGT GCCCGGGGCC GCGGCGACCC GGGCGAACTA CGTCGAGCTG AACCTCCAGC GCCGGATGCC CGAACAGGTC GCGCGATTCG TTTCCGACGC GTTCTACGCG GGCCGGCTGG GAACCCGGCA CGGCGGTGGG CGAGACGACC CGTTGTTCCG GTCCGCGTTC GCGATGATCG ACACCTCCGA CCGTCCCGCC GGCGAACGGG GGGAGACGGC GCTGAAGGCC ACCGAGACAC GGCACGAGCA GGGGTATGTC AACGCGCTCG AAGTGGAGCT GATCGTCTCG CTCCTCGCGC GGGCCGCCGG CTGGTACCGG GACTGGGCCG TGATCGTCCC CTACAAGGCG CAGGCGAGGC GGATCAGCGA GGCACTCGCG GGGGAGTTCG GAGACCCGAC CACGGTCGCC GACAACGTCG GCACCGTCGA CTCGTTCCAG GGTGGCGAAC GGGATCTGAT CATCTACGGT TTCACCCGCA GCAACAGCCG GAACTCCGTG GGCTTCCTCA CCGAGCTGCG GCGGCTCAAC GTGGCGGTGT CCCGGGCCAA GCAGCAGCTC GTGATGGTCG GCGACCTCGC CACGCTGAGA TCGGCCACGG ACACCGAGTT CCGGAAGAAG ATCAGCATGA TGGAGGACCA CCTGCGCCGG GACGGCGACC TGAGGCTCTC CTGCGAAATC GAGAAGCTGC TGCGGGACCC GCGGTGA
|
Protein sequence | MVTAGGARRP DGDRPAGATI VTRADVMLPG ALAVVPSAKA TEQLQWAPTA LVTEFSELAA RGALPARMDL DRGQLILYTR RYVAFLYPTK FGDGYSLGHV SRRSFRDEDK LSRGALILHC RAPWGAFNDV RDIPRPQGTL SPRWASHWDQ LSLEWTSLGR VVDGASELPP HHRDYLALLD HVVEASRDIE LAKLSGIPEA PYREVEPTRE ERYSGRGVYS FSLARPPRGL PVGALVEVAE CGDPPMRGKV VRSGGRTLVV RFERTVDFQR IPPQGRLRAS VSDRVFQAQR DAIETLAKGE AANPALLSVL VDGRYARYQP DLSRQPVRQL DEHQADAFRR ALAVPDVLLV LGPPGTGKTT TIVEIVTALV ALGQRVLVTS HTNRAVDNVV ENLPPQINSV RVGAEDSMTS AARRRGLDTL TEQVRAGIRD AAEPRGQLTE FRQSRAVFDQ WLGHLRNSLA AVDAAEAALV SVEQAVDHAV QALSPQLAAD RRSAADRRTR QLGSVIRLRE ELARRREVLA RRAKTTPSDS PEEFWRLVTR FRRWLLARAR RREQGAREAL VTAQQDLTEA EAAVEALSAQ AVALIAGDPE HRPLLAERER QIAARDGGHG GLRRTAEVVR GGLREVLPAL PTLAALDDEP ADSAGWVRFH DWAVESFATV GQRADLLGHW HDQLASAETE LQRELVRYAD VVAATCIGTA TSKVLSDTVF DVALIDEAGQ ISTPNLLVPM VRARRAVLVG DQHQLPPFLD EEVRGWAAGL KKDGRYTPEA VTRVNGLLRA SGFELLVPGA AATRANYVEL NLQRRMPEQV ARFVSDAFYA GRLGTRHGGG RDDPLFRSAF AMIDTSDRPA GERGETALKA TETRHEQGYV NALEVELIVS LLARAAGWYR DWAVIVPYKA QARRISEALA GEFGDPTTVA DNVGTVDSFQ GGERDLIIYG FTRSNSRNSV GFLTELRRLN VAVSRAKQQL VMVGDLATLR SATDTEFRKK ISMMEDHLRR DGDLRLSCEI EKLLRDPR
|
| |