Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2447 |
Symbol | |
ID | 5670843 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2906394 |
End bp | 2908772 |
Gene Length | 2379 bp |
Protein Length | 792 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641241364 |
Product | putative ATP-dependent DNA helicase |
Protein accession | YP_001506785 |
Protein GI | 158314277 |
COG category | [R] General function prediction only |
COG ID | [COG3973] Superfamily I DNA and RNA helicases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.250058 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.190451 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTGACG GCGCGGCCGA GGGGCGCGAT GAGGATCCCC GGTCTGTGGA TGTCCGGCCG GCAGATCTCC GGCCAGCGGA CGCCCGGTCG GTGGATCCGT GGTCGGCGGA GCTCGCCGCC GAGCGCCGTC ACCTGGACCG CGCCCGCGCG GCGCTGGTTC GGATGCGGCA CGAGGCCGAG GCGACGGAGA TCGCCGAAGG CGACCCCGTC GCTGACAAGG TGACGAACGC GTCCCTGAAA GCGGCGCGGC GGCGGCGCCT CGAGGCGCTG AGCGACCCCG CGAGCGGGGC ACCGTTGTTC TTCGGACGGA TCGACTACGC CGCGGGCGCC GCCGCCGCGC CGGGTCGGCG CGTCCACCTG GGACGGCGCC ATGTGCGGGA GGCCGCCGGC GATGATCCGC TGGTCGTCGA CTGGCGCACC GACATCGCCC GTCCGTACTA CCGCGCCCAT AGGGGCGACG CGATGGGGCT CGTGGCGCGG CGCCGGTTCG GGTTCGACGG CCCCGAGCTC ACCGCGTTCG AGGAGGAACC ACTCGTCGAA CCGTCGCCCG GCCCGGCAGG CGCGTCACGA GCCGGCGGGC CGGCCGGAGG CGACCTGCTC GCCCGCGAGA TCGCGCGGCC CCGCTCGGGG CCCATGCGTG ACATCGTCGC GACGATCCAG CCCGAGCAGG ACGAGATCGT CCGGGCCGAT CTCGAGGTGA GCGTCTGCGT CCAGGGCGCG CCGGGCACCG GCAAGACGGC GGTCGGCCTG CACCGGGCTG CCTACCTGCT GTTCGCCTAC CGGGAGCGGC TCAGTCGCGC CGGGGTGCTC GTCGTCGGGC CGAACCGGGC CTTCCTGCGT TACATCGGCG CGGTCCTGCC CGCGCTTGGC GAGTTCACCG TCACCCACCG TAGTCTCAGC GCCCTCGTGG ACATCGCGAC GGCGCGCGGA ACCGACCCGG AGCCAGTGGC CGAGATCAAG AGCGACGCCC GGATGGCCGT GGTGATCCGC CGGGCGCTCG GCGCCACCCG GCCCGGCGCC TCCCCGCCGG CTGCCCTGAT CACGAGCTCG CTCGGTCGGT GGACGGTGAG CGGCGACGAG ATCGCCGGCT TGGCCGCCGG CATCCAGGCT GGCGGCCACC GCCATCGCGT GGCTCGGGAC CTGCTCGCCC GGCGGATCGC CGCCGCGGTG GTCCGCCGCG CCGAGGAGCG GGGGCACCTG CCGGCCGACT CCGCGGTGGA GCGCCTCGCC CGCACCCGGT CCGTCAGGGC GGCGGTCGAC GCCGTCTGGC CGCGCACCGA CGCGGCCGGG CTGATCCACC GGCTGTTCAC CGACCCCGCC CTGCTCGCCC ACGCCGCCGA CGGCGTGCTC TCCGCGCCCG AGCAGCGCCT GCTGCTGGCC GACCGGCCGA CGTCGCGGCG GGCCGTCCGC TGGTCGCCCG CCGACCTGTT CTGCCTGGAC GAGGCACTGG ACCTGATCGA CGGCGTGCCG GCGTTCGGGC ACGTCATCGT GGACGAGGCC CAGGACCTCT CGGCCATGCA GTGCCGGGCC GTCGGACGCC GCTGCGCCAC CGGCTCGCTC ACCGTCCTCG GGGATCTGGC CCAGGGCACC ACCGCCTGGG CCGCGCGCTC CTGGGAGCAG GCGCTTGCCC ACTTCGGTAA GCCCGCCGCG CGGCTGGAGG TGCTGACCCG AGGCCACCGG GTACCGGCCG AGATCCTCTC GTTCGCCGAC CGTCTGCTCC CGAGCATCGC GCCGGATCTG CCTCCCGCGT CGTCGACCCG CGCCGTCCCG GGTGCGCTGC GGGTGATCGC CGCCCCGCCG GACGACCTGG CCGGCGAGGC TGCACACCAG GTTCGTGCCG GGCTGGGCCG CGGCGGCTCG GTGGCGGTGA TCGCCCCCGA CGAGGCCGTC GGCGGCCTGA CGGCGTCGCT GCGCGCGGCG GGCCTGCCCG CCACCGCCCT CACCGACGAC GCCGCCCTCA CCGACGACGC CGCCCTCACC GACGCCGCCC TGACGGAGGG CGCTTCGCTC ACCGACAAGC CCGTGCCGGA AGGGCGGGCG TCCCGGCCGG TGACTGTCGT GCCCGCGTCG CTCGCGAAGG GACTCGAGTT CGACCATGTG GTCCTCGTCG ACCCGGCGTC CGTCGCCGGC TCCGGGCCGG GCTCCGTCCA GGGGCTGCGC CGGCTCTACG TGGTGCTGAC CCGGGCGGTG ACCTCCCTGG CAGTGATCCA CGCGGGTGAG CTCCCGAGCG CTCTCGCGGA TCCGGGCGGC CCGTTCCCGC GGGTACCCCG GCCCGGGGGA GCGGGCCGCC CGGATCCCGC GCTGGTCAGG GGACTGGAGG TGGCTCCGGA GTCTGCGCGA CCTGCGAGTC GCCGTTCGAC CGGGGCTGGC TCACGGTGA
|
Protein sequence | MGDGAAEGRD EDPRSVDVRP ADLRPADARS VDPWSAELAA ERRHLDRARA ALVRMRHEAE ATEIAEGDPV ADKVTNASLK AARRRRLEAL SDPASGAPLF FGRIDYAAGA AAAPGRRVHL GRRHVREAAG DDPLVVDWRT DIARPYYRAH RGDAMGLVAR RRFGFDGPEL TAFEEEPLVE PSPGPAGASR AGGPAGGDLL AREIARPRSG PMRDIVATIQ PEQDEIVRAD LEVSVCVQGA PGTGKTAVGL HRAAYLLFAY RERLSRAGVL VVGPNRAFLR YIGAVLPALG EFTVTHRSLS ALVDIATARG TDPEPVAEIK SDARMAVVIR RALGATRPGA SPPAALITSS LGRWTVSGDE IAGLAAGIQA GGHRHRVARD LLARRIAAAV VRRAEERGHL PADSAVERLA RTRSVRAAVD AVWPRTDAAG LIHRLFTDPA LLAHAADGVL SAPEQRLLLA DRPTSRRAVR WSPADLFCLD EALDLIDGVP AFGHVIVDEA QDLSAMQCRA VGRRCATGSL TVLGDLAQGT TAWAARSWEQ ALAHFGKPAA RLEVLTRGHR VPAEILSFAD RLLPSIAPDL PPASSTRAVP GALRVIAAPP DDLAGEAAHQ VRAGLGRGGS VAVIAPDEAV GGLTASLRAA GLPATALTDD AALTDDAALT DAALTEGASL TDKPVPEGRA SRPVTVVPAS LAKGLEFDHV VLVDPASVAG SGPGSVQGLR RLYVVLTRAV TSLAVIHAGE LPSALADPGG PFPRVPRPGG AGRPDPALVR GLEVAPESAR PASRRSTGAG SR
|
| |