Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4192 |
Symbol | |
ID | 5672547 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4987001 |
End bp | 4989853 |
Gene Length | 2853 bp |
Protein Length | 950 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641243065 |
Product | DEAD/DEAH box helicase domain-containing protein |
Protein accession | YP_001508482 |
Protein GI | 158315974 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4581] Superfamily II RNA helicase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.443512 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCCGCG CCGGCGCCGG CCGGCCCTCG GCGAGGGTGC CCGGCAAAGG CGGGGATACC CACGCCGCCG CGGGTCGATG CGCCGCCGCG CCTCGGCCGG TGGCGGTTCA CCCGGCGTGG TTCACCCGGC GTGGCCGACG AGCCTGCGCG CCACGGACCG GACGGCGCGC GCCGCCGCGG GGGCGGCCCG TCGCCCGCGC CCGCCGTCCG TGGGGCACGA TCGAGAGCAC CATGACGCCG CACACCTCCA CGCCCGAGAA CACCTCACCC GACCTCGCCC CCGGCGGCAG CGCGCCCGGC GGCACACCCG GCAGCGCGCC GGGCCCGGCC GCCGGCTCGC CGTCGCTGGC CGACCGCCTG CCGGCCGGTC CCGAACCCGA CGCGGACGCC CTGTTCGACG CGTTCGGCGC CTGGGTGACC GACCAGGGCC TGGAACTGTA TCCGGCGCAG ACCGAGGCGC TGATCGAGGT GGTGAGCGGA GCGCACGTCA TCCTGGCGAC ACCGACCGGC AGCGGGAAGA GCCTGGTGGC CACCGGCGCC CACTTCGCCG CGCTCGGCTG GGGCCGGCGC ACCTTCTACA CCGCGCCCAT CAAGGCCCTG GTGTCGGAGA AGTTCTTCGC GCTGTGCGCG GTGTTCGGCC CGGCGCGGGT CGGCATGATG ACCGGTGACG CCTCGGTCAA CGAGACCGCG CCGATCATCT GCTGCACCGC CGAGATCCTG GCGAACATCG CGCTGCGTGA CAGTGCCACG GCCGACGTCG GCCAGGTCGT CATGGACGAG TTCCACTACT ACGCCGACCC GGACCGGGGC TGGGCCTGGC AGGTGCCGCT CATCGAGCTG CCCGACACGC AGTTCCTGCT CATGTCGGCG ACGCTGGGCG ACGTCACCCG GTTCGAGGAG GACCTGACCC GGCGCACCGG GCGGCCCACG GCGGTGGTGC GCTCGGCGCA GCGCCCCGTC CCGCTGTTCC ACCGGTATGT GACGACCCCG CTGCACGAGA CGATCGAGGA GCTGCTCGAG ACCCGCCAGG CGCCGGTCTA TGTCGTGCAC TTCACCCAGG CCTCCGCGCT GGAACGGGCC CAGGCGCTGA TGAGCGTGAA CGTCTCCACC CGGGCGGAGA AGGACGCCAT CGCGGAGATG ATCGGCGGCT TCCGGTTCAC CGCCGGGTTC GGGAAGACGC TGTCCCGTCT GGTCCGGCAC GGGATCGGCG TCCACCACGC GGGGATGCTG CCGAAGTACC GCCGACTGGT CGAGCTGCTC GCGCAGGCCG GCCTGCTCAA GGTCATCTGC GGCACCGACA CGCTCGGGGT GGGCATCAAC GTCCCGATCC GCACCGTGGT GTTCACCGCG CTGTCGAAGT ACGACGGCAC CCGCACCCGG ATCCTGACCG CGCGCGAGTT CCACCAGATC GCCGGGCGGG CCGGGCGGGC CGGCTACGAC ACCGTCGGCA ACGTCGTCGT GCAGGCCCCC GAGCACGTCG TGGAGAACGA GAAGGCGCTG GCGAAGGCCG GTGACGACCC GAAGAAGCGG CGCAAGGTCG TGCGCCGCAA GCCCCCCGAG GGCTTCGTCT CCTGGGGCGT GCCGACCTTC GAGCGCCTCG TCGCCGCCGA ACCCGAGCCG CTGACGTCGC AGTTCGCCGT CAGCCACTCG ATGCTGCTCA ACGTGGTCAA CCGGCCGGGG GACGCGTTCA GCGCGATGCG CCACCTGCTC ACCGACAACC ACGAGTCGCC GGCCGCGCAG CGCCGGCTCA TCCGCCGGGC GATCGCGATC TACCGGGCGC TGCTCGCCGC CGGGGTGGTG GAGAAGCTCG ACGCCCCGGA CGAGACCGGG CGCACCGTGC GGCTCACCGT CGACCTGCAG CTCGACTTCG CGCTCAACCA GCCGCTGTCC CCGTTCGCGC TGGCCGCGAT CGAGCTGCTC GACCGTGAGT CCCCCGACTA TCCGCTGGAC GTCCTGAGCC TGCTCGAGTC GACGCTGGAC AACCCGCGGC AGGTGCTGTC CGCGCAGCAG TTCCGCGCCC GCGGGGAGGC GGTGGCGGCG ATGAAGGCCG AGGGCGTCGA CTACGACCGG CGGATGGAGC TGCTCGACGA GGTCACCTGG CCCAAGCCGC TCGACGAGCT GCTGCACGCT GCGCTCGCCA CCTACCGGCG CGGCCACCCC TGGGTCGACG ACTACGAGCT GTCGCCCAAG TCCGTCGCCC GGGACATGTA CGAGCGGGCG ATGACCTTCG CCGAGTACGT CGACTTCTAC GGCCTGGCCC GCTCGGAGGG GCTGCTGCTG CGCTACCTGG CCGACGCGTT CAAGGCGCTG CGCCAGACGG TGCCCGAGGA CGCGCGCACC GAGGAACTCA CCGACCTCAC CGAGTGGCTG GGCGAGCTGG TCCGCCAGGT CGACTCCAGC CTGCTCGACG AGTGGGAGGC GCTGCGCGAC CCCGCTCCCG ACAGCGGCGC CGCGCCGGGC GGGCTGGTCG AGCGGACCTC GGCGGTCACC GCGAACACCC GCGCGTTCCG GGTGCTGGTG CGCAACGCGC TGTTCCGCCG GGTCGAGCTG GCCGCGCTGC GCCGCTACGA CCTGCTCGGC GACCTCGACG CGGCCGCGGG CTTCGACGCG CAGGCCTGGC GCGACGCCCT CGAGCCGTAC TTCGACACGC ACGCCGAGAT CGGGACGGGA CCGGACGCCC GTGGCCCCGG GCTGCTCATC GTGGAGACCG CCGCGGACCG CTGGACGCTG CGCCAGATCT TCGACGACCC GGCCGGGGAC CACGACTGGG GAATCAGCGC CGAGGTGGAT CTCGCCGCCT CCGACGAGGC CGGCATCGCC GTCCTGCGGG TGACGGCCGT CGACCAGCTC TGA
|
Protein sequence | MSRAGAGRPS ARVPGKGGDT HAAAGRCAAA PRPVAVHPAW FTRRGRRACA PRTGRRAPPR GRPVARARRP WGTIESTMTP HTSTPENTSP DLAPGGSAPG GTPGSAPGPA AGSPSLADRL PAGPEPDADA LFDAFGAWVT DQGLELYPAQ TEALIEVVSG AHVILATPTG SGKSLVATGA HFAALGWGRR TFYTAPIKAL VSEKFFALCA VFGPARVGMM TGDASVNETA PIICCTAEIL ANIALRDSAT ADVGQVVMDE FHYYADPDRG WAWQVPLIEL PDTQFLLMSA TLGDVTRFEE DLTRRTGRPT AVVRSAQRPV PLFHRYVTTP LHETIEELLE TRQAPVYVVH FTQASALERA QALMSVNVST RAEKDAIAEM IGGFRFTAGF GKTLSRLVRH GIGVHHAGML PKYRRLVELL AQAGLLKVIC GTDTLGVGIN VPIRTVVFTA LSKYDGTRTR ILTAREFHQI AGRAGRAGYD TVGNVVVQAP EHVVENEKAL AKAGDDPKKR RKVVRRKPPE GFVSWGVPTF ERLVAAEPEP LTSQFAVSHS MLLNVVNRPG DAFSAMRHLL TDNHESPAAQ RRLIRRAIAI YRALLAAGVV EKLDAPDETG RTVRLTVDLQ LDFALNQPLS PFALAAIELL DRESPDYPLD VLSLLESTLD NPRQVLSAQQ FRARGEAVAA MKAEGVDYDR RMELLDEVTW PKPLDELLHA ALATYRRGHP WVDDYELSPK SVARDMYERA MTFAEYVDFY GLARSEGLLL RYLADAFKAL RQTVPEDART EELTDLTEWL GELVRQVDSS LLDEWEALRD PAPDSGAAPG GLVERTSAVT ANTRAFRVLV RNALFRRVEL AALRRYDLLG DLDAAAGFDA QAWRDALEPY FDTHAEIGTG PDARGPGLLI VETAADRWTL RQIFDDPAGD HDWGISAEVD LAASDEAGIA VLRVTAVDQL
|
| |