Gene Information |
Locus tag | Franean1_4867 |
Symbol | |
ID | 5673207 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5839016 |
End bp | 5841988 |
Gene Length | 2973 bp |
Protein Length | 990 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641243722 |
Product | DEAD/DEAH box helicase domain-containing protein |
Protein accession | YP_001509138 |
Protein GI | 158316630 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4581] Superfamily II RNA helicase |
TIGRFAM ID | |
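
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates, that the gene length includes a single terminal stop codon, and that GC content is the rounded percentage of G and C bases in the gene sequence.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq.

    Spaces, line breaks, and any other characters are ignored, so the
    space-separated 10-base blocks used in this record can be pasted in as-is.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    length = gene_length(5839016, 5841988)
    print("gene length (bp):", length)
    print("protein length (aa):", protein_length(length))
    # First two 10-base blocks of the gene sequence, used only as a demo input.
    print("GC% of first 20 bases:", gc_percent("GTGTCCTCCG CGCCCGGAGC"))
```

Under these assumptions the coordinates reproduce the listed gene length of 2973 bp and protein length of 990 aa. Passing the full gene sequence to `gc_percent` would give the figure to compare against the listed GC content.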
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.335008 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCCTCCG CGCCCGGAGC CGTCGAGGAG TTCGCGGCCC GGTACCCGTT CGGGCTCGAC CCGTTCCAGT CCGAGGCAGT CGCGGCGCTC GCGCAGGGCG AGGGTGTCCT CGTCGCCGCC CCGACGGGCG CCGGCAAGAC CGTCGTGGGT GAGTTCGCCG CCCACCTGGC GCTCGCGACC GGCACACGTT GCTTCTACAC CACCCCGATC AAGGCCCTGT CCAACCAGAA GTACGCCGAC CTGGTCGCCC GGTACGGCGC GGCCTCGATC GGCCTGCTGA CTGGCGACAC CTCGCGCAAC GGGGACGCTC CGGTGGTCGT GATGACCACC GAGGTCCTGC GCAACATGCT CTACACCGAG GCCGCGGGCA GCGCCCGGCT GGACTCCCTC GGCTACGTCG TGATGGACGA GGTCCACTAC CTCGCCGATC GCCAGCGCGG AGCCGTCTGG GAAGAGGTGA TCATTCACCT GCCCCAGCAC GTGCGGCTCG TCTCGCTGTC GGCGACGGTC AGCAACGCGG AGGAGTTCGC CGAGTGGCTG GTCACCGTCC GCGGGCACAC CCGGGTGATC GTCAGTGAGC ACCGCCCGGT GCCGCTGTTC CAGCACGTCC TCGCGGACCG GACGCTGCAC GACCTGTTCG TCGACCAGCC GTCCGGGCTG GATCCGGGCG TCCCGGCGTT CTCGCGTCGC GGCCCGGGGC CGAACGGGCG GGGCGCTGCC CCAGGGTCCG TCGGGGGCGC CACCCCGGGC AGCCGGCCCG GCGCCCGTGC CGCTGAAGCC CGTGCCGGTG ATATCGCCGG TGCTGGTGCG GCGGCCGGGG GGCGGGCGGT CAACCCGGAC CTGCTCCGGC TCGCCCGGGA GGAGTCCCGC GCGGTGTACG AGCGCGGCCG CGGGCCCCGG TCGTCGCGGC CGGGCCGGCC CGGCGCCGGG AACGGGGCCG GGAACGGGCG GCGCCGCTCC GGGCCGCCGA ACCGGCCCGA CGTCATCGTC CGGCTGGACC GTGCCGGCCT GCTGCCCGCG ATCCTGTTCG TCTTCAGCCG GGTCGGCTGC GACGCCGCGG TGGCGTCCTG CATCCAGGCG GGTCTGCGGC TGACGTCCCC CGACGAGCAG CGCGAGATCC GGGAACACGT CCGGGCCCGG ACAGCGGGAG TGCCCCAGGC CGACCTAGCC GTCCTGGGCT ACTGGCAGTG GCTCGAAGGG CTGGAGCGGG GTATCGCCGC GCACCACGCA GGCATGCTCC CCACCTTCAA GGAGGTGGTG GAGGAGCTGT TCGTGCGTGG GCTCGTCCGC GCGGTGTTCG CCACCGAGAC CCTGGCGCTC GGGATCAACA TGCCCGCGCG CACCGTCGTC CTGGAACGGC TCACCAAGTT CAACGGCCAG ACCCGCGCGG ACATCACGCC CGGCGAGTAC ACGCAGCTCA CCGGGCGGGC CGGGCGGCGC GGGATCGACG TCGAGGGCCA CGCCGTCGTG CTGTGGCAGC CCGGGCTGGA CCCGCTCGCC CTCGCCGGCC TGGCCTCGAC CCGGACCTAT CCGCTGAAGT CGTCGTTCCG GCCGTCCTAC AACATGGCTG TCAACCTCGT CGGCCGCCTC GGCGCCGAGC GCGCGCGCAC CGTCCTGGAG TCGTCGTTCG CCCAGTTCCA GGCGGACAAG GCCGTCGTCG GCATCGCCCG CGCGGTGCGC CGTAACCAGA CGGCGATCGA GGAGCTGACC GCCGCCCTCG AATGCGACCG CGGCTCGGTC ACCGAGTACG ACGGGCTGCG CCGCCAGATC CGGGAGCGCG AGGCCGACCT GTCCCGCGCG 
GGGACGGTGC GCCGCCAGTC CGAGGTCGCC GCGGCGCTGG CGAAGCTGCG CAGCGGCGAC GTGGTGCGCG TGCCGGTCGG GCGGCGCGGC GGGCTTGTCG TGGTGCTCGA CGCCGGTGTC GACGGGGGCT CGGCGGAAGG GCCGCGCCCC GTCGTCCTCA CCGAGGACCG CCAGGTCCGG CGCCTGTCCA TGATTGATTT TCCGGTGGCC GTCGAGCCGC TGGCCCGGGT CCGTATCCCG AAGTCGTTCA ACCCCCGCTC GCCGCAGGCG CGCCGCGACC TCGCCTCGTC GCTGCGCAAC ATCCGGCTGC CGGAGGAGCC CGGCCGCCGC GAGCGGGCGC GTTCGCTGGC CGCGGACGAC GCCGAGCTGG CCCGGCTGCG CCGCGCGATG CGGGCCCATC CCGTCCACGA CTGCCCGGAG CGGGAGGCGC ACCTGCGCTC GGCGGAACGC ATCGACCGCC TGCGCCGCGA GACAGCCGGG CTCGAGCGCA AGGTCGAGGG GCGGACGAAC ACGGTGGCCC GGACCTTCGA CCGGGTCCGC GACACGCTCG CCGAGCTCGG TTACCTGGCC GTCGGCGGCG ACTCCGTCAC CGACGCGGGC GCGATGCTGG CGCGCATCTA CACCGAGCAG GACCTGCAGG TCGCGGAATG CCTGCGCACG GGAGTGTGGG AGGGGCTGAC CCCACCCGCG CTGGCCGCCG CGGTCTCGAC GCTGGTCTTC GAGCCACGCG GTGACGACAT CGCCGCGCCG ACGATCCCCG GTGGGGGCGC GCTGCGCGAC GCGCTCGCGG ACATGGCGGG TGTCTACACC CGGCTGGGCG CCGCGGAGGA CCATCACCGG CTCGGGTTCC TGCGCCCGCC CGACCTCGGC TTCGTCGCGG TCGCGCACGG CTGGGCCTGC GGGCGCGGGC TGGAGAAGGT CCTCGAGGAC GCCGGCGCGG ACCTGACGGC CGGGGACTTC GTCCGCTGGA TGCGCCAGCT CATCGACCTG CTGGACCAGA TCGCGCAGGT CGCCCCGGTC TACGCGGCGA AGACGTCCGC CGGGCCGGCC TCGGGTGGGC CCACGGCGCA GGAGCGCGCC GGCACCGACG GGATCCTCGG CGTCGGCCGG GCCGCCCGGG CTGCGATAGA CGCGATCCGC CGCGGCGTGG TCGCCTACTC GATGTCCGTG TGA
|
Protein sequence | MSSAPGAVEE FAARYPFGLD PFQSEAVAAL AQGEGVLVAA PTGAGKTVVG EFAAHLALAT GTRCFYTTPI KALSNQKYAD LVARYGAASI GLLTGDTSRN GDAPVVVMTT EVLRNMLYTE AAGSARLDSL GYVVMDEVHY LADRQRGAVW EEVIIHLPQH VRLVSLSATV SNAEEFAEWL VTVRGHTRVI VSEHRPVPLF QHVLADRTLH DLFVDQPSGL DPGVPAFSRR GPGPNGRGAA PGSVGGATPG SRPGARAAEA RAGDIAGAGA AAGGRAVNPD LLRLAREESR AVYERGRGPR SSRPGRPGAG NGAGNGRRRS GPPNRPDVIV RLDRAGLLPA ILFVFSRVGC DAAVASCIQA GLRLTSPDEQ REIREHVRAR TAGVPQADLA VLGYWQWLEG LERGIAAHHA GMLPTFKEVV EELFVRGLVR AVFATETLAL GINMPARTVV LERLTKFNGQ TRADITPGEY TQLTGRAGRR GIDVEGHAVV LWQPGLDPLA LAGLASTRTY PLKSSFRPSY NMAVNLVGRL GAERARTVLE SSFAQFQADK AVVGIARAVR RNQTAIEELT AALECDRGSV TEYDGLRRQI REREADLSRA GTVRRQSEVA AALAKLRSGD VVRVPVGRRG GLVVVLDAGV DGGSAEGPRP VVLTEDRQVR RLSMIDFPVA VEPLARVRIP KSFNPRSPQA RRDLASSLRN IRLPEEPGRR ERARSLAADD AELARLRRAM RAHPVHDCPE REAHLRSAER IDRLRRETAG LERKVEGRTN TVARTFDRVR DTLAELGYLA VGGDSVTDAG AMLARIYTEQ DLQVAECLRT GVWEGLTPPA LAAAVSTLVF EPRGDDIAAP TIPGGGALRD ALADMAGVYT RLGAAEDHHR LGFLRPPDLG FVAVAHGWAC GRGLEKVLED AGADLTAGDF VRWMRQLIDL LDQIAQVAPV YAAKTSAGPA SGGPTAQERA GTDGILGVGR AARAAIDAIR RGVVAYSMSV
|
| |