Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4741 |
Symbol | |
ID | 5673083 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5661882 |
End bp | 5663357 |
Gene Length | 1476 bp |
Protein Length | 491 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641243598 |
Product | DEAD/DEAH box helicase domain-containing protein |
Protein accession | YP_001509014 |
Protein GI | 158316506 |
COG category | [J] Translation, ribosomal structure and biogenesis [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG0513] Superfamily II DNA and RNA helicases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTCTTCGT TCACCGATCT TGGCGTGCCC GCGCGCCTGA CCGATGTTCT CACCCGTCTC GCCATCGAGA CCCCCACCCC CATCCAGCGG GCGACGCTGC CGGACGCGCT CGCCGGACGT GACGTCCTCG GCCGCGGCCG CACGGGGTCC GGCAAGACGC TCGCGTTCCT CCTGCCCGTG GTCGCGCGGC TGTCCGGCGG GCCCCGGGCC CAGGCCGGGG CTCCGCGCGC GCTGGTTCTG GCGCCCACCC GCGAGCTCGC CGCGCAGATC GACACGGCGC TGCGACCGCT GGCCGCCGCG GCCGGCCTCA CCTCGTGCAC CGTGTTCGGC GGCGTCGGTC AGGAACCGCA GGTGGCCGCG ATCCGGCGGG GTGTCGACAT CGTCGTGGCC TGCCCCGGCC GGCTGGAGGA TCTCCTCGCG CAGGGGCGCT GCTCGCTGGC CCAGGTGAAG ATCGTCGTGT TGGACGAGGC CGACCACATG GCCGATCTGG GGTTCCTGCC CGCGGTGCGC CGGCTGATCG GCCAGACCGC GGCCGGTGGG CAGCGCATGC TGTTCTCGGC GACCCTGGAC AAGGCCATCG ACACCCTGGT GCGCCAGTAC CTCAGCAGGC CCGCGGTACA CGAGGCCGAC TCCGCCCAGT CCCCGGTGGG CACGATGGCG CACCATCTGC TGCACGTCGA GCGGGACAGC CGGCTTCCCG TGCTCGTCGA CCTGTCCAGC GCCCCGGGCC GCGCGGTCGT GTTCACCCGC ACGAAGCACG GCGCCAAGGC GCTGGCCCGC CAGCTCAACC GCGCCGGCGT GCGCGCGGTC GAGCTGCACG GCAACCTCAG CCAGAACGCC CGTACCCGCA ACCTCGACGA CTTCCACACC GGCCGGGCCT CGGCGCTGGT CGCCACGGAC ATCGCCGCCC GCGGCATCCA TGTCGACAAC GTGGCACTCG TGATCCACGC CGACCCACCG GCCGACCACA AGGCATACCT GCACCGTTCC GGGCGCACCG CGCGTGCCGG CAACAGTGGC ACCGTCATCA CCCTGGCGAC CACGGAGCAG ATCCGTGAGG TCACGCAACT GGCCCGCGCC GCCGGCATCA GGCCGACGAC CACCCGGGTC GCCTCCGCCG CTCACCCGAT ACTCGCCACG CTCGCCCCCG GCGCCCGGTC GCTCAGCAGC CCCGTGGTCG CGGCACAGGC GGAGACCAGG AGCACTCCAG CCCAGCGCCG GCCGGTCGGT GAAGACCGCG GCACCGAGCC GACGCCTCGA ACGGGCAGAC GGCGTGGACG CCGCCCCGCC GGCAACCACG GCGGCCGCGC CGCCGACCAC GGCCGCCGCG CCGGGCGGTC CACGGCGTCC CGATCGCCCG CGGTCACGGC TCGTACCAGC TCACCCGAGC ACGAGGCCGG CCCGTCACCC GCGAAGCGGT CCAGCTCGCG ACCGACCCAC AGCGCCGCGG CGTTCAGCGC ACGCCGCGCC CGCTGA
|
Protein sequence | MSSFTDLGVP ARLTDVLTRL AIETPTPIQR ATLPDALAGR DVLGRGRTGS GKTLAFLLPV VARLSGGPRA QAGAPRALVL APTRELAAQI DTALRPLAAA AGLTSCTVFG GVGQEPQVAA IRRGVDIVVA CPGRLEDLLA QGRCSLAQVK IVVLDEADHM ADLGFLPAVR RLIGQTAAGG QRMLFSATLD KAIDTLVRQY LSRPAVHEAD SAQSPVGTMA HHLLHVERDS RLPVLVDLSS APGRAVVFTR TKHGAKALAR QLNRAGVRAV ELHGNLSQNA RTRNLDDFHT GRASALVATD IAARGIHVDN VALVIHADPP ADHKAYLHRS GRTARAGNSG TVITLATTEQ IREVTQLARA AGIRPTTTRV ASAAHPILAT LAPGARSLSS PVVAAQAETR STPAQRRPVG EDRGTEPTPR TGRRRGRRPA GNHGGRAADH GRRAGRSTAS RSPAVTARTS SPEHEAGPSP AKRSSSRPTH SAAAFSARRA R
|
| |