Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_2957 |
Symbol | |
ID | 3903772 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 3505324 |
End bp | 3507585 |
Gene Length | 2262 bp |
Protein Length | 753 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637880278 |
Product | DEAD/DEAH box helicase-like |
Protein accession | YP_482044 |
Protein GI | 86741644 |
COG category | [R] General function prediction only |
COG ID | [COG1201] Lhr-like helicases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0630659 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGATCTCCC GGACGTCCGG GGCCTCGGGG GCTTCCGGGG CCCCGGGGGC TTCCGGGGCT TCCGGGGCCT CGGGCTTCGC CGCCCTCGAC CCGGCCGTCC AGTATCACGT CGTCAACAGT CTGCGGTGGC CTGCCCTGCG CCCGTTGCAG GAGGCGGCGA TACGTCCGGT CCTCGACGGG CATGACGTGC TGCTGCTGGC ACCGACCGCC GGTGGGAAAA CCGAGGCGGT GGCGCTGCCC CTGCTGTCCC GGATGGCCGG GGAGAAATGG CGCGGGCTGT CGGTGCTGTA TGTGTGTCCG CTGCGGGCGT TGCTGAACAA CCTCGAACCG CGGCTGGCCG CGATGTGCGA ATGGCTCGGC CGGCGGGCCG CCGTCTGGCA TGGAGATGTC GCCGACTCGG TGCGCGGGCG GCTGGCCGTG GATCCGCCGG ATCTTCTGCT GACCACCCCG GAGTCGATCG AGGCGATGCT CGTCTCCGCG CGCGTCGATC ATCGATGGCT CTTCGCCGGC CTGCGGACGG TTGTCGTCGA CGAGGTCCAC GCCTTCGCCG GTGACGACCG CGGTTGGCAT CTGCTGGGGG TGTTGGCCCG GTTACGCGGG CTCGCCAACC GGGCGGTACA GCGGATTGGT TTGTCCGCAA CTGTCGGGAA CCCGGACGTA CTGCTCGACT GGCTCGCGGA CGGATCCGCC CGGCCGCGCG CGGTCGTCCG GCCGAGTGAC GTGACGGCCA CCACGGACGT GACGGCCACC ACGGACGCGA CAGCCACCAC GGACGCGACA GCCACCACGG ACGCGACAGC CACCACGGAG GTCGGGATCG ACTACGTGGG CTCCCTGCGC AACGCCGCCC TCGTGGTCGC CCGCCTTCAT CAGGGGATGA AGAGGCTGGT CTTCTGCGAT TCCCGCAGCC AGGTCGAGGA GTTGACGGTT GCGCTGCGTG ACCTCGGGGT ACGGACGTAT GTATCCCACT CGTCCCTGTC ACGGGATGAC CGGCGGCAGG CCGAGGCCGC CTTCGCGACA TCCACCAACT GCGTCATCGT CGCCACTTCC ACCTTGGAGC TGGGGATCGA CGTCGGTGAC CTGGACCGCG TCATCCAGAT CGGTGCCCCC GCCACGGTGA TCTCCTTCCT GCAACGCCTC GGGCGGACCG GGCGGCGCCC CGGGACCCGA CGCAACACGC TGTTCCTGGC AACGTCGCAG AACTCTCTGT GGCTCGCGGC GGCTATCACG CTGCTGTGGC AACGCGGATA CGTCGAGCCG GTCGTCCCGC CCGTGCTTCC CCGTCACATC GTCGCCCAGC AGATCCTCGG ACTGGCCCTG CAGGAAGGCC GCCTCGTCGA CCGCGACGTC TGGCAGTGGC TCGGCGGGCT TGCCCGGACT CCGGGCGCGG TCGACGTACT GGCGCATCTG GCCCGCTCCG GGTTTGTCGT GCACACCGAC GGGCTGGTAT CGATCGGCCC GCGGGCGGAG AAGGAATACG GGCGACGCTA CTTCCTCGAC CTCACGTCAA CCTTCGCCGG CGAATCGATG GTGCAGGTCT GGTGGGGACG TAGCCTGCTC GGCCAGCTGC CACCGATCGC GCTCGCGGCT CGACCGGAGC ACGGACCGCG GGTGGTTCTT CTCGGCGGTC GTGCCTGGAA GGTGAATCAT GTCGACTGGC GGAGAAGACG AGTCCAGGTC GAACCGAGCC GCTACCCAGG CCGGTCCCGG TGGAACGGCG GGGCCCGGGT CATGTCCTAT CCGCTCGCCC AGGCTCACCT CGACGTCCTC GCCGGGCAAA CGGCCGGCAT CGAGATATCC CGGCGGGCCG CCGACGCGTT GGACAAGCTC CGCGCGCAGC ATTCCTTTGT CAGTGTCAGT GTCAGTGACG AGCCGGGCGA ACGCACCTAC CTCACCCGCG ACCCCGAGAA GCGGCCGATC TGGTGGACAT TTGCCGGGTT CGCGGCCAAT TCGGCCCTCG CCTCGGGGCT CGGCGAGCTG GTTGACGCTG ACGCGCCGGT CGGCGATCTC AGGCTGCGGC TTCGCCCGCA CGTCAACTCC GCCGCGCTAC GGACAATGCT CGATGTTCGA CGTGACGACC TCGTTGACGC TCTTCCGGCC ATCGATGTCG AGGCTGCCGA TGGGCTCAAG TTCTCCGCGG CGATTCCCCT CGAACTCGCG ATCGAGACGC TAGCCGCACG CCTCACCGAT CCATCCGCCG TCCACGCGAC GCTGAAGGCA ATCATCCGGG AGGAGACTGC GCCTCCGCCT GGGGGATGGT AG
|
Protein sequence | MISRTSGASG ASGAPGASGA SGASGFAALD PAVQYHVVNS LRWPALRPLQ EAAIRPVLDG HDVLLLAPTA GGKTEAVALP LLSRMAGEKW RGLSVLYVCP LRALLNNLEP RLAAMCEWLG RRAAVWHGDV ADSVRGRLAV DPPDLLLTTP ESIEAMLVSA RVDHRWLFAG LRTVVVDEVH AFAGDDRGWH LLGVLARLRG LANRAVQRIG LSATVGNPDV LLDWLADGSA RPRAVVRPSD VTATTDVTAT TDATATTDAT ATTDATATTE VGIDYVGSLR NAALVVARLH QGMKRLVFCD SRSQVEELTV ALRDLGVRTY VSHSSLSRDD RRQAEAAFAT STNCVIVATS TLELGIDVGD LDRVIQIGAP ATVISFLQRL GRTGRRPGTR RNTLFLATSQ NSLWLAAAIT LLWQRGYVEP VVPPVLPRHI VAQQILGLAL QEGRLVDRDV WQWLGGLART PGAVDVLAHL ARSGFVVHTD GLVSIGPRAE KEYGRRYFLD LTSTFAGESM VQVWWGRSLL GQLPPIALAA RPEHGPRVVL LGGRAWKVNH VDWRRRRVQV EPSRYPGRSR WNGGARVMSY PLAQAHLDVL AGQTAGIEIS RRAADALDKL RAQHSFVSVS VSDEPGERTY LTRDPEKRPI WWTFAGFAAN SALASGLGEL VDADAPVGDL RLRLRPHVNS AALRTMLDVR RDDLVDALPA IDVEAADGLK FSAAIPLELA IETLAARLTD PSAVHATLKA IIREETAPPP GGW
|
| |