Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6706 |
Symbol | |
ID | 5675019 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 8142724 |
End bp | 8144190 |
Gene Length | 1467 bp |
Protein Length | 488 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641245554 |
Product | hypothetical protein |
Protein accession | YP_001510946 |
Protein GI | 158318438 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.58989 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCGACC AGCTGTCCGC CGAGGACTAC ATCAAAGCCG TGCAGCGGCC CGATCTCGTG TTCAACCGTC CCGAGCTGAC CGAGGCCCGG TTCGACGTAC ACCCGATGCT GGGCATTCCC GTCCCGGCGT CGGGCAGCAC GGCGGTCGTG TTCCGGGCGA CGGTCGGCGG CACCGCGCAG GCGTTACGGT TCTTCACCCG CGAGGACGCT TCCACCCAGC AGCGCTACAC CGCGCTCAAC GCCTGGTTCA CCGAGCGTGG GCTCACCGGC GACGTCGCCG GCTGCCGGTG GGTGGACGAC GCGATTCTCA TCAACGGTCG GCGCTGGCCG ATGGTCCAGA TGGAATGGGT GGACGGGCAC ACCCTCGACC GGCACGTCGA GGACCTTGTC GAGGCCGACG ACCGCGCGGC GCTGGCGACG CTCGCCGCGT CCTGGCGTGA CCTGCTGCGC CGGACCCAGG CTGCCCGGTT CGCGCACGGC GACCTGCAAC ACGGCAACGT GCTCGTCGAC ACGGCCGGCC AGCTGCGCCT GGTCGACTTC GACAGCTCGT GGATCGCCCC GTTCACCGGC TCCTCGCCGC CGTCCGAGGG CGGACACCGC AACTACCAGC CCCAGAACCG GCCCTGGGGC CCGTGGATGG ACACCTTCCC CGGGCTCGTC ATCTACCTCT CGCTGCTGGC GCTCGCCCGC GAGCCGAAGC AGTGGGAGCA GCTCAACGAC GGCGAGAACC TGCTCTTCCG CGACAGGGAC TACCAGCCTC CGCACCGGAC CGAGGTGTGG TCGTGGGTCG AGCGGCTGCA CGACCCACGG ATCGACCAGC TCTCGGCCCG GCTGCGGGCC TGCTGCGCCC CAGGCTGGAC GGCCACCGGC GACCTGGAAG GGCTGATCTC GGGGCCCGTC CCCTGGTGGA CGCGCACGGC CGCCCACCCG CCAGCCGGCG CGTCGCCCGA GGCCACCACC GCACAGCCCG AGGCCACCAC CGCGCCGCTC CCAGCCGTCA CCGCGCCGGT CAAGGTCACC ACCGCGACGG TCCAGGGCAC CACCGCGCCG CGCCAGACCA CCGGCGCGCC GCCGTGGCCG CCTCCCCAGC CCGCTCCGCC GGCGGGGCAG ACCGTGTGGT GGAAGCAGCC GCCCGGGCAG CGGCCGGAAC CAGGGCGACC GGCACCGGCG CAGCCCGGAC CGGCCCAGTC CGGACCGGCC CAGTCCGGAC CGGCCCAGTG GGCTCCCGTG CGCGCCGCCG GGCGACGGCT GCGCGCAGTG GGCTGGTACC TGTTCGCAGC GGTCGCGGCC GTGGCGACCT GGGGGCTTGG CAGTGCCCTC ATCGCCGCCA CGCTGACCGG CCTGGCCCCC TCCGACGAGG CCGCCATCTC CGCGCTGCTC GCCCTCGTGC CAGCGCTGAT CGTCCTCGTC CTCCTGGTCC GGCGACGGCG AAGGCGACGG CGACAACCAC CGGGGAAGGC GCGATGA
|
Protein sequence | MSDQLSAEDY IKAVQRPDLV FNRPELTEAR FDVHPMLGIP VPASGSTAVV FRATVGGTAQ ALRFFTREDA STQQRYTALN AWFTERGLTG DVAGCRWVDD AILINGRRWP MVQMEWVDGH TLDRHVEDLV EADDRAALAT LAASWRDLLR RTQAARFAHG DLQHGNVLVD TAGQLRLVDF DSSWIAPFTG SSPPSEGGHR NYQPQNRPWG PWMDTFPGLV IYLSLLALAR EPKQWEQLND GENLLFRDRD YQPPHRTEVW SWVERLHDPR IDQLSARLRA CCAPGWTATG DLEGLISGPV PWWTRTAAHP PAGASPEATT AQPEATTAPL PAVTAPVKVT TATVQGTTAP RQTTGAPPWP PPQPAPPAGQ TVWWKQPPGQ RPEPGRPAPA QPGPAQSGPA QSGPAQWAPV RAAGRRLRAV GWYLFAAVAA VATWGLGSAL IAATLTGLAP SDEAAISALL ALVPALIVLV LLVRRRRRRR RQPPGKAR
|
| |