Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0420 |
Symbol | |
ID | 5668843 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 494865 |
End bp | 495920 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641239352 |
Product | RNA-directed DNA polymerase |
Protein accession | YP_001504791 |
Protein GI | 158312283 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3344] Retron-type reverse transcriptase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGACAG GGCGAAGGGG AACAGGTGGC CGGATACTCA ACGGACGGAA GGCATGCGTA ATGCAGAGCG CCGAAATGGT CCTTGGTGTC CTCCGTGAAC GTGGAAGGAG AGGACTGCCG CTGGAGCGGG TGTATCGACA GTTGTTCAAC GCGGCGCTGT ACCTGGTGGC CTACGGGCGT CTGTACTCCA ACAAGGGTGC GATGACGCCC GGGGAGACCG TGGACGGCAT GTCGCTGGCT ACCATCGACC GCATCATCGA TGCGATGCGC CACGAGCGCT ACCGATGGAA ACCGGTGAAG CGGGTGCACA TCCCGAAGAA GAACGGGAAG AAACGCCCGC TGGGCCTGCC GACCTGGTCG GACAAGCTGG TCGCCGAGGT GGTGCGCCTG CTGTTGGAGG CGTACTACGA GCCGACCTTC TCCGACCACT CCCACGGGTT CCGCCCAGGC AGAGCCTGCC ACACCGCACT CGGTGAGGTG GTCGATGTCT GGAAGGGGAC GCACTGGTTC ATTGAGGGCG ACATCGCCCG CTGTTTCGAG GAGCTCGACC ATCAGGTCAT GCTCGACACG GTGGGCGAGA GAATCCACGA CAACCGGTTC CTGGGGCTCC TGAAGGCCAT GCTGCGCGCG GGGTATCTGG AGGACTGGAA ATGGGGAGCG ACACTGTCCG GAACGGTACA GGGCGGTCCG GCGTCCCCGA TCCTTTCCAA TATATATCTC GACCGGCTGG ACAGCTTCGT CGTGACACAC CTGCTCCCGG ACTACAACCG GGGCGAACGC AGGGCATCCA ACCCTGCCTA CCAGAAAATC GAATATGCGA TCGCGCGTGC CCGACGGCAC GGCGACCGGC CAGCATTACG CCGGCTTCGC CAGCAACGCC GCCAGCTGCC CAGCCAGGAT CCCCACGATC CCAGCTATCG GCGGCTACGG TACGTAAGGT ACGCCGACTT ATGCCGACGT CGGCATAAGT CCGCTTATGC CGAGGGCCGG GTTATGCCGA CCGGACTGCT GGAGGGTCTG CGTGTCAGGC AGGTCCTCGT CGTGGGGGCG GCATAA
|
Protein sequence | MSTGRRGTGG RILNGRKACV MQSAEMVLGV LRERGRRGLP LERVYRQLFN AALYLVAYGR LYSNKGAMTP GETVDGMSLA TIDRIIDAMR HERYRWKPVK RVHIPKKNGK KRPLGLPTWS DKLVAEVVRL LLEAYYEPTF SDHSHGFRPG RACHTALGEV VDVWKGTHWF IEGDIARCFE ELDHQVMLDT VGERIHDNRF LGLLKAMLRA GYLEDWKWGA TLSGTVQGGP ASPILSNIYL DRLDSFVVTH LLPDYNRGER RASNPAYQKI EYAIARARRH GDRPALRRLR QQRRQLPSQD PHDPSYRRLR YVRYADLCRR RHKSAYAEGR VMPTGLLEGL RVRQVLVVGA A
|
| |