Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0485 |
Symbol | |
ID | 5668905 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 569226 |
End bp | 570305 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641239415 |
Product | integrase catalytic region |
Protein accession | YP_001504853 |
Protein GI | 158312345 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.444951 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGGTCC ACTGTGTCCG TACGCCTGCT GTATCTGATC TTCGTGCGGG TCTGCGGCTG GCTGGTCCTC CTCGGTCGTT CGTCGGCGTC CAAGGACATC GAGCTGCTGG TGTTGCGGCA CGAGGTCACC ATGCTGCGCC GTACCCAGCC CAAGCCCCGG TGGGACTGGG CGGACCGGGC GGTACTCGCC GCACTGATCC AGCTTTTGCC GAAGACACTG CGAGCGCACC GACTGGCCAC CCCCGGCACC GTCCTACGGT GGCACCGCCG TCTAATCACA CGGAAATGGA CCTACCCGCA CCGGACAGGA CGACCGCCGG TCAGCACGGA GATCGCGACC CTCATCGAGC GGCTCGCGAC CGAGAACACG ACGTGGGGAT ACCAGCGGAC CCAGGGCGAG CTCCTCACAC TCGGCCACCG CATTGGCGCG TCCACGATCG CCGGGTCCTG ACGTCCCTGG GGCTGCCCCC GGCACCGAAA CACCAGACCG ACACGACGTG GCGGCAGTTC CTGCGCACCC AGGCATCGAC GATGCTGGCG GTCGACTTCT TCCACGTGGA CTGCGCCGTG ACGCTGCGGC GTCTGTACTG CTTCTTCGTC CTGGAAGCCG GCTCCCGCTC CGTCCACATC CTCGGGGTCA CCGCCCACCC GGACGGGCTG TGGACCACCC AACAGATCCG CAACCTCCTC ATGGACCTCG GCGCCCGGAC AGCCGACTTC CAGTTCCTGA TCCGCGACCG CGCCGGGCAG TTCACCGCGT CCTTCGACGC GGTCCTCGCC GACGCCGGCA TCACCACCGT CAAGATCCCA CCCCGGACGC CCCGGGCGAA CGCCTACGCC GAACGGTTCG TCCACACAGT CCGGACCGAG GTCACCGACC ACATGCTGAT CGTCGGTGAA CGGCACCTAC GTTCTGTCCT GGCCGAGTAC GCCGCCCACT ACAACGGACG ACGACCCCAC CGCAGCCGCG ACCTTCAACC ACCACGACCC GACCACCCCA TCGCCGACCT GACCAAGGAA CGGATCAAGC GCCGCCCCGT CCTCGACGGC CTGATCAACG AATACGAACG AGCCACCTAA
|
Protein sequence | MMVHCVRTPA VSDLRAGLRL AGPPRSFVGV QGHRAAGVAA RGHHAAPYPA QAPVGLGGPG GTRRTDPAFA EDTASAPTGH PRHRPTVAPP SNHTEMDLPA PDRTTAGQHG DRDPHRAARD REHDVGIPAD PGRAPHTRPP HWRVHDRRVL TSLGLPPAPK HQTDTTWRQF LRTQASTMLA VDFFHVDCAV TLRRLYCFFV LEAGSRSVHI LGVTAHPDGL WTTQQIRNLL MDLGARTADF QFLIRDRAGQ FTASFDAVLA DAGITTVKIP PRTPRANAYA ERFVHTVRTE VTDHMLIVGE RHLRSVLAEY AAHYNGRRPH RSRDLQPPRP DHPIADLTKE RIKRRPVLDG LINEYERAT
|
| |