Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4148 |
Symbol | |
ID | 5672504 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4928820 |
End bp | 4929917 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641243022 |
Product | TnpA family transposase |
Protein accession | YP_001508439 |
Protein GI | 158315931 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4644] Transposase and inactivated derivatives, TnpA family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.160216 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.449147 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTCAGCGA CCAGGTGCGC CAGGCACAGC CCGACCGACG TCGCCGACGG CGCCTCCAGC GTCACGGCCA GTCCGACCAC GCACAGGCTG GCCACGTCGA TCGCCACGGT CACGTACGGC CGGCCGATCG GCAGCCGCTG CCGTTCAGTG GAGCCGGGTG ACATGGGCTT CTACGAGTGG ACCGGCCGCA CGATCAAGTA CCACCGGGTG CAGATCTGTG AGCACCTGGG CTTCCGGGAG TGCTCGGTGG CCGACGCGGA GCGGACCAGG TGGGGCGCCG TCCCGCTGAT CGACATGCTC AAGGAGGCAG TGCTGCGGAC CGGCTGCCTC GCGGCCGCGT CCACGGCGGC CGGCCGCGGT GACCTGGCCC CTGAGGTGCT CGCGGAGCGG CTGATGCTGG CGATCTACGC CTACGGGACC AACACCGGCA TCCGCGCGGT CGCCGGGTCC GCCCAGCACG GTCACAGTGA GGACGACATC CGCTACGTGC GCCGTCGCTA CCTGACCGCC GAGCTCGCCC GGACGGTCGC CGTCGAGATC GCCAACGCCA CCTTCGCCGT CCGGGCGCAG CAGGTCTGGG GTGCCGGCTC CACTGCGGTG GCCAGCGACT CGACGCACTT CGGCGCGTTC GACCAGAACA TCTTCACCGA GTGGCACTCC CGCTACGGCG GCCGCGGCGT GCTGATCTAC TGGCACGTGG AGCGCAAGAG CATGGCGATC CGCTCCCAGC TGATCAGCTG CTCGGCGTCC GAGGTCGCCG CGATGGTCGA AGGTGCGATG CGCCACGGCA CGGCGATGGA GGTCGAGGGC ACCTACGTCG ACTCCCACGG CCAGTCCGAG ATCGGCTTCG GCGTCACCCG CCTGCTCGGC TTCGACCTGC TACCCCGCAT CAAAGCCCGT TACCTGCGCA ACCGGGACCT GCAACGCGAG ATCAACGAGG GCCTGAACGT CGTCGAGTCC TGGAACCGCG CCAACAGCGT CATCTTCTTC GGCAAGGGCG GGGACATCGC CACCAACCGC CGCGACGAGC AGGAGCTATC GGTTACCGGC CATCGGCGAT ACCGGCCGGT CCGTCGAAGA AGCTGCGTGA GCACATGA
|
Protein sequence | MSATRCARHS PTDVADGASS VTASPTTHRL ATSIATVTYG RPIGSRCRSV EPGDMGFYEW TGRTIKYHRV QICEHLGFRE CSVADAERTR WGAVPLIDML KEAVLRTGCL AAASTAAGRG DLAPEVLAER LMLAIYAYGT NTGIRAVAGS AQHGHSEDDI RYVRRRYLTA ELARTVAVEI ANATFAVRAQ QVWGAGSTAV ASDSTHFGAF DQNIFTEWHS RYGGRGVLIY WHVERKSMAI RSQLISCSAS EVAAMVEGAM RHGTAMEVEG TYVDSHGQSE IGFGVTRLLG FDLLPRIKAR YLRNRDLQRE INEGLNVVES WNRANSVIFF GKGGDIATNR RDEQELSVTG HRRYRPVRRR SCVST
|
| |