Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_2347 |
Symbol | |
ID | 3904554 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 2719661 |
End bp | 2722411 |
Gene Length | 2751 bp |
Protein Length | 916 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 637879677 |
Product | transposase Tn3 |
Protein accession | YP_481443 |
Protein GI | 86741043 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4644] Transposase and inactivated derivatives, TnpA family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.669911 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.26387 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCGGTGG AGTTCCTGAC GGATGAGCAG GTTGCGGCGT ACGGGCGGTT CGATGGTGAA CCTGTGCGGG CTGAGTTGGA GCGGTTCTTC TTTCTGGATG ACGCCGACCG GGAGCTCGTC GTCCGGCGCC GGGGCGAGCA CTCGCGTCTG GGGTTCGCCG TTCAGCTGGG CACGGTGCGG TTCCTTGGGA CGTTCCTGAG CGATCCGCTG GAGGTGCCGT GGGTGGTGGT GGACTATCTC GCGGGCCAGC TGGACGTTGC GGACTCGTCG GTGGTGAAGC GGTACACCGA GCGGCTGCCG ACGCAGCATG AGCATGCTCG GGAGATCCGG TCGGTCTACG GCTACCGGGA CTTCGCCGAC CCGCAGGCAG CCACGGAGCT GCGGGCGTTC CTGGCGAGCC GGGCGTGGAC GCAGGCAGAG GGTCCGGTGG CGCTGTTCGG TCAGGCCACG GCGTGGCTGC GCCGTAGCCG GGTGCTGTTG CCAGGGGTGA GTGTTCTGGC CCGTCTGGTG GTCTCCGTGC GCAGCGAGGC CACCGACCGG GTGCACCAGG AGCTCGCCGT CGCGGCTGAG CAGGCCGATC CGCAGTTGGC CGGACGGCTG CGCAGTCTCC TCGAGGTGGC GCCGGGTGCC CGGGTAAGCG AGCTGGAGCG GCTGCGGCGG GGGCCGACGC GGACGTCGGG GCCGGGCTTG GAGCGGGCGC TGTCACGGGC CGGCGAGGTC ATCGTGGTGG GCGCCGGCGC GGCGGATGTG TCGGGGGTGC CAGCGAACCG GCTGGCGACG CTGGCCCGCT ACGGCCTGGC GGCCAAGGCG CCGCTGCTAC GCGAGCTCGC CGAGCCGCGG CGTACCGCGA CCCTGCTGGC GACGGTGCGC GGTCTGGAAG CGGCGGCGGT CGACGACGCC CTGGACCTGT TCGACGTGCT GATGGCCACC CGGCTGCTCA ACCCGGCGCG TCGCGCCGCG CAGCAGGAAC GCCTCGCCGT CCTGCCGAAG CTGGAGAAGG CGTCGGTCAC CCTCGCCGGC GCGGCGTCCG CGCTGCTGGG CCTGCTGGCC GGCACGGCCG GCGAGGCGCT CGACGTCGCC GCAGCCTGGG CGGTCGTGGA GCAGGTCGCG CCGCGGGAAC GGGTGCTGGA GGCCGCCGCT CTGGTCGAGG AGCTGGTGCC CGGCGACGCC GGCCTGGAGA CGGTGATGCG CGCCGAGCTG GCCCGCCGTT ACGGCACGGT CCGCCCTTTT CTCCTCCTGC TCGCCGAGGC GCTGCCGCTG GCAGCGACCG GCGACGGCCA GCCGGTCCTC GACGCGGTCC GGCTACTGCC GAACCTGGTC GGCCGTCGCC GGATCCGCCC GGACGAGGTC GACATCGATC TGGTGCCGCC GGTGTGGCGG CGCGCGGTGC TGGCCAACCC GACGTTGCCG GCTGGCACGG TCGACCGCGA CGGCTACGTG CTGTGCGTGC TGGAGCAGCT GCATCGGGCG CTGCGCCGGC GCGACGTGTT CGCCGTGCGT TCGCAGCGAT GGTCTGACCC GCGGGCCCGG CTGCTCGCCG GCGAGGGATG GGATCGGGTC CGCGGCGAGG TGCTCGCCGG TCTCGGCCTG GCCGCCCCGG TCGAGACCCA CCTGCGTGAG CTCGCCGGCT CGTTGGACGC CGCGTGGCGC CAGACCGTCG ACCGGCTCGC CGAGGCCGGA CCGGACAGCC CGGTGCGGGT CGAGCCGGCG GCCGACGGGC GGATGCGCCT GGCCGTGAGC CGGCTGGAGG CGCTCGGGGA ACCGGACAGC CTGGTCGCGC TGCGGGCGAC AACCGCGGCG ATGCTGCCCC GTGTCGACCT GCCCGATCTG CTGCTGGAGG TGCACGCCTG GACCGGGTTC CTCGGCGCCT ACACCCATCT GGGCGGGTCG GGCAGCCGGA TGGAGAACCT GCACATCTCC CTCGCAGCGC TGCTGGTCGC CGAGGCATGC AACGTCGGGC TGACCCCGGT GACCAAGCCG AGCGAGCCGG CGCTGAGCCG CGACCGGCTC TCCCACGTCG ACCAGAACTA CCTGCGCGCC GACACCCACG CCGCGGCGAA CGCGGCTCTG ATCACCGCCC AGGCACAGAT CGGGCTCGCC CAGGCATGGG GCGGGGGGCT GGTCGCCTCC GTCGACAGAC TGCGGTTCGT CGTCCCGGTC CGCACGATCA ACGCAGGACC CAGCCCCCGC TACTTCGGGC TCAAACGCGG CGTGACCTGG CTGAACGCGG TCAACGACCA GGTCGCAGGG ATCGGCGCGG TCGTCGTGCC CGGCACTGTC CGTGACTCGC TCTACATCCT GGACACGCTG CTCACCCTCG ACGCCGGGCC GAAGCCGGAG ATGGTCGCGA CCGACACCGC CAGCTACTCC GACATCGTCT TCGGCCTGTT CCGCCTGCTC GGCTACCGGT TCTCCCCGCG GATCGCGGAC CTGTCCGACC AGCGACTGTG GCGTGCCACC CTGCCCGGCG ACGCCGAGGG CGACTACGGG CCGCTGAACG CCATTGCCCG GCACCGGGTC AACCTCGCCC GCGTCGGTGT CGTGCTGTGG AACACCCGCT ACCTCGACGC CGCCCTCACC GCACTGCGCA CCGCAGGCCA CGACGTGCAC GACCTCGACG TCGCCCGGCT GTCCCCGCTG GCCGACCGGC ACATCAACAT GCTCGGCCGC TACGCCTTCG CCGCCCCGCC CACCGGCACG GGTCTGCGCC CTCTGCGTGA CCCGGGCGAG AACGATCAGG AGGAGTCATG A
|
Protein sequence | MPVEFLTDEQ VAAYGRFDGE PVRAELERFF FLDDADRELV VRRRGEHSRL GFAVQLGTVR FLGTFLSDPL EVPWVVVDYL AGQLDVADSS VVKRYTERLP TQHEHAREIR SVYGYRDFAD PQAATELRAF LASRAWTQAE GPVALFGQAT AWLRRSRVLL PGVSVLARLV VSVRSEATDR VHQELAVAAE QADPQLAGRL RSLLEVAPGA RVSELERLRR GPTRTSGPGL ERALSRAGEV IVVGAGAADV SGVPANRLAT LARYGLAAKA PLLRELAEPR RTATLLATVR GLEAAAVDDA LDLFDVLMAT RLLNPARRAA QQERLAVLPK LEKASVTLAG AASALLGLLA GTAGEALDVA AAWAVVEQVA PRERVLEAAA LVEELVPGDA GLETVMRAEL ARRYGTVRPF LLLLAEALPL AATGDGQPVL DAVRLLPNLV GRRRIRPDEV DIDLVPPVWR RAVLANPTLP AGTVDRDGYV LCVLEQLHRA LRRRDVFAVR SQRWSDPRAR LLAGEGWDRV RGEVLAGLGL AAPVETHLRE LAGSLDAAWR QTVDRLAEAG PDSPVRVEPA ADGRMRLAVS RLEALGEPDS LVALRATTAA MLPRVDLPDL LLEVHAWTGF LGAYTHLGGS GSRMENLHIS LAALLVAEAC NVGLTPVTKP SEPALSRDRL SHVDQNYLRA DTHAAANAAL ITAQAQIGLA QAWGGGLVAS VDRLRFVVPV RTINAGPSPR YFGLKRGVTW LNAVNDQVAG IGAVVVPGTV RDSLYILDTL LTLDAGPKPE MVATDTASYS DIVFGLFRLL GYRFSPRIAD LSDQRLWRAT LPGDAEGDYG PLNAIARHRV NLARVGVVLW NTRYLDAALT ALRTAGHDVH DLDVARLSPL ADRHINMLGR YAFAAPPTGT GLRPLRDPGE NDQEES
|
| |