Gene Information |
Locus tag | Francci3_2884 |
Symbol | |
ID | 3906015 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 3399255 |
End bp | 3400943 |
Gene Length | 1689 bp |
Protein Length | 562 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 637880205 |
Product | transposase IS66 |
Protein accession | YP_481971 |
Protein GI | 86741571 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000946302 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.757994 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCAGGG TTGTGGCTGA GCTGTCCGCG TCGTCGTGGG AGTCGCGTCT CGCGGCTGCC GAGGCGCGGA TCGAGGAACT CGCCGTCGGG CAGGCCGCGT CGGTGGCGGC GAACGAGCGG TTGGTGTCGG TCAACGAGCG GCTGCGTCGG GTCGTGGAGG ACTCGGCCGC GCGGCATGAG GTGGAGCTTG GCGCGGTGCG CGCCGAGCGG GACCGGGCCG TGCGGCGGGT CGAGGAGCTT GAGCTCGAGT TCGCCGAGCT GCGTCGGCGG TTGTCGATGG ACAGTACGAA CTCCTCTGTG CCGCCGTCGA AGGAGCCGGT CGGGGCGAGG GAGAAGCGGA AGGCCGAGCG GTGGGCGTCC TCGGAGCGGG TCCGGTCGAA GGACAGGAAG CCGGGTGGGC AGCCGGGCCA TCCGGGGGCG GGGCTGGCGC GGGAGGAGAC GCCGGACCGG TCGTTGTCGG CGGACCCGCC GTCGGAATGC TCGTCGTGCC AGGCGGATCT GTCCGACGCG GGGGTGCTGG CGGACGGGTG GGCGCAGGTC TGGGACCTAC TCGAACCGGT GCTGGAGAAG GTCGAGTGGG TGTTGCCTCG GCGGCGCTGC GCGTGCTGCG CGAAGGTCAC CACCGCCGTG GTTCCGGGGG TGCGGCACGC GGCGGCGGGC GCGGTGTCCT ACGGGCCCAG GCTGCACGGC GCGGCGGTGC TGCTCGCCTC CGAGGGCAAC GTGCCGGTCG AGCGGGCCGC GATGGTCATC GGCGCGCTGC TCGGGGTGCC GGTGTCGGCC GGGTTCGTCG CCCGCGCGAA CGCCCGGCTC GCCCAGGACC TGAAGGCCGC CGGGTTCGAC GAGGCGATGA AGGCGGCGCT GCGGGCCGAG CCGGTGCTGT GCGGGGACGA GTCGCCGGTC AACGTGCTGC GCAAGGATCG TGACGAGGCC ACCGGCACCG CCCTGTCGGG CACCCCGCAC CTGCTGGTGG TGCGCACCCC GGCGCCCGGG CTGGTCTGGT ATGCGGCGAT CAGCTCCCGC TCGTCCGGGG CGATCGACGC CACCGGCGTC CTCACCGGCT GGCACGGCTA CCTGACCCGC GACGACTACG CCGGCTGGCA CCAGTACGAC CCCACGCTCG CCGGCGTGCA GTTGTGCTGT GCCCATCTCA TCAGGGCGTT GCGTGCGGTG CTGATCCTCG CGCCGAAGGT CCAGAAGTGG GCGGGCCGGC TCATCGACCT GCTCCGCGAG GCCAACGGCC TGGTCGTCGC CGCCCGCGCC GCCGGGAACA GCCGCCTCGG CCAGGCCGCG ATCGACGCGC TGCGGGCCCG CTACGACGCC GACGTCCGGA TCGGCGAGCT GACGAACATG TCCCGCCCCT GGGCCGACGA GAAGAACCAT CCCGGCCTGG TCCTGGCCCG CCGGCTCGCC GCGAAGGCAG ACCAGGTCTG GCTCTTCACC ACCGACTTCA AGATCCCATG GACGAACAAC CCCAGCGAGC AGGCCATACG CCTCCCGAAA CGCCACCAGG CCGTCTCGGG CTACTGGCAC ACCCCCACCA TGGCCGCCTA CCTACGCGTG CGCTCCTACC TCGTATCCGC CCGCGACCAC GGCCTGACCG CCGTCGACGC CATCCGCCTC GCCCTCGCCG GTACGCCCTG GATGCCAGCC AGGCGCGCGG CCAGCCCCAC ACACGCTCTC GCCACCTAA
|
Protein sequence | MIRVVAELSA SSWESRLAAA EARIEELAVG QAASVAANER LVSVNERLRR VVEDSAARHE VELGAVRAER DRAVRRVEEL ELEFAELRRR LSMDSTNSSV PPSKEPVGAR EKRKAERWAS SERVRSKDRK PGGQPGHPGA GLAREETPDR SLSADPPSEC SSCQADLSDA GVLADGWAQV WDLLEPVLEK VEWVLPRRRC ACCAKVTTAV VPGVRHAAAG AVSYGPRLHG AAVLLASEGN VPVERAAMVI GALLGVPVSA GFVARANARL AQDLKAAGFD EAMKAALRAE PVLCGDESPV NVLRKDRDEA TGTALSGTPH LLVVRTPAPG LVWYAAISSR SSGAIDATGV LTGWHGYLTR DDYAGWHQYD PTLAGVQLCC AHLIRALRAV LILAPKVQKW AGRLIDLLRE ANGLVVAARA AGNSRLGQAA IDALRARYDA DVRIGELTNM SRPWADEKNH PGLVLARRLA AKADQVWLFT TDFKIPWTNN PSEQAIRLPK RHQAVSGYWH TPTMAAYLRV RSYLVSARDH GLTAVDAIRL ALAGTPWMPA RRAASPTHAL AT
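
The length fields in this record can be cross-checked against the coordinates. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length. The function names (`gene_length`, `protein_length`, `gc_percent`) are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # Values copied from the record for Francci3_2884.
    length = gene_length(3399255, 3400943)
    print("gene length (bp):", length)
    print("protein length (aa):", protein_length(length))
```

With the coordinates above, the computed values agree with the Gene Length (1689 bp) and Protein Length (562 aa) fields. `gc_percent` can be applied to the Gene sequence field to compare against the reported GC content.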