Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_2078 |
Symbol | |
ID | 3905605 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 2444291 |
End bp | 2446024 |
Gene Length | 1734 bp |
Protein Length | 577 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 637879414 |
Product | transposase IS66 |
Protein accession | YP_481180 |
Protein GI | 86740780 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0408516 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.686964 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCTGCGGT GCGTGACGGT TGTCGAGTCG GGGGCGGGCG CTGCCGCGAG CGGTGAGGTT GCCGAGGGCG CGGCGCTGCT GGCGGAGAAC GCCTGGCTGC GGGCCCGGGT CGCGGAGCTG TTGACGGACA TCGCCGGGCT GGTCGCGCGG GAGGCGACGC GTGAGGCCGA GGTGGTGGAG CTGCGTCTCC AGCTCGAGGC GTTGCAGGCG GAGCTGGCGA CGTTGCGGCG GATGCTGTTC GGCCGGTCGT CGGAACGGGA GTGCGGCGGG TCGCCGGCCG TGGGTTCGCC GGATGGCGGG GACGGTTGTG GCGACGGGGC GCGGGGCGAG GCCGCCGGGT CGGCAGGCCG GCGGCGGGGG CCGGGCGCGC GCTCGGGCCG GCGGAGCTAC GACCATCTGT CCCGCGACGA GGTCGACTGC GACTTCGAGG GCGGGGGCTA TGGCTGCCTG TCGTGTGGGC AGCCGTTCAC GCCGTGGGGC GAGCATGTCG TCGAGCAGCT CGACTGGCTG GTGACGGTGC GGGTTCGGGT GTCGAGGCGG CGCCGGTATC GGCGGGGCTG CCGCTGTGGC GGGTCGTTGA CGGTGACCGC GCCGGGACCG TCGAAGGCGA TCGGGAAGGG CCTGTTCACG CACCGGTTCC TCGCGATGCT GATCGTGGAG CGCTATGTCG CGGGCCGTTC GCAGAACTCG CTGGTCACCG GGTTGGCCCG GCACGGCGCC CAGCTCTCGC CGGCGACGCT GACCGGGGCG TGCGCCCAGG TCGCGGGCCT GCTCGCCCCA CTCGCCGAGC AGATCGTCGG GCGGTCGCGG GGGTCGTGGC ACCTGCACGC CGACGAGACG ACCTGGCGGG TGTTCACCCC GACCGGCGGC GGCGGGCCGG CCCGCTGGTG GCTGTGGGTG TTCCTGGGGC CGGACAGCGT CTGTTTCGTG ATGGACGCGA CCCGCTCGAC GGCGGTGCTC GCCGAACACG TCGGCCTCGA CCCGGACAGC GGCCAGCTGA CCGACGACGC CGACGGCGGA CCGCGCCGCC TCGTGCTGTC GTCGGACTTC TACACCGTGT ACGTCTCCGC CGGCCGCCGC GCCGATGGCC TGGTCAACCT GTACTGCTGG GCGCACGCGC GGCGGTACTT CGTGCGGGCC GGCGACGCGA ACCCCGCCCA GCTCGGGATC TGGGCCCGCC AGTGGGTCGA GCGGATCCGC GCGCTCTACA CCGCGCACGG CGAGCTCGCC GCCGCCTGGC ACACCGCCGC CGCGGCCCCG TCGCCGGCCA CCGAGAAGCG GCTCGCCGCC GCGTACGCCG GCTGGGACAC CGCGATCACC GTGATCGACA CGGTTCGCCG CGAGCAGACG GCCTCGCCCG GCCTGCAGGA ACCCGCGCGC AAGGCGCTCG CGACCCTGGA CCGGGAATGG GACGGGCTGG TCGCCCACCG CGACTACCCC ATGATCGGCA TGGACAACAA CCCGGCGGAA AGGGCGATCA GGGGCCCGGT CGTGACCCGG CGCAACGCCG GCGGCTCCCG CACCGAGGAC ACCGCCCGCC ACGCCGCCAC GATCTTCACG GTCACCGCGA CCGCCGCGAT GCACAACCTG AACCTGCTGA CCTACCTGGA GAACTACCTC GACGCCTGCG GCCGGGCCGG CGGCAAGCCG CCGACCGGCG CCGACCTCGA CCGGTTCCTG CCCTGGGCCG CCAGCCCCGA GGACCTCACC ACCTGGCAAC AGCCTCCCGG CTGA
|
Protein sequence | MLRCVTVVES GAGAAASGEV AEGAALLAEN AWLRARVAEL LTDIAGLVAR EATREAEVVE LRLQLEALQA ELATLRRMLF GRSSERECGG SPAVGSPDGG DGCGDGARGE AAGSAGRRRG PGARSGRRSY DHLSRDEVDC DFEGGGYGCL SCGQPFTPWG EHVVEQLDWL VTVRVRVSRR RRYRRGCRCG GSLTVTAPGP SKAIGKGLFT HRFLAMLIVE RYVAGRSQNS LVTGLARHGA QLSPATLTGA CAQVAGLLAP LAEQIVGRSR GSWHLHADET TWRVFTPTGG GGPARWWLWV FLGPDSVCFV MDATRSTAVL AEHVGLDPDS GQLTDDADGG PRRLVLSSDF YTVYVSAGRR ADGLVNLYCW AHARRYFVRA GDANPAQLGI WARQWVERIR ALYTAHGELA AAWHTAAAAP SPATEKRLAA AYAGWDTAIT VIDTVRREQT ASPGLQEPAR KALATLDREW DGLVAHRDYP MIGMDNNPAE RAIRGPVVTR RNAGGSRTED TARHAATIFT VTATAAMHNL NLLTYLENYL DACGRAGGKP PTGADLDRFL PWAASPEDLT TWQQPPG
|
| |