Gene Information |
Locus tag | Francci3_1922 |
Symbol | |
ID | 3906871 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 2258115 |
End bp | 2259755 |
Gene Length | 1641 bp |
Protein Length | 546 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637879259 |
Product | transposase IS66 |
Protein accession | YP_481026 |
Protein GI | 86740626 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3436] Transposase and inactivated derivatives |
TIGRFAM ID | |
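The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which is consistent with the stated gene length) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a complete CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(2258115, 2259755)
    print(length)                  # 1641
    print(protein_length(length))  # 546
```

Both values agree with the Gene Length and Protein Length fields of this record.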
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.745599 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCTGTTC TGTCTGTCAC CGATGATGTC ACCGAGGTGG CGTACTGGCG TGGGCGTGCC GAGCGGGCCG AGGAGTGTGC GGAGAAAGCC GAGGCCCGTG TCGGGCAGCT GCAGCTGCGG GTCGAGGAGT TGAGCGAGCA GGTCGCGGTG CTGTCCCGGA TGCTGTTCGG TCGTTCCTCG GAGAAGACCG GCCCGTCGTC GGCTGTGGAT GAGAAACCAG AAGATCGGCA GGATTCGGGC GGTGGGGATG CCGGCCGGCC GGCGCGTCAA CGCGGGCAGC GGCCGGGGAG CCGGGGGCAT GGCCGGCGGG ACTACTCGCA TCTGCAGACC CGCGAGGAGA TCCATGATGT GCCCGAGGTC GACCGTGCCT GCCCCGGGTG TGGGGTGGCG TTCACGCCGT TGGGGACCGA CGACAGCGAA CAGGTCGACT GGCAGGTCGT GATCACCCGG ATCGTGCATC GGCGGCGGCG GTATCGGCGG TGCTGCACAT GTCCGGGGCC GCGGACAGTG ACCGCGCCGG TGCCACCCAA ACCGATTCCC AAGGGCCGGT TCACCGCGGG GTTCCTCGCC CGCCTTCTCT ACGAGAAGTA TGTCCTGGGC CTGCCGTTGC ACCGGATCGC TCGGGCGCTG GCCGCCGCCG GGCTCGGTGT TGCCGAGGGC ACTCTGTGTG GGGCGTTGAA GGACGTGCAT GGACTGCTCG GCGGGCTCGA TGAGCAGATC GTGGCGCGTA ACGCCGCCGC CGGTCATGTC CACGCGGACG AGACGACGTG GCGGGTGTTC GAGCGGGTCG AGGGCAAGGA CGGGACCCGC TGGTGGCTGT GGGTGTTCGT CGCCGCCGAC ACGGTGGTGT TCCGGATGGA CCCGACCCGC TCGGCTGCCC CGGTCGAGAA GCACTTCGGG ATCGACCGGG CCGCCGGGGC GCTGTCCGAC GGACGTCGCC TCGTCGTCTC GTCGGACTTC TACACCGTCT ACCAGTCCCT GGGCCGCGTC GACGGAGTCG ACCCGCTCTG GTGCTGGGCA CACATCCGCC GGTACTTCAT CCGGGCCGGG GACGCCCACC CCCAACTGCG GTACTGGGCC GACCAGTGGG TCGCCCGGAT CGGGATGCTC TACCTCGCTC ACCGCGCCCT CGCCGCCGAG CAGCCCACAA CCGGCGGCTA CCGCGAGGCC GCCGGCGCGT TCGAGGCCGC GCTGAGGGCG ATCGACACGG CGCGGCGCGC GGAGGCGGCG ATCCACAGCC TGCACCCGGC GGCGAAGAAG GTCCTGGCGA CCCTGGACCG GGAATGGGAC GGGCTGGCCC GCCACCAGGA CTTCCCCGAC CTGGATCTTG ACAACAATGC TGCCGAGAGA GCGCTACGGA CCCCGGTCGT CGGGCGGAAG AACTACTACG GCGCACACGC TGAGTGGGCC GCGCACCTCG CCGCCCGGGT CTGGACCATC GTCGCCACCG CGGAGCGTAA CGGCCGTGAA CCCCTCGCGT TCCTGACCGG CTACCTGAAC GCCTGCGCCA CAGCCGGCGG GAAAGCACCC GCCGGCCCCG CCCTCGAACC CTTCCTCACC TGGCAGACCA CCACCCAGAC CGGCAGCCCT CCCAGCACCG ACCCACCCCA GGACGGCCCA CCCGACGGGC CCGAGCCCTA A
|
Protein sequence | MSVLSVTDDV TEVAYWRGRA ERAEECAEKA EARVGQLQLR VEELSEQVAV LSRMLFGRSS EKTGPSSAVD EKPEDRQDSG GGDAGRPARQ RGQRPGSRGH GRRDYSHLQT REEIHDVPEV DRACPGCGVA FTPLGTDDSE QVDWQVVITR IVHRRRRYRR CCTCPGPRTV TAPVPPKPIP KGRFTAGFLA RLLYEKYVLG LPLHRIARAL AAAGLGVAEG TLCGALKDVH GLLGGLDEQI VARNAAAGHV HADETTWRVF ERVEGKDGTR WWLWVFVAAD TVVFRMDPTR SAAPVEKHFG IDRAAGALSD GRRLVVSSDF YTVYQSLGRV DGVDPLWCWA HIRRYFIRAG DAHPQLRYWA DQWVARIGML YLAHRALAAE QPTTGGYREA AGAFEAALRA IDTARRAEAA IHSLHPAAKK VLATLDREWD GLARHQDFPD LDLDNNAAER ALRTPVVGRK NYYGAHAEWA AHLAARVWTI VATAERNGRE PLAFLTGYLN ACATAGGKAP AGPALEPFLT WQTTTQTGSP PSTDPPQDGP PDGPEP
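The gene sequence as listed begins with GTG and ends with TAA, so it appears to be given in the coding orientation even though the gene lies on the minus strand. The sketch below shows how the protein sequence could be reproduced from it with translation table 11, and how a GC percentage could be computed. It is an illustration under stated assumptions, not the database's own pipeline: the helper names are hypothetical, the amino-acid assignments are those of the standard genetic code (which table 11 shares), and the start-codon set is a simplified subset of the table 11 initiators.

```python
from itertools import product

# Standard codon order (T, C, A, G); table 11 uses the same amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: simplified subset of the table 11 start codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the blank-separated 10-base grouping used in the record."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-orientation sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an alternative start codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First 30 bases of the gene sequence in this record.
    print(translate("GTGTCTGTTC TGTCTGTCAC CGATGATGTC"))  # MSVLSVTDDV
```

The first ten residues match the start of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the Protein sequence and GC content fields to be checked in the same way.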