Gene Information |
Locus tag | Francci3_1950 |
Symbol | |
ID | 3904312 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 2291191 |
End bp | 2292396 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637879287 |
Product | transposase, IS4 |
Protein accession | YP_481054 |
Protein GI | 86740654 |
COG category | |
COG ID | |
TIGRFAM ID | |
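
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the stop codon (both assumptions are consistent with the values in this record, but the record does not state them). The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length(2291191, 2292396)
    print(length)                  # 1206, matching "Gene Length"
    print(protein_length(length))  # 401, matching "Protein Length"
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields reported for Francci3_1950.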
|
|
Plasmid Coverage Information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.143043 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCTGTAG TGTCCCCCAT GTTGTGGCTT GAGTGTCAGC AGGAGCAGGT TGAGGTCTCG ACAGCCGCGC AGGTCGCGGG GCTGGTCGCG GTGTTGCGGC GGGTACCGGA CTGGCGCGAC CCGCGGGGTG TTCGCTACGA GCTCGCGCCG GTCCTGGCGT TGTGGGTCGC GGGCAACATC GCAGGTCACG ACACCACCGT GGCGGTGTGG GAGTGGGCCT GCGCCCTCCC GGTCGGGGTG CTGGCCGGCC TCGGGTTCCC TCGTCGGGTG CCGTCGGAGC GCACGATCCG GCGGATCGTG GAGGAAGGCC CGCCCGGGCA GTTGGACCAG GCGCTGTCGG GCTGGACCGC CGCGGTCGCG CCCGGGCCCG CGGACCCGGG CGGCCTGGTA GCGGTGGCCT TTGACGGGAA GGTGATGAAG GGCGCCCGTT CCCGCCCGCC ACAGGGGTCG GTGCGCCAGG AGGCGGTCGT CGAGGCCGTC CGCCACGACA CCGGGACCGC GCTCGGGCAT CAACGGGTCG TGGCCGGCGA CGAGATCGCC TCGGTACGGA GGCTGGTCAA CCGGGTGTGT GACCACAACA CGCTGGTCAC GACCGACTGC CTGCACGCAC ACGAACCCTT GGCCCGGGCG ATCCGGGCGA AGGGCGGCCA CTGGCTTTTC TCGATCAAGG GCAACCAGCC CACCGTGCGG GCGAAGCTGG CCGGGCTGCC GTGGGACGAG TTCGGAAACC AGCACGTGAC CCGGGAGAAG GCCCACGGCC GGATCGAGGA ACGCGCGCTG AAGGCGTTGA CCCCCTCCGC GCCGTCCCTG GTCGGGTTCC GAGGTACCCG GCAGGTGGTC AAGCTGGCCC GGACCACCCG CCGTAAGAAG ACCACGACCA GCCCGGCTGC GACCTCGACC GAGGAGTTCT ACCTGGTCAC GAGCCTGTCC ACCGACCAGG CCAGCCCCGC GCAGCTGGCC CGGTGGGCTC GTGGCCACTG GACGGTCGAG GCCATTCACC ATGTCCGCGA CCGGACCATG GACGAGGACC GCCACACCAT CCGCACCAAG AACGCCGCCC TGAACTGGGC CATCGCCCGC GATACCACGA TCAGCGCACT ACGCCTGGCA GGCTACAAGA ACATCAGACA GGCCCGCCGA GCCACCATCA GAGACCCTGG GCTCGTCCTC CAGATCATCG CCCTGACCAG CCAAAACAGA CTTTGA
|
Protein sequence | MSVVSPMLWL ECQQEQVEVS TAAQVAGLVA VLRRVPDWRD PRGVRYELAP VLALWVAGNI AGHDTTVAVW EWACALPVGV LAGLGFPRRV PSERTIRRIV EEGPPGQLDQ ALSGWTAAVA PGPADPGGLV AVAFDGKVMK GARSRPPQGS VRQEAVVEAV RHDTGTALGH QRVVAGDEIA SVRRLVNRVC DHNTLVTTDC LHAHEPLARA IRAKGGHWLF SIKGNQPTVR AKLAGLPWDE FGNQHVTREK AHGRIEERAL KALTPSAPSL VGFRGTRQVV KLARTTRRKK TTTSPAATST EEFYLVTSLS TDQASPAQLA RWARGHWTVE AIHHVRDRTM DEDRHTIRTK NAALNWAIAR DTTISALRLA GYKNIRQARR ATIRDPGLVL QIIALTSQNR L
|
| |
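
The relationship between the gene sequence, the GC content field, and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the database's own method: the helper names are hypothetical, the codon assignments are those of the standard genetic code (which translation table 11 shares), and the set of alternative start codons is an assumption based on the commonly cited initiation codons for table 11. The example uses only the first 30 bases of the gene sequence above.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TTT, TTC, TTA, TTG, TCT, ... order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: initiation codons commonly listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base groups and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an alternative start codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    prefix = "GTGTCTGTAG TGTCCCCCAT GTTGTGGCTT"  # first 30 bases of the gene
    print(translate(prefix))  # MSVVSPMLWL
```

The gene begins with GTG rather than ATG, so the first residue is methionine only because GTG is treated as a start codon; the translated prefix matches the first ten residues of the protein sequence listed above. Applying `gc_fraction` to the full gene sequence would give the value to compare against the GC content field.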