Gene Information |
Locus tag | Franean1_6709 |
Symbol | |
ID | 5675022 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 8149861 |
End bp | 8150919 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641245557 |
Product | transposase, IS4 family protein |
Protein accession | YP_001510949 |
Protein GI | 158318441 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
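The coordinate and length fields above can be cross-checked against each other. The sketch below assumes 1-based inclusive coordinates and that the gene length includes the terminal stop codon. Both assumptions are consistent with the values in this record, but neither is stated by it. The function names are hypothetical and chosen for illustration only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length_bp(8149861, 8150919)  # Start bp / End bp from this record
    print(length, protein_length_aa(length))
```

With the Start bp and End bp listed above, this reproduces the Gene Length (1059 bp) and Protein Length (352 aa) fields.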
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.556081 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCTTCAG TGTCGCCTAC CCGCGTTCCC GACGGTCGAC GGGCCCTGCT CGCGGGCCGG GCCGTCGCGC GGGCCGGAGG GCTGCTCGGC GCACTGGGCG CGGTGCCCGG CCGGCGGCGG GCGCGGGGGC GTCGCTACGG GCTGGCGTCG GTGCTGGGGC TGTGGTTCGC CGGGGTTCTG GCAGGTCAGC AGACGTTCAC CGCGGTCTGG GAGTGGGCGG TCGACCTGCC GGCGGAGTTG CTGGCCGGGT TCGGGCTGAC CCGGGGGATC CCGTCGGAGC GGACGACCCG ACGCCTGGTC GAGGGCTGCG ACCCGGTCGC GCTGGACGAG GCGCTCTCGG GCTGGATCGC CCGTGCCGCC ACGGTCGGGG ACCCGGGGCC GCGGGGGCTG GCCTTCGACG GGAAGACCCT CAAAGGCACC AGGTCGTTCA CCGAGGCGGG CGCGATGAGC CAGGAGGCGG TCCTCGAGGC GGTCTGGCAC GACACCGGGA TCACCGCCGG CCACCAGCGG GTCGTCGGCG GCGACGAGAT CGCCGCCCTC GAGGCACTGG CCGGCCGGCT CGACCTGACC GACGTCCTGG TCACGACGGC CGAGAAGGGC CACGGCCGGG TCGAGGTCCG TTCGTTGAAG GCGCTGACCG TCACCACCCC GAAACTCGTC GGCTTCTGGG GCACCAAGCA GGTCATCGAA CTGCGCCGAC GGACCCGACG CAAGAAGACC GTCACCGCGG CGCCGACCGT CTCCGAGGAG GTGTTCTACC TGGTCACCAG CCTGCCCGCC GAACAGGCCC ACCCCCGCGA CCTCGCCGCC CGCGCCCGCG CCCGCGGCCA CTGGACCGTC GAAGCCATCC ACCACGTCCG CGACCGCGTC CTCGACGAGG ACCGCCACAC CGCCCGCACC GCCAACGCCC CACTCGCCTG GGCGATCGCC CGCGACACCG CCATCAGCGC GCTACGCCTC ACAGGACACA GAAGCATCGC CAAAGCCCTG CGAACCACCG CCCGCCAGCC CGAACGCGTC CTCCAGACCA TCGCCCTGAT CAGCGAAAAG GGACTTTGA
|
Protein sequence | MPSVSPTRVP DGRRALLAGR AVARAGGLLG ALGAVPGRRR ARGRRYGLAS VLGLWFAGVL AGQQTFTAVW EWAVDLPAEL LAGFGLTRGI PSERTTRRLV EGCDPVALDE ALSGWIARAA TVGDPGPRGL AFDGKTLKGT RSFTEAGAMS QEAVLEAVWH DTGITAGHQR VVGGDEIAAL EALAGRLDLT DVLVTTAEKG HGRVEVRSLK ALTVTTPKLV GFWGTKQVIE LRRRTRRKKT VTAAPTVSEE VFYLVTSLPA EQAHPRDLAA RARARGHWTV EAIHHVRDRV LDEDRHTART ANAPLAWAIA RDTAISALRL TGHRSIAKAL RTTARQPERV LQTIALISEK GL
|
| |
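The gene sequence begins with GTG while the protein sequence begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is one of the alternative initiation codons and is read as methionine when it starts a CDS. The sketch below illustrates this with a small hand-rolled translator. It is not the pipeline that produced this record, and the names `translate`, `CODON_TABLE` and `START_CODONS` are hypothetical.

```python
from itertools import product

# Amino acids for translation table 11, in the conventional TCAG codon order.
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}

# Initiation codons listed for table 11; each is read as Met at the first position.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna: str) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # alternative start codon read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-base groups of the gene sequence in this record.
    print(translate("GTGCCTTCAG TGTCGCCTAC CCGCGTTCCC"))
```

Applied to the first 30 bases of the gene sequence, this yields the first ten residues of the protein sequence listed above.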