Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1457 |
Symbol | |
ID | 5669861 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 1753223 |
End bp | 1754365 |
Gene Length | 1143 bp |
Protein Length | 380 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641240377 |
Product | integrase family protein |
Protein accession | YP_001505803 |
Protein GI | 158313295 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4973] Site-specific recombinase XerC |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.417277 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCCAGGC CCTCACTGAA CCTCGGGACC TACGGCAACC TGTACACGGC GAAGTACGGC GACGGCTACC GGGCACGCGC CCGGTACCGC GACTTCGACG GCCGCACGCG GCTCGTCGAA CGCCACGCCA AGACCAAGGG CGCCGCCGAG CAAGCCCTAC GCACGGCCCT GCGGGACCGC GCCCGTGTCG ACGTCGGCAC CGGCGCCATC ACCGCCGAAG CAAAGGTCGC CGTTCTTGCC GAGGCCTGGT ACGAGTCCGT CCAGCGGCAG GACCGCTCCC CCAACACCAC CGCCGCCTAC CGGACCCGAC TCGACAAGTC CGTGATCCCT GGCCTCGGCG AGCTACGTAT CCGAGAGCTG ACGGTCGGCG TCGTCGACCG CTTCCTGTCC ATCATCGCCG AGAAGCATGG ACCGGCCGCC GCCAAGCAGA CCCGCGCCGT CCTCTCCGGC ATGTGCGGTC TCGCGGCCCG CCACGACGCC CTGGACCGCA ACGTCGTACG CGACGCCGGA CCGATCGCCG AGGCCACCTC CAAGGATCAG CCGCGGGCCC TCACCCTCGA CCAGCTCCGA GAGCTGCGCG TCGCGCTCCG GAACGACCCG AAGGCCGTCG GGCGCGACAT CCCGGAGTTC GTCGACCTAC TCATGAGCAC CGGCGTCCGC ATCGGCGAAG CCGCCGGGCT GACGTGGGGT GCAGTGAACC TCGACGAGGG CTGGATCGAG ATCCGCTCGA CCGTCGTACG GATCAAGGGC CAAGGTTTGT TCAACAAGCC GAAGCCCAAG ACCAAGGCCG GCCACCGCCG CCTGCAACTC CCATCCTGGA TGATCCGGAC CCTGAAGCAA CGCTTCGACA ACCAGCCGGA CGACGTGACG GTCTTCCCCG CTCAGCTCGG CGGACTACGC GACCCGTCAA ACACTCAGGC CGACCTACGC GACGCATTCA AGGCCGTCGG CATGGAGTGG GCAACCTCCC ACATCGTCGG CCGCAAGTCC GTCGCCTCAG CAATGGATAG CGCTGGCCTT ACAGCACGCG CCGCCGCCGA CCAGCTCGGA CACCGCCAAG TGAGCCTCAC TCAAGACCGC TACTTCGGCC GCAAAGTCGC CGAGACTGGG GCAGCCGCGA TACTTGAAGA ACTGGATGTT TGA
|
Protein sequence | MARPSLNLGT YGNLYTAKYG DGYRARARYR DFDGRTRLVE RHAKTKGAAE QALRTALRDR ARVDVGTGAI TAEAKVAVLA EAWYESVQRQ DRSPNTTAAY RTRLDKSVIP GLGELRIREL TVGVVDRFLS IIAEKHGPAA AKQTRAVLSG MCGLAARHDA LDRNVVRDAG PIAEATSKDQ PRALTLDQLR ELRVALRNDP KAVGRDIPEF VDLLMSTGVR IGEAAGLTWG AVNLDEGWIE IRSTVVRIKG QGLFNKPKPK TKAGHRRLQL PSWMIRTLKQ RFDNQPDDVT VFPAQLGGLR DPSNTQADLR DAFKAVGMEW ATSHIVGRKS VASAMDSAGL TARAAADQLG HRQVSLTQDR YFGRKVAETG AAAILEELDV
|
| |