Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0038 |
Symbol | |
ID | 5668464 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 45693 |
End bp | 46814 |
Gene Length | 1122 bp |
Protein Length | 373 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641238967 |
Product | integrase family protein |
Protein accession | YP_001504412 |
Protein GI | 158311904 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4974] Site-specific recombinase XerD |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.967515 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTGGATC GGGAGTCGGT GGCGGGCGCG GCGGGGCTGT ACCTGGTGGC TGGGGTGCCG CTGTTGCGGG CGGATGAGCA GGTGTTCGCG GCGATGCTGG AGGGCTGGGG GAGCCAGCAG ACGGCCCGCA ATCTGGCGGT GGGCACGGTC GAGGGGCGGG TCCGCGCGGT GCGGGCGTTC ACCCGACACG CCGAGGCGTT TCCCTGGGCC TGGACGGCGG CGATGGTCGA CGAGTGGTGT ACCGATCTTC GGGCGGTGAG GAACCTGCGT CGGTCGACGC TGCGGAATTA CCAGGAGGCG GTCCGGCTGT TCTGCGTCTA CGTCACCGAT CCCGCCTACG GTTGGCCGGT CCGGTGTGAG CAGGAGTTCG GCGCGCATCC GGTGCAGATC TGCCACGAGT GGAACACCAC GGTGCACGCC CAGCAGTCCG AGAGCGACCC CGGCAAGCGG GCGTTCACGG TCGATGAGCT GCAGGCGTTC TTCGACTACG CCGACGAGCA GGTCAGCCGG ATCCGGGCGG CTGGGCGCAA GGGCTGGCTG CCGGCGTTCC GCGACGCGGT GCTGTTCAAG GTCGCCTACG CCTACGGGCT GCGTCGCAGG GAGGTGCGGA TGCTCGACCT CGCCGACTTC GGCCGCAACC CGCACGGCCC CGAGTTCGGG GACTACGGCG TCTGCCAGGT CCGGTTCGGC AAGGCCTCGA AGGGCTCGCC ACCGAAACGG CGCGGCGTGC TGACCGTCTG GACGTGGACC CCGGAGATCC TCGCGGAGTG GGTGGAGGAG TTCCGGCCGC TGCTCGCCCC GCAGGACTGC CCGGCGCTCT GGCCGTCAGA ACGGGCCCCG CGGGTCGCGC TCACGCAGAT CAACGCCCGG TTCTCCACCT ACCGCGACGC GCTCGGCTTG GACCCGGCGC TCGACGTCCA CTCGCTGCGC CGTTCCTACG TGACCCATCT GATCGAGGAC GGCTACGACG CGCTGTTCGT CCAGCAGCAG GTCGGTCACG AGCACGCCTC GACGACCGCG ATCTACATCT GCGTGTCCTC GGACTTCCGG ACCCGCACCC TGCGCCGCGC GCTCGACCAG ACCCTGGCGG CGGCCCTGAC GCCGACGGAG AAGGCGCGAT GA
|
Protein sequence | MVDRESVAGA AGLYLVAGVP LLRADEQVFA AMLEGWGSQQ TARNLAVGTV EGRVRAVRAF TRHAEAFPWA WTAAMVDEWC TDLRAVRNLR RSTLRNYQEA VRLFCVYVTD PAYGWPVRCE QEFGAHPVQI CHEWNTTVHA QQSESDPGKR AFTVDELQAF FDYADEQVSR IRAAGRKGWL PAFRDAVLFK VAYAYGLRRR EVRMLDLADF GRNPHGPEFG DYGVCQVRFG KASKGSPPKR RGVLTVWTWT PEILAEWVEE FRPLLAPQDC PALWPSERAP RVALTQINAR FSTYRDALGL DPALDVHSLR RSYVTHLIED GYDALFVQQQ VGHEHASTTA IYICVSSDFR TRTLRRALDQ TLAAALTPTE KAR
|
| |