Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0458 |
Symbol | |
ID | 5668879 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 541777 |
End bp | 543945 |
Gene Length | 2169 bp |
Protein Length | 722 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641239389 |
Product | resolvase domain-containing protein |
Protein accession | YP_001504827 |
Protein GI | 158312319 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1961] Site-specific recombinases, DNA invertase Pin homologs |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCATGG CGGGGTCTGT GCTGGGCAGC AGGTCGGAGA GCCGGATCCG GCCGGAGCAC ACTGATCGCG CGGCGGTGGT CTATGTGCGG CAGTCCAGCA GGCAGCAGGT CGTCGAGCAC GCCGAGTCGA CGCGGGTGCA GTATGCGCTG GTCGAGCGGG CCGTGACTCT CGGTTGGGCC CGGTCGCGGG TGACGGTCAT TGATGATGAC CTGGGGGTGT CGGCGGCGGT CGCGGCGTCC CGGCCGGGGT TCGCGCGGCT GGTCACCGAG GTGACGATGG GCCGGGTCGG TCTGGTGCTG GGGGTGGAGA TGTCCCGGTT GGCGCGCACC GGCCGGGACT GGCACCAGCT GATCGAGTTG TGTTCGTTGG CGGGGACGTT GCTCGCCGAT CTCGACGGGG TCTACGACCC CGGTGTCTAC AACGACCGTC TGCTGCTGGG GTTGAAGGGC ACGATGTCCG AGGCCGAGCT GTGGCTGATC CGGCAGCGGA TGTGGGGCGG GAAGCTGGCC AAGGCCGAAC GCGGCGAGCT GGCGTTCGCG CTGCCGATCG GCTACTGGCG CGACCGCGGC GGGCAGGTGG TGTTCGACCC GGACGAGCAG GCCCGGACCG TGGTCGGGCT GGTCTTCGAC CTGTTCGACC GGCTCGGGAC GCTCAACGGC GTGCTGCGCT GGCTGGTCGA CCACCAGGTG CAGCTGCCGG TGCGCTCCCA CAGCGGGGTG GACAAGGGTG AGCTGACCTG GCGGCGACCC AACCGGGAAA CCCTGCAGGT CATGCTGCAC AACCCCATCT ATGCCGGGTA CCACGCCTAC GGTCGGCGCC GCGTGGATGC GCGGCGCAGG AAGGCGGGCC GGCCCAGCAC GGGGCGGGTG GTGCGATCAA TGGACGACTG GCATGTGCTG CTACCCGACC GGATGCCGGC CTACATCGGC ACCGACCGGT ATGCCGCGAA CCTGGCCCGT TTGGAGGCCA ACCGGCAGAC CGCCGCCTCA CCCGGAGCGC CCAGGCCAGG ATCGGCACTG CTGGCGGGCC TGGTGCGCTG CGGACGGTGC GGACATCGGA TGACGGTCAG CTACCACACC CCGGCCAGCC GGTTCCCGTC GCACAACTAC CACTGCGGCT ACCTGCTCGC CACCTACGGC ACCGGCCGGA CCTGCCAGCA CCTCGCCGGC CCGGCACTGG ACCGCTACGT GACCGCCCAG CTGCTCGACG CCGTCGCCCC CGCCGCCCTG GAGGTCTCGC TGGCCGCCGC TGCCCACGCC GAATCAGACC GGGCCGAGCT GGACACCCTG TGGCGTCAGC GGCTGGAACG CGCCCGCTAC GCCGCCGGCC GCGCCCGGCG CCAGTACCAG CTCGCCGAAC CGGAAAACCG GCTGGTCACC CGCCAGCTGG AAACCGACTG GGAGACGGCG CTGGCCGACC TCGACCGGCT CGAAACCGAC TACCAGCGGT TCGTCGAGGC CCGCCCGCAG ACGCTCACCG CCGCCGAACG GGCGGCCATC ACCGCGCTCG CCCACGACCT GCCCGCGCTC TGGACAGCGC CGACCACCAG CCAGACCGAC CGCAAACAGC TCCTGCGCAC CCTGATCGAC GAGATCACCG TGACGGTCGT CGGCACCAGC GAACTCGTCG ACGTCACGAT CACCTGGGCC GGCGGGCATC AGACCCACGG CCGCACCACC CGCCCGGTCG CCCGCCTCGA CCAGCTCTCC TACTACCACC ACCTGGTCGA GCGGGTCAGC GAACTGGCCA GCGCCGGCCA CTCCAGCCGC CAGATCGCCG ACCAGCTCAA CACCGAGGGA CTACGCCCAC CCAAACGCAC CACCCGCTTC GGCCCCGACC AGATCCTCAC CCTCACCCGC CGACTTGGCA TCGGGGTCCA CCACCCCCGC GACACCCGCA CCGCCCTGGC CAACCCCGGC CCCGGCCGCT GGTCGGTCGC CGGCCTCGCT GTCGCCCTGA ACATGCCGAC CGCCACTCTC TACACCTGGA TCTACCGCGG ATGGATCACG GCGGAACGCC ATCCGGACGG CAGATCCTGG ATCATCCTCG CCGACGACGT CGAGATCAGG CAGCTCCGCG AACGCCGTGA CCGCCCACCC GGCTACTACA CCCGAGCCCG CTGGACCCGA CCCCACCTGG ACCACAGCAC GAACGGAACC CGGACATGA
|
Protein sequence | MSMAGSVLGS RSESRIRPEH TDRAAVVYVR QSSRQQVVEH AESTRVQYAL VERAVTLGWA RSRVTVIDDD LGVSAAVAAS RPGFARLVTE VTMGRVGLVL GVEMSRLART GRDWHQLIEL CSLAGTLLAD LDGVYDPGVY NDRLLLGLKG TMSEAELWLI RQRMWGGKLA KAERGELAFA LPIGYWRDRG GQVVFDPDEQ ARTVVGLVFD LFDRLGTLNG VLRWLVDHQV QLPVRSHSGV DKGELTWRRP NRETLQVMLH NPIYAGYHAY GRRRVDARRR KAGRPSTGRV VRSMDDWHVL LPDRMPAYIG TDRYAANLAR LEANRQTAAS PGAPRPGSAL LAGLVRCGRC GHRMTVSYHT PASRFPSHNY HCGYLLATYG TGRTCQHLAG PALDRYVTAQ LLDAVAPAAL EVSLAAAAHA ESDRAELDTL WRQRLERARY AAGRARRQYQ LAEPENRLVT RQLETDWETA LADLDRLETD YQRFVEARPQ TLTAAERAAI TALAHDLPAL WTAPTTSQTD RKQLLRTLID EITVTVVGTS ELVDVTITWA GGHQTHGRTT RPVARLDQLS YYHHLVERVS ELASAGHSSR QIADQLNTEG LRPPKRTTRF GPDQILTLTR RLGIGVHHPR DTRTALANPG PGRWSVAGLA VALNMPTATL YTWIYRGWIT AERHPDGRSW IILADDVEIR QLRERRDRPP GYYTRARWTR PHLDHSTNGT RT
|
| |