Gene Information |
Locus tag | Franean1_3231 |
Symbol | |
ID | 5671606 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 3818821 |
End bp | 3820425 |
Gene Length | 1605 bp |
Protein Length | 534 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641242124 |
Product | X-Pro dipeptidyl-peptidase domain-containing protein |
Protein accession | YP_001507544 |
Protein GI | 158315036 |
COG category | [R] General function prediction only |
COG ID | [COG2936] Predicted acyl esterases |
TIGRFAM ID | [TIGR00976] putative hydrolase, CocE/NonD family |
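The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption that matches the listed 1605 bp) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the final stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(3818821, 3820425)
print(length)                  # 1605
print(protein_length(length))  # 534
```

Both results agree with the Gene Length and Protein Length rows of the record.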
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCTCA ACCTGATCAG CCGGCTGTTA CAGCGAACCC AGCAGTTACC ACCGCCACTC ACCCGCGATC TCACCGTCGC ACGCGGCCTG CGGGTGCCGA TGCGTGACGG CGTCGAGCTG ACCGCCGACC ACTGGTTCCC CAGGGCCGGC GCCGCGGGGC TGCCGACCGT GCTGATCCGC ACCACCTACG GCAGCCACAG CTCAGCCACG TACCCGATCG TGCGGCCCAT CGCCGAGCGC GGGTTCCAGG TGCTGATCAC CAACTCGCGC GGCACCTTCG GCTCCGGCGG CGCCTTCGAC CCGTTCCGCA ACGAACGCGA CGACGGCTTC GACACCCTCG ACTGGGTCAT CGGACAACCC TGGTTCGGCG ACTCCATCGT GCTCTATGGA CCCAGCTACC TCGGCTACAC CCAGTGGGCC GTCGCCGATC AGGTGCCGCC GCAGGTCAAG GCCATGATCC CGATTCAGAG CGAAGCCGCG GTCATGCTGG AATTCCTGCG CCCGGACGGC TTCGCGCTGG AGATCCTGTT CATCTGGAGC TTCGTGGTGG ACGGGCAGGA AAGCCCGCTG GCTCTGCTCA GACACCCCGC CCTGGGCGGT AGGCGGAAGA TGCGCCGACT GATGGCCAGC CTGCCGCTCG AGCAGGCCGA CCTGCGCGGC GCCGGCCACC GGATCGACTA TCTGCAGAAC ATCCTGGCCC ACGATGCCGG CTCACCGCAC TGGGCGCCTG CCGACCACAG CGCCCGGGTC GCCGACGTGA CGATCCCGGT CAGCTCCATC GCCGGCTGGC ACGACTTCTT CCTGCCCGGC CAGCTACGTG ACTTCACCGC CCTGCAGGCC GCCGGCCGCC CGGCCCGGCT GACCGTCGGA CCGTGGGCGC ACAGCATGTC CGCCGGGCCC ATCAGGCTGG GCATGGAGGA ACTACTCGAC TTCGGCCTGG CCCACGCCCG CGCGGAGCAA CCGACCGACC GAGCCCCGGT CCGTCTGTTC GTGCAGGGCG CCGACGAATG GCGGGACTTC CAGTCCTGGC CCCCGGAAGG CTACCCACAG CAACGTCTGC ACCTGCAACC GGGCGGCGGC CTGGCGACAG CGAGCCCGGC GGACTCGCCC CCGGACAACT ACCGCTACGA CCCCGCCGAC CCGACTCCCG CCGTCGGTGG ATCGCGCTTC AACGTCAACA CCGGCAGCGT CGACAACACC GCTCTCGAGG CCCGGGCGGA CGTGCTGACC TTCACCACAC TGCCGCTGGA CCGTGAGGTC GAGGTGATCG GTGAGGTCGA CGCCGAGATC TGGTTCCGGT CCAGCCTGCC GTACGCGGAC GTGTTCGTCA GGGTCTGTGA CGTCAACACC GGCGACCGCT CCTACAACGT CACCGACGGC CTGACCAGTC TCACCGAGGC GGACCAGGAC ACCCGGGCGA GGGTCCGGCT CCCCGCCACC GCGTACCGGT TCAGGAAGGG CCACCGCATC CGTGTCCAGA TCTCCAGCGG CGCTTTTCCC CGGTACAACC GCAACCCCGG CACCGGAGAA CCCCGCGGCA GCGGACAACT CTCAACGCCG CCAGTCAGAC CATTTACCAC GACCCGGCCC GCCCGTCAGC GGTGA
Protein sequence | MTLNLISRLL QRTQQLPPPL TRDLTVARGL RVPMRDGVEL TADHWFPRAG AAGLPTVLIR TTYGSHSSAT YPIVRPIAER GFQVLITNSR GTFGSGGAFD PFRNERDDGF DTLDWVIGQP WFGDSIVLYG PSYLGYTQWA VADQVPPQVK AMIPIQSEAA VMLEFLRPDG FALEILFIWS FVVDGQESPL ALLRHPALGG RRKMRRLMAS LPLEQADLRG AGHRIDYLQN ILAHDAGSPH WAPADHSARV ADVTIPVSSI AGWHDFFLPG QLRDFTALQA AGRPARLTVG PWAHSMSAGP IRLGMEELLD FGLAHARAEQ PTDRAPVRLF VQGADEWRDF QSWPPEGYPQ QRLHLQPGGG LATASPADSP PDNYRYDPAD PTPAVGGSRF NVNTGSVDNT ALEARADVLT FTTLPLDREV EVIGEVDAEI WFRSSLPYAD VFVRVCDVNT GDRSYNVTDG LTSLTEADQD TRARVRLPAT AYRFRKGHRI RVQISSGAFP RYNRNPGTGE PRGSGQLSTP PVRPFTTTRP ARQR
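The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a block-formatted gene sequence might be cleaned, checked for GC content, and translated. It assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TGA, even though the gene lies on the minus strand) and it uses the standard codon assignments, which translation table 11 shares for elongation codons. Table 11 also permits alternative start codons, which this sketch does not handle. All function names are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding-orientation sequence, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
prefix = "ATGACGCTCA ACCTGATCAG CCGGCTGTTA"
print(translate(prefix))         # MTLNLISRLL
print(gc_percent("ATGACGCTCA"))  # 50.0
```

The translated prefix matches the first block of the protein sequence. Running `gc_percent` on the full gene sequence gives a value that can be compared with the GC content row of the record.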