Gene Information |
Locus tag | Franean1_3241 |
Symbol | |
ID | 5671616 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 3830032 |
End bp | 3831795 |
Gene Length | 1764 bp |
Protein Length | 587 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641242134 |
Product | X-Pro dipeptidyl-peptidase domain-containing protein |
Protein accession | YP_001507554 |
Protein GI | 158315046 |
COG category | [R] General function prediction only |
COG ID | [COG2936] Predicted acyl esterases |
TIGRFAM ID | [TIGR00976] putative hydrolase, CocE/NonD family |
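
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is an illustration only: the function name `check_record` and its parameter names are hypothetical and not part of the database. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length.

```python
def check_record(start_bp: int, end_bp: int, gene_length_bp: int,
                 protein_length_aa: int) -> dict:
    """Cross-check the coordinate and length fields of a CDS record.

    Assumes inclusive 1-based coordinates and one trailing stop codon
    that contributes to the gene length but not to the protein length.
    """
    span = end_bp - start_bp + 1      # inclusive span on the replicon
    codons = gene_length_bp // 3      # whole codons, stop codon included
    return {
        "span_matches_length": span == gene_length_bp,
        "length_is_multiple_of_3": gene_length_bp % 3 == 0,
        "protein_length_matches": codons - 1 == protein_length_aa,
    }


# Values copied from the Gene Information fields above.
result = check_record(start_bp=3830032, end_bp=3831795,
                      gene_length_bp=1764, protein_length_aa=587)
print(result)
```

With the values from this record, all three checks hold: 3831795 - 3830032 + 1 = 1764 bp, and 1764 / 3 = 588 codons, one of which is the stop codon, leaving 587 aa.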
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.972915 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGATCGCCG AAGAGGAGCT GATCGTCGAG AAGGGCGTCA TGGTTCCGAT GCGCGACGGT GTGCGCCTGT ACACGGACAT CTACCGGCCG GCCGGTCCCG GGCCGTTTCC TGTGCTTGTG TCCCGTACCG CCTACTGGCT CAACGGCGGG GTCACCCAGG GCCTGAGCGG TTTCGCCAAA CTGGTCGCCC GGCAGGGCTA TGTGGCCGTG TTCCAGCAGA GTCGCGGCCG GTTCGCTTCC GAGGGTGTGT TCCACCCGGC GCTGTGCGAC GTGGACGACG GGTACGACGT GGTGGAGTGG GCGGCCGCCC AGCCGTGGTC CACCGGCAAG GTCGGTATGT TCGGCGGGTC GTACCAGGGG CTCACCCAGT GGGCGGCCGC CATCGCCCGG CCACCGCATC TCGCGTGCAT CGCGCCGTTG ACGTCCACCT GGAACACCTT CGGCAATGAG ATCTGGTACG CCGCGCCGGG CGTCCTGTCG CTGGGCAGCG CGTTCGCGTG GGCGTGGGGC GCGGTGCTCG GCGAGGCCGA GCGCCGGGGC GTTCCGGCGC CGGAGGGCGC GATCCACCAC GGTGAGGAGG GCAACGAGCC CGGGGACGTC GCGGAGGCCA TCGCGAAGCG CACCGTGGCG ATGATGGAGA TGTACGCCTT CCGCCCGCTG CGCGACGCGC CCCAGCTCGA GCTGGTGTCG TGGTGGAAGG ACTGGTGCGA CAACGGCGAC CCGAACGACC CGTACTGGCT GGTGGTCAAC GCCTCGGAGC ACGCCGTCGA CCTCGACCTG CCGATCTTCC AGGCGAGTGG CTGGTACGAC ATGTTCCTCA ACGGGACGCT CGAAGCCTTC CAGGCGCTGC GCGGCGCGGG CGCCACCCAG TACGTCCGGG ACAACCAGGA GTTGGTCATC GGACCGTGGA ACCACGGCGG CGTGTGCCCG CCCCGTCCGG ACGCACCGGC GGACACCGGG CCCCTGGGGC TCTGGGACCT CTCCGAGGGT TCCGCCTGCA TGGAGTTCTT CCGCCGTCAC CTGAAGGGCG AGCAGGTCCT GGACCCCGCG CCGGTCCGCC TGTTCGTCAT GGGCGAGAAC ATCTGGCGCG ACGAGCGGGA ATGGCCGCTG GCCCGCACCC GCTGGACGCC CTACTACCTG CACAGTGCCG GCGGGGCCAA CACCGCCGCG GGCGACGGCT CGCTGTCCAC GGAGCGTCCC GGTGACCAGC CGGACGACGT CTTCGTCTAC GACCCGCAGA ACCCGGTCGT GTCGCAGGGC CGGCTGGAGT GCTACGCCCC CGACCACGGC GCCGAGACCG CCCGCAACGA GAGCCGCGAC GACGTTCTCG TCTACACCAC GTCACCGCTC GAGCATGACC TGGAGGTCAC CGGCCCGGTG ACGCTCGAGC TCTGGGCGTC GTCGTCCGTC TCCGACACCG ACTTCACCGC GAAGCTCGTC GATGTCTTCC CGGACGGCGC GGCCATACCG CTCGCCGAGG GCGTGGTGCG GACAGGCGTG GCGTTCACGC AGCCACCGCG GCCCGAGACG CCGCGTCGTT ACCGAATCAG TCTCTGGGCG ACGAGCAACG TCTTCAGATC CGGTCACCGT ATCCGGCTCG ACGTCTCGTC GAGCGCGTTC CCGGAGTACG AGCTGAACCC GAATACCGGT CAGCGGATCA CCCACGACGC CACCGGGAAG ACGGTGCCCG CGACGCAGCG GGTGCACCAT GACCGCCGCT TTCCTTCACG CCTTGTCCTG CCGGTGATCC CGCGGCCCGC GTAG
Protein sequence | MIAEEELIVE KGVMVPMRDG VRLYTDIYRP AGPGPFPVLV SRTAYWLNGG VTQGLSGFAK LVARQGYVAV FQQSRGRFAS EGVFHPALCD VDDGYDVVEW AAAQPWSTGK VGMFGGSYQG LTQWAAAIAR PPHLACIAPL TSTWNTFGNE IWYAAPGVLS LGSAFAWAWG AVLGEAERRG VPAPEGAIHH GEEGNEPGDV AEAIAKRTVA MMEMYAFRPL RDAPQLELVS WWKDWCDNGD PNDPYWLVVN ASEHAVDLDL PIFQASGWYD MFLNGTLEAF QALRGAGATQ YVRDNQELVI GPWNHGGVCP PRPDAPADTG PLGLWDLSEG SACMEFFRRH LKGEQVLDPA PVRLFVMGEN IWRDEREWPL ARTRWTPYYL HSAGGANTAA GDGSLSTERP GDQPDDVFVY DPQNPVVSQG RLECYAPDHG AETARNESRD DVLVYTTSPL EHDLEVTGPV TLELWASSSV SDTDFTAKLV DVFPDGAAIP LAEGVVRTGV AFTQPPRPET PRRYRISLWA TSNVFRSGHR IRLDVSSSAF PEYELNPNTG QRITHDATGK TVPATQRVHH DRRFPSRLVL PVIPRPA
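
The sequences above are printed in space-separated blocks of ten characters, so the spaces need to be removed before any computation. The sketch below shows one way to do that and to compute a GC percentage. The helper names `clean` and `gc_percent` are hypothetical, and only the first and last blocks of the gene sequence are used as sample input, so the GC figure it prints is for that fragment and not for the whole gene. Under translation table 11, TTG is an accepted alternative start codon and is read as methionine at initiation, which is consistent with the leading M of the protein sequence.

```python
def clean(seq: str) -> str:
    """Remove the grouping whitespace used in the record and uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a (possibly space-grouped) sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


# First two and last two blocks of the gene sequence from the record above.
head = "TTGATCGCCG AAGAGGAGCT"
tail = "CGCGGCCCGC GTAG"

print(clean(head)[:3])    # start codon of this CDS
print(clean(tail)[-3:])   # stop codon of this CDS
print(gc_percent(head))   # GC percentage of the 20-base fragment only
```

Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field of the record.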