Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3229 |
Symbol | |
ID | 5671604 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 3815700 |
End bp | 3817466 |
Gene Length | 1767 bp |
Protein Length | 588 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641242122 |
Product | X-Pro dipeptidyl-peptidase domain-containing protein |
Protein accession | YP_001507542 |
Protein GI | 158315034 |
COG category | [R] General function prediction only |
COG ID | [COG2936] Predicted acyl esterases |
TIGRFAM ID | [TIGR00976] putative hydrolase, CocE/NonD family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTTACC GAGTCGAGAA GAACGTCATG GTTCCGATGC GGGACGACGT GACGTTATGC ACGGATCTCT TCCTGCCGGA AGGCGGGCCC GCGCCGGCGC TCATAATCCG GATGGGCTAC AGCAAGGAGA TGTTCGAGAA GTTGTCCCTG CCGCTGATCC CGAACGTCCT CTCGCTGGTC GAGGCCGGCT ACGCGATCGT TTACCAGGAG TGCCGCGGCA CCTACGGCTC CGGGGGCGTC TTCCGGCCGC TCGTGGATGA CCCGGACGAC GGCGTCGACA CGCTCGAATG GACGGTGAAA CAGCCGTGGT GCGACGGGAA CGTCGGCAGC TACGGGCTGT CATACCACGG CATGACCCAG TGGGCGACGG CCTCACAGGC CCCCTCCGGG CTGAAGGCGA TGGCAGTGGC GGCCTCGACG ACGGACCTCT TCCGCGCCCC GTGGTACAGC GACGGCGGTG CCGTGTCCTG GCAGATGACT CTGGGCTGGG TGGCGGCCCA GATCGTCACG CTGGGCCAGT ACGCGCTCGA GCGCGGCACA GGTGACCTCG AGCCGCTGGT CGACGCGGGC GCGATGATGC TCGACCTGGA GCCGCACCTG CGCAAGCTCC CGATCACCGA TCAGCCAGCG CTGAACAAGC ATGCGCCCTG GTGGAAGGAA TGGTGGGAGC ACCCCACCCG CGACGAGTTC TGGACCGGCC TGGCGACGGC CGAACACACC CGGGACATGA CGACTCCGGC GCTGCACATC GTCGGCTGGT TCGACTTCTT CGCGCCCGAG GCGACGCGTG CCTACACCCG AATGCGTGCC CAGGCAGCCA CGCCACAGGC ACGGGAGGGC CAGCGGCTGA TCGTAGGCCC CTGGGACCAC ACCTACCAGG ATGCCGCCTA CCGGTCCCGC GAGTTCGGCC AGCTGGCCGG CGCACCGTAC GCCGACATCA CCGGCGCGCA CCTGCGGTTC TTCGACCGGC ACCTGCGCGG CAACAACAGC GCCGACGTCG GCGCAAGCCC AGTACGGATC TTCGTGATGG GTGTGGACCA GTGGCGGGAT GAGCAGGACT GGCCACTGCC CGACACCACC TACGTCGACT ACTACCTGGA CGGGCCCGGC CGCGCGAACA CCGCCGACGG CGACGGCGTG CTCACGACCG AGGCCCCCAC CACCGAGGCG GCCGAGTCCT ACCGCTTCGA CCCGCTCGAT CCGGTGCCGA CGCTGGGCGG CCGGCTCAAC CAGATGGGTT TCGGTTTCTC CGCTCTGTAC TCGGGGCCGG TCGACCAGCG CCCAGTCGAA GAGCGCAACG ACGTTCTGTG CTTCACCACG CCGGTGCTGG AGGAGCCGGT CGAGGTCACC GGGAACATAT CGCTGGTGCT GCATGCGTCG AGCTCCGCGC TGGACACCGA CTTCACCGGC AAGCTCGTCG ACGTCCACCC CGACGGCCGG GCGCTCTACC TGACCGACGG CATCCTGCGT GCCCGCTACC GCGAGTCGCT GGCCAACCCG AAGCCACTCG TGCCCGGCGA GGTCTACGAG CTCATCCTCG ACCTCGGCCT GACCTCCAAC GTCTTCCTGC CTGGCCACCG CATCCGGCTC GAGGTCTCCT CCAGCAACTT CCCGCGCTAC GACCGGAACA CAAATACCGG CAACGTGATC TCCTTCGACA CGGCCACCCC GGTCGTCGCG GGAAACCAGA TCCTCCACGG CCCGGCGCAT CCCAGCCGGC TCGTTCTGCC GATCATCCGG CGCCTGCGGA CCGACCGACA CGGCTGA
|
Protein sequence | MSYRVEKNVM VPMRDDVTLC TDLFLPEGGP APALIIRMGY SKEMFEKLSL PLIPNVLSLV EAGYAIVYQE CRGTYGSGGV FRPLVDDPDD GVDTLEWTVK QPWCDGNVGS YGLSYHGMTQ WATASQAPSG LKAMAVAAST TDLFRAPWYS DGGAVSWQMT LGWVAAQIVT LGQYALERGT GDLEPLVDAG AMMLDLEPHL RKLPITDQPA LNKHAPWWKE WWEHPTRDEF WTGLATAEHT RDMTTPALHI VGWFDFFAPE ATRAYTRMRA QAATPQAREG QRLIVGPWDH TYQDAAYRSR EFGQLAGAPY ADITGAHLRF FDRHLRGNNS ADVGASPVRI FVMGVDQWRD EQDWPLPDTT YVDYYLDGPG RANTADGDGV LTTEAPTTEA AESYRFDPLD PVPTLGGRLN QMGFGFSALY SGPVDQRPVE ERNDVLCFTT PVLEEPVEVT GNISLVLHAS SSALDTDFTG KLVDVHPDGR ALYLTDGILR ARYRESLANP KPLVPGEVYE LILDLGLTSN VFLPGHRIRL EVSSSNFPRY DRNTNTGNVI SFDTATPVVA GNQILHGPAH PSRLVLPIIR RLRTDRHG
|
| |