Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0866 |
Symbol | |
ID | 5669280 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 1012199 |
End bp | 1013230 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641239793 |
Product | peptidase S1 and S6 chymotrypsin/Hap |
Protein accession | YP_001505228 |
Protein GI | 158312720 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0862974 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.586989 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGCAG GCGCCACCGC CAACCCGGGC CCCGAGGACC AGATCGCACT CCCCGACGGG ACCGCGCCGT CCGATGATCG TGGTGAGGCG GCGCTGGACG CATACTCCCG TGTGGTGACC CGGGTGGCCG AGCGGGTCGG GCCGAGCGTG GGAAGCCTGC GGGTGCGCAC CTCGCGCGGC GCCGGAGCCG GTTCCGCGGT GCTCTTCACC GAGGACGGGT TCCTGCTCAC CTCCGCGCAT GTCGTCGAGG GACGTCCAGG CACCGGCCGG GCCGGCGAGC CGGCTGGCAC GGTGGAGTTC GTCGACGGGA CCGAGCGTGC CGTGGACCTG GTGGGAGCCG ACCCGCTCTC GGACCTCGCC GTGCTGCGGG CCCGCGGCTC CACACCGCGC CCGGCCGAGC TCGGCGACGC GGCGATGCTG CGGGTCGGCC AACTCGTCGT CGCCGTCGGC AATCCGCTGG GACTGACCGG GAGCGTCACC GCCGGCGTGG TCAGCGCCCT GAACCGCTCG CTGCCGACCC GCTCCGGTTC GGCCGTGCGC GTCGTGGACG AGGTCATCCA GACGGACGCC GCGCTGAACC CGGGAAACTC CGGCGGGGCA CTGGTCACCG CCGACGGCCG CGTGGTCGGG GTGAACACCG CCGTCGCCGG CGTCGGACTC GGCCTGGCCG TCCCCGTGAA CGCCACCACC AGGCGCATCC TCGCGGCACT GATCCGGGAT GGCCGAGTCC GCCGCGCCTA CCTCGGTGTC GCCGGTGCCC GGGTGCCGCT CCCGCCGGCC CTGGCCGAGC GGACGGGGCA ACGCCACGGC GTGCGCCTCG CGGAGGTGGT ACAGGGTAGC CCAGCCGGGC AGGCCGGACT GTTCACCGAC GATCTCGTCC TGTCGATCGC CGGGACGCCT GTCGCCGGCC CGGGCGATCT CCAGCGACTG CTGACCGAGG ACACCATCGG ACAACCCGTC GAAATGACAG TCTGGCGTCG CGGTGCCCTC GTGGACGTCA TAGCCGTGCC CCGGGAACTC GTAACCCCGT AG
|
Protein sequence | MDAGATANPG PEDQIALPDG TAPSDDRGEA ALDAYSRVVT RVAERVGPSV GSLRVRTSRG AGAGSAVLFT EDGFLLTSAH VVEGRPGTGR AGEPAGTVEF VDGTERAVDL VGADPLSDLA VLRARGSTPR PAELGDAAML RVGQLVVAVG NPLGLTGSVT AGVVSALNRS LPTRSGSAVR VVDEVIQTDA ALNPGNSGGA LVTADGRVVG VNTAVAGVGL GLAVPVNATT RRILAALIRD GRVRRAYLGV AGARVPLPPA LAERTGQRHG VRLAEVVQGS PAGQAGLFTD DLVLSIAGTP VAGPGDLQRL LTEDTIGQPV EMTVWRRGAL VDVIAVPREL VTP
|
| |