Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0886 |
Symbol | |
ID | 5669300 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 1031357 |
End bp | 1034107 |
Gene Length | 2751 bp |
Protein Length | 916 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641239813 |
Product | peptidase S1 and S6 chymotrypsin/Hap |
Protein accession | YP_001505248 |
Protein GI | 158312740 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.421013 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGACACCA CCGGCGACGA CCAGACCACC GATGCCCGTC CTGGCACGGA CAAGGCAAAA GCCCCGTCAG TGCCCGTGCC CGCGCCCCGC TCGCGGGCGG CGGAGGGCAC CGTCCCGTCG TCAGGAGGCG TGAGCCGACC GGTCCCGGCG TCCCTGCCGC TGCACGGGCC GGAACGTCGC CCCGCGACGG AGCGTCGTCC CGGCCCCGAA GCTCGGGCGG ACACGCCGCG CCGCCCCGCG TCGGCGGCGG AACGGCCACC CTTCGTCGAG CCGGTCCGGG CCACCACACG CGGGAGCGTG CCGACCACAG GCCGCCCGTC CAGCACGGGG AGCTCGTCCA CGACGGGTAA CGCCGGGGCC ACCGGCGGCA CACGCGACGA GACGACCAGG CGCGTCGGCG GGAAACGGGA CGGTGCCGGC CCACGGGGCC CGGCCGCTCG CGCGACATCC GGCCGGGAAC CAGCCGATCG GGAGGCATCC AGCCGGGAGG CGCCCAGCCG GGCTACGGCC GCACGGGAGG CAGCCGCCGC GCCGAAGGGG CCGGCGGGCC GGCAGGGCGC CGTCCCGCCG GGGGCGTCGG ATGGTGCCGC TGCCGCGAAG ACTCCGCTCG TCCGGGAAGG CCGTGCCGCC CAGAACGGCT CGTCTTCTCA GAACGGTTCC GCTACCCGGG ATGCTGCCGG GCGGAATGGC TCGTCCTCTC AGAACGGCGC CGCCAGCCGG GACGCCGCCG CGCGGACTGG CTCCATGGGC CCCGAAAGCG TGGCCCGCAC GCCGATCGCG TCGCCTACCG GGGATCCGGG TCGGCGTGAC ACCGCGTCCC GCGCGGGCGC GCCGGAGCGT GACGGTGGCG GGCGTGGCGT TCAGGGCACG GGCGCCGCTT CCGGCCCGGG GCCCGCGCCA GGAGCGGGAT CGGCCCCTGG CGCGCCGGCT GCCCGGGGAG GCTCCGATCC GGGCAGCGCC GCGAAGGCGC GTGTGACGGC GCAGCCGCGG CCAGCGGTTG CGCGGGAGAG CTCGCGGCCC TCGGTTCCGC CACCGCAGGG CCCGCTGCCG TCGGCTTCGC GGGAGGCCGT CCCACGGGAG AGCAGCCGTG AGGCCGTCTC GCGGGAGGGC ACCCGGGGGA GCGTCTCGCG GGAGGTCAGC CGGGAGGCGG TCTCGCGGGA GAGTGCTCGG GGGAGTGTTC CGCGGGAGGT CAGCCGGGAG GCGGTCTCGC GGGAGACGGG CCGGGCACCG GTCTCGCGGG AGACCAGCCG TGAGGCCGTC TCCAGGGACG GTGCCCGGGG GAGCGTCCCG CGGGAGGGCC TCCGTGAGAC GACCTCGCGG GACGCCGACC AGCGCGGGGC CGGCCGGCCG TTGGCCGACC CGCTGGTGGA TCTGGTGGGG CCGCGGCGCG CCCCGCGCCC GCCCACGCCG CCGCCCTCGG AGCTTGCGCC ATCCGGTAGG TCTGCGCCCT CCTCGGCCAC GCCCGCCCCA TCCGCCCCAT CTGTACCAGC GACCTCGGCG GCTGGGACGT CGGGCGCTGG GAAATCGGCA CCCGCCTCCG CAGCAGGGGG GTCGACGTCT GGTGTGTCCA CGACCGATGC GGCCAGAGCA ACGGCCGACG CGGCGAGAGC CGCGAGCGCG GCGACGCACC ACCGCGCGGC CGCTCCGAAG ATCCGCCATG GCCGGCGGGC GCTCGTGATC ACTGCTGTGA TCGCCGGACT GCTGGGGGGA GCGACCGGTG GGTGGGTGAC GAGCCTGATA CTGGGTGACT CCGAGGGGAC GTCGTCGCCG ACAGCGTCCG CCGAGGCCGC GCCGACGGTC ATCGACCCCG GCTCGGTGGC CGGGGTCGTC GCGCGAGTGC TGCCGTCCGT TGTGACCATC GACGTGACGG CGGGGGCCGA GGGCGGGAAC GGCTCCGGGG TGATCATTCG GTCTGAGGGC TACGTACTCA CCAACAACCA TGTCATCGCG CCGGCGGCGA ACGCGGGTGG CCAGGTAATG ATCACGATGA GTGATGGCGC CGAGCCCGTG CTCGCCGAGA TCGCCGGACG GGACGCCTCT TCAGATCTTG CGGTGCTGCG CATCCCCGGG GCCTCCGGCC TGCCGGCGGC GACGCTGGGA CGGTCCGGTT CGCTGGTCGC CGGCGCTCCG GTGATCGCGA TCGGTGCGCC CTTCGGACTC TCGGGGACGG TCACCACGGG GATCGTCAGC GCGCTCGACC GGAACCCGAC CGTGCCCGCC GAGGGCGGCG GGGCGTCCGT GATCATCGGA GCGATCCAGA TCGACGCGGC GATCAATCCC GGGAACTCCG GTGGCCCGCT GCTCGACGCC CGTGGCCAGG TCGTCGGCCT GAACACGGCG ATCGCGACGG CGCCGGGCGG GCAGGCGCCG TCGGGCAGCG TCGGCGTCGG GTTCGCGATC CCCATCGACT ACGCCGCGTC GGTGGCGGAC GAGATCATCC GCACCGGGCG GGCCACCCAC CCCTACACCG GAGTGTCGGC CGCGACGGTC ACCGCCGCCG AGGCCCGGGC GCGCGGCACC ACCCCGGGCG CGATCATCCG TGACGTCGAG CCGGCGGGCC CCGCGGCCGC GGCCGGGCTG CTGCCGGGCG ACATCATCAC CCGGGTCGAC GACACGGTCG TCACCAGCAC GAACGATCTC ACCGCGGCCA CCCGGCTGCA CCACGTCGGC GACACGGTGA CCGTGACCTT CCAGCGCAAC GGAGTGGAGA GCACAGCGCG GGTGGTCCTC CAGGAACAGT CGCCCGGCTG A
|
Protein sequence | MDTTGDDQTT DARPGTDKAK APSVPVPAPR SRAAEGTVPS SGGVSRPVPA SLPLHGPERR PATERRPGPE ARADTPRRPA SAAERPPFVE PVRATTRGSV PTTGRPSSTG SSSTTGNAGA TGGTRDETTR RVGGKRDGAG PRGPAARATS GREPADREAS SREAPSRATA AREAAAAPKG PAGRQGAVPP GASDGAAAAK TPLVREGRAA QNGSSSQNGS ATRDAAGRNG SSSQNGAASR DAAARTGSMG PESVARTPIA SPTGDPGRRD TASRAGAPER DGGGRGVQGT GAASGPGPAP GAGSAPGAPA ARGGSDPGSA AKARVTAQPR PAVARESSRP SVPPPQGPLP SASREAVPRE SSREAVSREG TRGSVSREVS REAVSRESAR GSVPREVSRE AVSRETGRAP VSRETSREAV SRDGARGSVP REGLRETTSR DADQRGAGRP LADPLVDLVG PRRAPRPPTP PPSELAPSGR SAPSSATPAP SAPSVPATSA AGTSGAGKSA PASAAGGSTS GVSTTDAARA TADAARAASA ATHHRAAAPK IRHGRRALVI TAVIAGLLGG ATGGWVTSLI LGDSEGTSSP TASAEAAPTV IDPGSVAGVV ARVLPSVVTI DVTAGAEGGN GSGVIIRSEG YVLTNNHVIA PAANAGGQVM ITMSDGAEPV LAEIAGRDAS SDLAVLRIPG ASGLPAATLG RSGSLVAGAP VIAIGAPFGL SGTVTTGIVS ALDRNPTVPA EGGGASVIIG AIQIDAAINP GNSGGPLLDA RGQVVGLNTA IATAPGGQAP SGSVGVGFAI PIDYAASVAD EIIRTGRATH PYTGVSAATV TAAEARARGT TPGAIIRDVE PAGPAAAAGL LPGDIITRVD DTVVTSTNDL TAATRLHHVG DTVTVTFQRN GVESTARVVL QEQSPG
|
| |