Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_1988 |
Symbol | |
ID | 3903696 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 2334887 |
End bp | 2335930 |
Gene Length | 1044 bp |
Protein Length | 347 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637879324 |
Product | peptidase S1 and S6, chymotrypsin/Hap |
Protein accession | YP_481091 |
Protein GI | 86740691 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.968187 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCCGGT TGACCGGCTG GATCAGTGGG ATCGTCGCCG CTGCCGCGAT GCTGGTGGGC GCCACCGGCT GCGGTACTGG CGGCGGGCCC GGCTCGCCGA CGGCGGACCG GGCCGCCGCC GCGGACAGAG CCGGGCTCGG CGCCGTTGCG GGCATCGTCT CCGACGTCGA GCCGTCCGTT GTCACCATCC TGGTCGGGAA CGAGCTGGGC AGCGGCATCG TCTACCGCGC CGACGGTGTC ATCGTCACCA ACCAGCATGT GATTGCCCAG GCATCCGGTG GAAAGGCCGA GGTGGCCTTC GCCGACGGCC GCCGGGTGGC CGGTCGGGTA CAGGCCGCCG ATGAGATCAG CGACATCGCC GTGGTGAAGG TGAACCGGAC CGGCCTACCC GCCGCGACGT TCCGTAAGGA TCTGCCCCAG GTGGGCGAGT TGGCGGTGGC GATCGGCAGC CCGCTGGGAT TCGAGAACAG CGTCACCGCC GGGATCATCT CCGGCGTGAA CCGGAACCTG CCCGTATCCG GTCAGCAGGG TGGGCAGGGG CGGCCGCTGG TGGACCTTAT CCAGACCGAT GCGGCGATCT CGCCCGGTAA CTCCGGCGGG GCGCTGCTGG ACAGCCAGGG CCGGGTCGTG GGCATCAACG AGGCCTACAT CCCACCGTCG ACCGGAGCCT CGTCGCTGGG CTTCGCGATA CCGTCCGCGA CCGCGGTTGA CGCCGTTGAG CAGTTGCTGC GCACCGGGAC CGTGAAGCAC GCCTTCGTTG GCGTCCAGCT CGCCACCCTC ACCTCGGCGA TCGCTGAGCG GCTCGGACTG GACGTGCGTG CCGGGGCGCT CGTGCTGGCC GTCGTGCGCG GCGGGCCCGC GGGCAAGGCA GGTGTTCTAC CCGGTGATGT CATCCGTAGC TTTAACGGCA AGTCGGTCGC CTCCGCCGGG GAGTTCTCCG CCAGGCTGCG CGAGGTATCC CCGGGTGACA TGGTGACGCT AGGCGTCCAC CGAGACGGCA GGGACCACAC AGTGCAGGTC AGGGTGTCCG ACCGGCCCGG CTGA
|
Protein sequence | MSRLTGWISG IVAAAAMLVG ATGCGTGGGP GSPTADRAAA ADRAGLGAVA GIVSDVEPSV VTILVGNELG SGIVYRADGV IVTNQHVIAQ ASGGKAEVAF ADGRRVAGRV QAADEISDIA VVKVNRTGLP AATFRKDLPQ VGELAVAIGS PLGFENSVTA GIISGVNRNL PVSGQQGGQG RPLVDLIQTD AAISPGNSGG ALLDSQGRVV GINEAYIPPS TGASSLGFAI PSATAVDAVE QLLRTGTVKH AFVGVQLATL TSAIAERLGL DVRAGALVLA VVRGGPAGKA GVLPGDVIRS FNGKSVASAG EFSARLREVS PGDMVTLGVH RDGRDHTVQV RVSDRPG
|
| |