Gene Information |
Locus tag | Franean1_0787 |
Symbol | |
ID | 5669203 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 913228 |
End bp | 914904 |
Gene Length | 1677 bp |
Protein Length | 558 aa |
Translation table | 11 |
GC content | 79% |
IMG OID | 641239715 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_001505151 |
Protein GI | 158312643 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
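The length fields in the record above can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions on the replicon and that the CDS length includes the stop codon; both assumptions are consistent with the values listed here but are not stated by the record itself. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Protein length, assuming the CDS length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(913228, 914904)
print(length)                     # 1677
print(protein_length_aa(length))  # 558
```

Both results agree with the Gene Length (1677 bp) and Protein Length (558 aa) fields of the record.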
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.252141 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGGTCACC GAGGCGCCGC CGGCGGGGCC CGGGCCGGGC GGCACCCGTC GGCCGGAGTC ACCGGCGGGG TGCTGCTCGC GGTGCTGTCG GCGGTGCTGC TACCCGGCGC CGCGGCCGGC ACGGCGGCCG CCGCGACGCC CCAGCCGGCG GCCCCGGGTG CGAGCGCCAG GCCGGCCCCC CGGACAACCC CCGCGCCGAC GGCCTCGGCA CCGGGCCGCT CCCCGTCGGG CGGGCTGCCA TCCAGCGGGT TCTCGTCCGC CGACGCGCAG TGGTACCACG AGAGCCTCCG GCTCGCCGAG GCCCACCGGG TGAGCCGCGG CAGGGGCATC ATCGTGGCGA TCATCGACGG CGGGGTGGAC GCCACCCATC CGAAGCTGCG CGGGCAGCTC CTCTCCGGCG CCGGGGTCGG TGCGGACGCC GCTCTCGACG GGCTGCGCGA CGACGACCCC GACGGCCACG GCACGGCCAT GGCCGGGCTG GTCGCCGCCC GCGGTGACGT CGGTGACCCG GCGGTGTGGG GGGTCGCGCC CGAGGCGAAG ATCCTGCCGA TCTCCACCGG CGAGGAGGCC GACTCCGAGG AGGTCGCCCG TTCGGTGCGG ATCGCCGTCG ACCGGGGGGC GGGCGTGATC AGCATGTCCC TCGGCTCGGT GGGGCGGGCG ACGGGCGCGG AGGAGAGCGC CGTCCGCTAC GCGCTCGCGA ACGACGTGGT GGTCGTGGCC TCGGCCGGCA ACACCGAGCC GGGCGACACC GAGGTGAACT CCCCGGCGAA CATCCCGGGC GTGATCGCCG TGACCGGATC GGACTACCGC GGGATGTTCT GGGGCGGCTC GGTCCAGGGG CCGGAGGCCG TCCTGGCCGC GCCCGGACCG GGCATCCGGG CCCCGGTGCC GACCAGGGTG TCGCCCGACG GCCTGGACAC CGGAGGCGGC ACCAGCAACT CGGCGGCGAT CGTCGCCGGG GTGGCCGCGC TCGTCCGCGC CGCGAAGCCC GGCCTGGACG CACCCAACGT CATCAACCGC CTGATCCGCA CCGCGCTCGA CATGGGCCCG GTGGGGCGCG ACAGCCAGTA TGGCTTCGGG CTGGTCGAGC CGGTGGCCGC GCTGACCGCC GAGCTCCCCC TCGTGGACGC GAACCCACTG CTCACCGCCC CGATCCCACG CACCGGCAGC AGCGCGGACG GCGGCAGCGG GGCCGGGGGC GGCGCCACCC CCGACGGGGC CATCCCGGCC CTGCCGACAC CGCCGCCGCC CGCGACCGCG GCTCCCCCCG TGGGCGCGGC CGGAGCCGGC CCGGACGGGG GTGGGGACGA CCCGTCCGTG CTGACCTGGG TCGCCGGGCT CAGCCTCGCG GCGTCGGCGG GCGTCCTGCT CGGCGCGCTG GCGTACGTGC TGGCCGGCGC CCGGTTCGCC GCGACGCTGG GCCGCCGGGG CCGGGCCACG GCCCATCTGG TCACGGAGCA GGGTGCTGGC CCGCCCGCGA CGGCGCCCGT CCCGCCGGGC CCCCCGCCCG GTTGGACCAC GCAGCCCGGT TGGGCGGCTG GGCCGACCCG CGCCGCACCA CCCGACCGGA CAGGGCAGCC CGTCCCGCCG CACCCGGCCG GGCCGGCGGC GGGCGGCCCG CGCACCCCCG GCGGGGGCGT TCCCGTGGAC ACCCGCGGGT GGCGGACACC GCACTGA
Protein sequence | MGHRGAAGGA RAGRHPSAGV TGGVLLAVLS AVLLPGAAAG TAAAATPQPA APGASARPAP RTTPAPTASA PGRSPSGGLP SSGFSSADAQ WYHESLRLAE AHRVSRGRGI IVAIIDGGVD ATHPKLRGQL LSGAGVGADA ALDGLRDDDP DGHGTAMAGL VAARGDVGDP AVWGVAPEAK ILPISTGEEA DSEEVARSVR IAVDRGAGVI SMSLGSVGRA TGAEESAVRY ALANDVVVVA SAGNTEPGDT EVNSPANIPG VIAVTGSDYR GMFWGGSVQG PEAVLAAPGP GIRAPVPTRV SPDGLDTGGG TSNSAAIVAG VAALVRAAKP GLDAPNVINR LIRTALDMGP VGRDSQYGFG LVEPVAALTA ELPLVDANPL LTAPIPRTGS SADGGSGAGG GATPDGAIPA LPTPPPPATA APPVGAAGAG PDGGGDDPSV LTWVAGLSLA ASAGVLLGAL AYVLAGARFA ATLGRRGRAT AHLVTEQGAG PPATAPVPPG PPPGWTTQPG WAAGPTRAAP PDRTGQPVPP HPAGPAAGGP RTPGGGVPVD TRGWRTPH
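The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned and its GC fraction recomputed for comparison with the GC content field; the helper names are hypothetical, and only the first two blocks of the gene sequence are used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace that separates the display blocks; uppercase."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


def has_start_and_stop(seq: str) -> bool:
    """Check for an ATG start codon and a TAA, TAG, or TGA stop codon."""
    return seq.startswith("ATG") and seq[-3:] in {"TAA", "TAG", "TGA"}


# First two blocks of the gene sequence shown above (sample input only).
fragment = clean_sequence("ATGGGTCACC GAGGCGCCGC")
print(fragment)              # ATGGGTCACCGAGGCGCCGC
print(gc_content(fragment))  # 0.75
```

Passing the complete Gene sequence field through `clean_sequence` and then `gc_content` gives a value that can be compared with the rounded percentage in the record. `has_start_and_stop` is a quick sanity check that the displayed sequence is the coding strand, which matters here because the gene lies on the minus strand of the replicon.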