Gene Information |
Locus tag | Franean1_0323 |
Symbol | |
ID | 5668747 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 387501 |
End bp | 388682 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641239254 |
Product | peptidase S1 and S6 chymotrypsin/Hap |
Protein accession | YP_001504695 |
Protein GI | 158312187 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
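
The coordinate, length, and protein-length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes the stop codon. Neither convention is stated in the record, though both are consistent with the reported numbers. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values copied from the record for Franean1_0323.
start_bp, end_bp = 387501, 388682

length = gene_length_bp(start_bp, end_bp)
print(length)                              # 1182
print(expected_protein_length_aa(length))  # 393
```

Both results agree with the Gene Length (1182 bp) and Protein Length (393 aa) fields.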
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0613736 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACCTTC TCGATCTCGT CCTGATCGTC CTGGTCGTGA TGTTCGCGGT CTCCGGCTAC CGCCAGGGGT TCGCGGTGGG GGCGCTGTCG TTCGTCGGTT TCCTCGGCGG CGGTGTGCTC GGCGCCAAGG TGGCGCGCCC CTTCGCCGAG CTCATCGGCC GCGAGGAGGA CGGCGCCCTG GTCGGCCTCA TCGTCGTGGT GGGCCTGGCC CTCGTCGGGC AGATCGCGGG CACCGCGCTC GGAGCGGCCC TGCGCGGCCG GCTCACCTGG CGGCCCGGCC AGCGGCTGGA CGCGGTCGCC GGCGCCGTGC TCTCCGGGAT GTCCGTGCTG CTGGTCGCCT GGCTCGTCGC GACCGCGGTC GACCGCTCGC CGTTCCAGAC CCTCGCCCGT GCCGCCCGCG GATCCGAGGT GCTGGGCACC GTCGACACGA CGATGCCCGA CGACGTCCGC CACACCTTCG CCGACCTGCG CCGGCTCATG GACGACCAGG GCTTTCCCGA GGTGTTCGCC GGCCTGGACG GCGAGCGCAT CGTGGCCACC GATCCACCGG ACCCTGCGGT CGTCAGCGCG GCGGACGTCC AGAACGCCGC GGCGAGCATC CTCAAGGTGC GGGGGCAGGC GCCGTCGTGC GGGAAGCAGG TCGAGGGCAC CGGCTTCGTC ATCGCGCCCC AGCGGGTGAT GACGAACGCG CACGTCGTGG CCGGTGTCAC CGAGGCCGTC GTGGAGCTGG ACTCCGGGTC GCTGCCGGCC GAGGTGGTGC TGTTCGACCC CGACCGGGAT GTGGCCGTCC TGCACGTGCC GGGGCTGCTC CGGCCCCCGC TGCGCTTCCA GTCCGCGCCG CCCGGTGACA TGGGCGACTC GGCGGTCGTC GCCGGCTACC CCCAGGACGG GCCATACACC ACCGTGCCCG CGCGCATCCG CAACGAGCAG GTCGCGCGGG CGCCGGACAT CTACTCCCGC GGCACGGTTC GCCGCGAGAT CTACGCGATC CGCGGGCGGG TGCGGCCGGG CAACTCCGGC GGGCCGCTGC TGTCGAGCGC CGGGACGGTC TACGGCGTGG TCTTCGCCGC GGCCACCGAC GACAACGACA CCGGGTACGT GCTGACTGCC GACGAGGTCA GCGAGCCCGC GCGGCAGGGA GCGCAGGCCC TCGTGCCGGT GAGCACCCAG AGCTGCGACT GA
|
Protein sequence | MNLLDLVLIV LVVMFAVSGY RQGFAVGALS FVGFLGGGVL GAKVARPFAE LIGREEDGAL VGLIVVVGLA LVGQIAGTAL GAALRGRLTW RPGQRLDAVA GAVLSGMSVL LVAWLVATAV DRSPFQTLAR AARGSEVLGT VDTTMPDDVR HTFADLRRLM DDQGFPEVFA GLDGERIVAT DPPDPAVVSA ADVQNAAASI LKVRGQAPSC GKQVEGTGFV IAPQRVMTNA HVVAGVTEAV VELDSGSLPA EVVLFDPDRD VAVLHVPGLL RPPLRFQSAP PGDMGDSAVV AGYPQDGPYT TVPARIRNEQ VARAPDIYSR GTVRREIYAI RGRVRPGNSG GPLLSSAGTV YGVVFAAATD DNDTGYVLTA DEVSEPARQG AQALVPVSTQ SCD
|
| |
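
The gene and protein sequences above can be related with a short script. This is a minimal sketch, not the database's own pipeline. It assumes the gene sequence is listed in coding orientation (it begins with GTG and ends with TGA, although the gene lies on the minus strand) and that the 10-base groups are display spacing only. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code, but accepts alternative start codons such as GTG, which are read as methionine at the first position. The helper names `gc_fraction` and `translate_cds` are hypothetical. The start-codon set below is a simplification, since table 11 lists further initiators.

```python
# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Simplified initiator set (assumption): the most common bacterial starts.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spacing used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(dna)
    return sum(base in "GC" for base in seq) / len(seq)


def translate_cds(dna: str) -> str:
    """Translate a CDS; an initiator at position 0 is read as M."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = "M" if i == 0 and codon in START_CODONS else CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence as listed in the record.
print(translate_cds("GTGAACCTTC TCGATCTCGT CCTGATCGTC"))  # MNLLDLVLIV
```

The printed peptide matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` and `translate_cds` gives values that can be compared with the GC content and Protein sequence fields.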