Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_2633 |
Symbol | |
ID | 3906306 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 3107056 |
End bp | 3108273 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 637879958 |
Product | peptidase M50 |
Protein accession | YP_481724 |
Protein GI | 86741324 |
COG category | [R] General function prediction only |
COG ID | [COG1994] Zn-dependent proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.962625 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.303399 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGATC AGCACGGCGG CGCGGGTTCG GGCGGCGCGG GTTCGAGCGG CGCGGGTTCG AGCGCCCACG GCGGCGCGGG CCCGGCGGGG CAGGCTCCGA GTGAGCGACC CCCCGGCGTG CCGGTCGGGC GGATCCGCGG TGTTCCCATC GTCATCTCGC CGTTCGCCCT CGTCTTCGCC GTCCTCGTCG CCTACCTGCT CTCCGGCTCG ATCCGCGACC GGCTGCCGCT GGCGTCCGAT GGGCGGATCC TCGCGCTCTC CTCGTTGATC TCTATCGGCT TCCTGGCCTC CCTGCTCGCC CACGAGATCG GTCACGCTCT GACGGCGCTG GCGTTCGGTC ACACCGTGCG GTCCGTGACC CTGCACGGCT TCGCCGGGTT CACCGAGTTC GAGCCGGAGC CCCGCAGCGC CGGCCGCGAG TTCCTGATCG CCTTCGTCGG CCCGGCGGTC AACGGGGTGC TGGCCGCCGG CTGTCACCTC GGCCTGCTCG GCCTCGACGA CACCAGCGAC GCCGCCGCGG TCCTGCACGA TCTCGGACTC ATCAACGCCG CGTTGTTCCT CTTCAACCTG GCACCGGGCC TCCCGCTGGA CGGTGGACGG GTGGTCGTCG CCGCGGTGTG GGGTCTGACG CGCGACAAGC TGCGGGGGCT GCGGGCCGGT GCCTACGGCG GGTTCGTCGT CGCCGCCGGC CTGGTCGTCT GGGGTGCGTC GACCTCCGAC GGCATCGGCA TGGTGTACAC CTATGCTCTG GCGGGCTTCC TCGCGTTCGC GGCCTACCAG TCGCTGCGCG CCGCGCAGGT GCGGGAGCGG CTGCCCGGCC TGTGCGCCGG TCGCCTTGCG CGCCGGACGC TGCCCGTCGA GGGTGCCGTT CCGCTGGCGG AGGCGCTGCG GCGAGCTCAG GAGGTCGGCG CCACCGCCGT CGCGGTGATC GACCGTGACG GCAGCCCCCT GAAGATTATG AATGGCTCCG CGGTCGACGC GCTGCCGGAG CATCGGAGAC CCTGGATGAC CGTGGATGAA GTGAGTCGGG TGATCTCGCC CGGCATGGTC CTCGACGCCG ATCTGGAAGG CGAGGCCCTG CTGGCGGCCG TGCAGCGGGT GCCGGCGTCG GAGTACCTCG TCAAGCAGGC GGGCCGCCCG GTCGGCGTGC TCGCGATGGT GGATCTCGTC GCCCGTATCG ATCCGGCCGC CGCCGCCCGC ATGGTGGCGT CCCGGTGA
|
Protein sequence | MADQHGGAGS GGAGSSGAGS SAHGGAGPAG QAPSERPPGV PVGRIRGVPI VISPFALVFA VLVAYLLSGS IRDRLPLASD GRILALSSLI SIGFLASLLA HEIGHALTAL AFGHTVRSVT LHGFAGFTEF EPEPRSAGRE FLIAFVGPAV NGVLAAGCHL GLLGLDDTSD AAAVLHDLGL INAALFLFNL APGLPLDGGR VVVAAVWGLT RDKLRGLRAG AYGGFVVAAG LVVWGASTSD GIGMVYTYAL AGFLAFAAYQ SLRAAQVRER LPGLCAGRLA RRTLPVEGAV PLAEALRRAQ EVGATAVAVI DRDGSPLKIM NGSAVDALPE HRRPWMTVDE VSRVISPGMV LDADLEGEAL LAAVQRVPAS EYLVKQAGRP VGVLAMVDLV ARIDPAAAAR MVASR
|
| |