Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_1623 |
Symbol | |
ID | 3905902 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 1950423 |
End bp | 1951550 |
Gene Length | 1128 bp |
Protein Length | 375 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637878961 |
Product | peptidase M4, thermolysin |
Protein accession | YP_480728 |
Protein GI | 86740328 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3227] Zinc metalloprotease (elastase) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCTGCC CCCGCCACGC CCTGCCCTGC GCGGTCCCAC CCCACATCCT CGAACGCATT GTCCGCAACG GCACCGAGGA GCAGCGTTCC CGGGCCCTGT CCACCCTGCT CCAGGACACC TCTCATCGCA CCATCCGGGT GCACAACGCC CTGATGCGCT CGACGCGGCG CGGTGCCGTC CCACCGCGCC CGGCGCCCGA GACCGGCCCG CGGCGTACCG TCAGCGACGC GGGCGGCACC GAGATCCTGC CGGGTGCGAT GGTTCGCGAG GAGGGTGGCG CCGCGATCGA CGACGCGGCG GTCAACGAGG CCTACGACGG CCTGGGCTCG ACGTTCGCGT TCTACTCCGA CGTCCTAGGC CGGAGCTCCA TCGATGACGA GGGGATGGCT CTGCTCGCCA CCGTGCACTA CGGCGACCAC TACGAGAACG CCTTCTGGAA CGGCCGCCAG ATGGTGTTCG GCGACGGCGA CGGCGAGCTG TTCAAGCGTT TCACGGCGTC GCTCGACATC ATCGGTCATG AACTGACCCA TGGCGTCACC GAGGACGAGG CGGCGCTGAT GTATGTGAAC CAGTCCGGCG CGCTCAATGA GTCGATCAGC GACGTCTTCG GTTCCCTGGT GAAGCAGTAC GTCCGCGGCC AGACCGCCGA GCAGGCCGAC TGGCTGATCG GCGACGAGCT GCTCACAGAC GCCGTCCAAG GCGTCGCGCT GCGGTCGATG AAGGCCCCCG GGACCGCCTA CGACGATCCG GTACTCGGCG ACGACATCCA GCCGGACCAC ATGGACCGCT ACGTCCGGAT GACCGCCGAC AACGGCGGCG TCCACATCAA CTCGGGCATC CCGAACAAGG CGTTCTACCT CGCCGCGACG GCTCTCGGCG GATACGCCTG GGAGAAGGCC GGCCGCATCT GGTACGAGAC CCTGCGCGCA CCGCAGATCC GGCCGAACAC GACGTTCCGT GCATTCGCCT CGGTGACCGT GCACCAGGCC GGCCTGCTGT TCGGTGCGGA CGCGCGGAAG GCCGTCTCCG AGGCGTGGCA GGCCGTCGGC ATCGTCGTCC GCGCAGCCGG GCAGGTCAGC GCAGCCGGGC AGGTCAGCGC AGCCGGGCAG GTCAGCGCAG CCGGGTAA
|
Protein sequence | MPCPRHALPC AVPPHILERI VRNGTEEQRS RALSTLLQDT SHRTIRVHNA LMRSTRRGAV PPRPAPETGP RRTVSDAGGT EILPGAMVRE EGGAAIDDAA VNEAYDGLGS TFAFYSDVLG RSSIDDEGMA LLATVHYGDH YENAFWNGRQ MVFGDGDGEL FKRFTASLDI IGHELTHGVT EDEAALMYVN QSGALNESIS DVFGSLVKQY VRGQTAEQAD WLIGDELLTD AVQGVALRSM KAPGTAYDDP VLGDDIQPDH MDRYVRMTAD NGGVHINSGI PNKAFYLAAT ALGGYAWEKA GRIWYETLRA PQIRPNTTFR AFASVTVHQA GLLFGADARK AVSEAWQAVG IVVRAAGQVS AAGQVSAAGQ VSAAG
|
| |