Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mflv_2917 |
Symbol | |
ID | 4974238 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium gilvum PYR-GCK |
Kingdom | Bacteria |
Replicon accession | NC_009338 |
Strand | - |
Start bp | 3088255 |
End bp | 3089994 |
Gene Length | 1740 bp |
Protein Length | 579 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640457139 |
Product | HK97 family phage prohead protease |
Protein accession | YP_001134182 |
Protein GI | 145223504 |
COG category | [R] General function prediction only |
COG ID | [COG3740] Phage head maturation protease |
TIGRFAM ID | [TIGR01543] phage prohead protease, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.595306 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.164671 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAACGTA AGTCAGTTGC GTTGTCGATC AAGAGCGCCG ACGACCAGAC CGGGGTATTC ACGGGCCTGG CGTCGGTGTT CGGCAACGTT GATGCTCACG GCGACATCGT TCGCCGCGGA GCGTTCACCA AGTCGCTGGC CGCCGGCCAG CCGATCCCGC TGCTGTGGAT GCACAAGGCC GACGACCCCC GCAATTACGT CGGGGACGTC ATCGAGGCCA CCGAAACTGC CGAGGGTCTG GCGATCACCG GCAAGTTCGA CCTCGACACC GACCACGGGG CCGCCGCGTA CCGAAACGTG AAGGGCCGCC GCGTCGGTGG TCTGAGCATC GGCTACCGGA TCAACCACTC CACGAAGACG GCCGCCGGCA ACGAACTCAC CGATCTGGAC CTCGTCGAGA TCTCGGTCGT CGATCGCGGC GCGAACGATC GCGCTCTGAT CGGCGCGGTG AAGTCCGCCG GCCGACCGAC AGCACCGATC CGCGCGGCGC TGGCCCGCGA CAACGTCAAG CGCTACCACC ACACCCCGAA GGACGTACCA CCCATGTTCG AGAACACCAT GCAGACCCTG ACCAAGGACC GGGACAGCCA GCTCGCGCTG GTCAAGCAGA TCATCGACAC CGCCGACGAG CTCGGCCGCG ACCTGACCGC CGAAGAGACC AGCCAGGTCG AGGAAGCCAC CGGCAAGGCC AAGTCCCTCG ACTCGAGGAT CGCTCAGGTC GAGAAGGACA TGGCGATCTA CGCGGACACC AAGCGCACCG CCGACATCAT CGGGGGCCTC GACGACATGG CCCGCAACGC CAGCGGCGAG ACCGAGGGCG GTCACCTGGC GCTGACCGGC AAGCACGCAA AGGCGATGGC GCAGCGCGTC ATCAAGGCCA TGCCGCGCGG CCCCGGCGGC ACCAAGGCGT TCGCGGCGGG CGTACAGACC ACCTCGACGA TCGTGCTGCC CGACGTCGTG CAGACCGGGC GGCCGGCGGT GTCGGTGCTC GACGTGCTGC CGACCCGCGT CGTGCCACCG TCGTACAGCT TCCTTCGCCA GTCAGCGCGC AACAACAACG CGTCCGTGGT GCCGGTCGGC GGGACGAAAC CCACGACCGA CCCGCAGGTC GTCGGCGTCG AGAACCGTTT GCGCGTCGTG GCGCACGTGT CGACCGGTCT CGATCACTAC CTGCTGTCCG ACGCGCCGAA CCTGGAGCGG TTCGTCCAGG ACGAGATGCT CTACGGCCTG CGGGTGAAGC TGGAACAGCA GATCCTCGCC GGGGCCGGTG AGGACATCAG CGACGACGAC ATGACGGGCG TGCTCAACAC CTCCGGTGTC GTCGTGCAGG CATTCGCCAC CAACGCGCTG ACGTCGGTGC GCAAGGCCAT CACCACCCTC GAAGCCGCCG GCTACAAACC GGGCCTGATC GTGCTCAGCG CGGCCGATTG GGAGGCGGTC GAGCTGCTGA ACGCGACGTC GGGTGCCACC GATGTGCAGG GTGTGCCGGT CGATCCGGTC GCACGCCGGC TGTGGGGTGT GCCCGTGGTG CTCAATCAGG GCTTGGGCGC CAAGACTGGT CTGGTGATCG GTGACGGCGC GCTGACCGTC GACCACGACG GTCAGGTCGA GGTGAAGTGG TCCGACGCGG TGTCGGACGA CTTCCTGAAG AACCAGGTGC GGTGCCGTGT TGAGGGCCGG TTCGGTGTCA GCGTGAACCA GCCCGCCGCT GTGGTGAAGG TCGCCACCGC CGCCGCATAG
|
Protein sequence | MQRKSVALSI KSADDQTGVF TGLASVFGNV DAHGDIVRRG AFTKSLAAGQ PIPLLWMHKA DDPRNYVGDV IEATETAEGL AITGKFDLDT DHGAAAYRNV KGRRVGGLSI GYRINHSTKT AAGNELTDLD LVEISVVDRG ANDRALIGAV KSAGRPTAPI RAALARDNVK RYHHTPKDVP PMFENTMQTL TKDRDSQLAL VKQIIDTADE LGRDLTAEET SQVEEATGKA KSLDSRIAQV EKDMAIYADT KRTADIIGGL DDMARNASGE TEGGHLALTG KHAKAMAQRV IKAMPRGPGG TKAFAAGVQT TSTIVLPDVV QTGRPAVSVL DVLPTRVVPP SYSFLRQSAR NNNASVVPVG GTKPTTDPQV VGVENRLRVV AHVSTGLDHY LLSDAPNLER FVQDEMLYGL RVKLEQQILA GAGEDISDDD MTGVLNTSGV VVQAFATNAL TSVRKAITTL EAAGYKPGLI VLSAADWEAV ELLNATSGAT DVQGVPVDPV ARRLWGVPVV LNQGLGAKTG LVIGDGALTV DHDGQVEVKW SDAVSDDFLK NQVRCRVEGR FGVSVNQPAA VVKVATAAA
|
| |