Gene Mflv_2917 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMflv_2917 
Symbol 
ID4974238 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium gilvum PYR-GCK 
KingdomBacteria 
Replicon accessionNC_009338 
Strand
Start bp3088255 
End bp3089994 
Gene Length1740 bp 
Protein Length579 aa 
Translation table11 
GC content67% 
IMG OID640457139 
ProductHK97 family phage prohead protease 
Protein accessionYP_001134182 
Protein GI145223504 
COG category[R] General function prediction only 
COG ID[COG3740] Phage head maturation protease 
TIGRFAM ID[TIGR01543] phage prohead protease, HK97 family 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.595306 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.164671 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAACGTA AGTCAGTTGC GTTGTCGATC AAGAGCGCCG ACGACCAGAC CGGGGTATTC 
ACGGGCCTGG CGTCGGTGTT CGGCAACGTT GATGCTCACG GCGACATCGT TCGCCGCGGA
GCGTTCACCA AGTCGCTGGC CGCCGGCCAG CCGATCCCGC TGCTGTGGAT GCACAAGGCC
GACGACCCCC GCAATTACGT CGGGGACGTC ATCGAGGCCA CCGAAACTGC CGAGGGTCTG
GCGATCACCG GCAAGTTCGA CCTCGACACC GACCACGGGG CCGCCGCGTA CCGAAACGTG
AAGGGCCGCC GCGTCGGTGG TCTGAGCATC GGCTACCGGA TCAACCACTC CACGAAGACG
GCCGCCGGCA ACGAACTCAC CGATCTGGAC CTCGTCGAGA TCTCGGTCGT CGATCGCGGC
GCGAACGATC GCGCTCTGAT CGGCGCGGTG AAGTCCGCCG GCCGACCGAC AGCACCGATC
CGCGCGGCGC TGGCCCGCGA CAACGTCAAG CGCTACCACC ACACCCCGAA GGACGTACCA
CCCATGTTCG AGAACACCAT GCAGACCCTG ACCAAGGACC GGGACAGCCA GCTCGCGCTG
GTCAAGCAGA TCATCGACAC CGCCGACGAG CTCGGCCGCG ACCTGACCGC CGAAGAGACC
AGCCAGGTCG AGGAAGCCAC CGGCAAGGCC AAGTCCCTCG ACTCGAGGAT CGCTCAGGTC
GAGAAGGACA TGGCGATCTA CGCGGACACC AAGCGCACCG CCGACATCAT CGGGGGCCTC
GACGACATGG CCCGCAACGC CAGCGGCGAG ACCGAGGGCG GTCACCTGGC GCTGACCGGC
AAGCACGCAA AGGCGATGGC GCAGCGCGTC ATCAAGGCCA TGCCGCGCGG CCCCGGCGGC
ACCAAGGCGT TCGCGGCGGG CGTACAGACC ACCTCGACGA TCGTGCTGCC CGACGTCGTG
CAGACCGGGC GGCCGGCGGT GTCGGTGCTC GACGTGCTGC CGACCCGCGT CGTGCCACCG
TCGTACAGCT TCCTTCGCCA GTCAGCGCGC AACAACAACG CGTCCGTGGT GCCGGTCGGC
GGGACGAAAC CCACGACCGA CCCGCAGGTC GTCGGCGTCG AGAACCGTTT GCGCGTCGTG
GCGCACGTGT CGACCGGTCT CGATCACTAC CTGCTGTCCG ACGCGCCGAA CCTGGAGCGG
TTCGTCCAGG ACGAGATGCT CTACGGCCTG CGGGTGAAGC TGGAACAGCA GATCCTCGCC
GGGGCCGGTG AGGACATCAG CGACGACGAC ATGACGGGCG TGCTCAACAC CTCCGGTGTC
GTCGTGCAGG CATTCGCCAC CAACGCGCTG ACGTCGGTGC GCAAGGCCAT CACCACCCTC
GAAGCCGCCG GCTACAAACC GGGCCTGATC GTGCTCAGCG CGGCCGATTG GGAGGCGGTC
GAGCTGCTGA ACGCGACGTC GGGTGCCACC GATGTGCAGG GTGTGCCGGT CGATCCGGTC
GCACGCCGGC TGTGGGGTGT GCCCGTGGTG CTCAATCAGG GCTTGGGCGC CAAGACTGGT
CTGGTGATCG GTGACGGCGC GCTGACCGTC GACCACGACG GTCAGGTCGA GGTGAAGTGG
TCCGACGCGG TGTCGGACGA CTTCCTGAAG AACCAGGTGC GGTGCCGTGT TGAGGGCCGG
TTCGGTGTCA GCGTGAACCA GCCCGCCGCT GTGGTGAAGG TCGCCACCGC CGCCGCATAG
 
Protein sequence
MQRKSVALSI KSADDQTGVF TGLASVFGNV DAHGDIVRRG AFTKSLAAGQ PIPLLWMHKA 
DDPRNYVGDV IEATETAEGL AITGKFDLDT DHGAAAYRNV KGRRVGGLSI GYRINHSTKT
AAGNELTDLD LVEISVVDRG ANDRALIGAV KSAGRPTAPI RAALARDNVK RYHHTPKDVP
PMFENTMQTL TKDRDSQLAL VKQIIDTADE LGRDLTAEET SQVEEATGKA KSLDSRIAQV
EKDMAIYADT KRTADIIGGL DDMARNASGE TEGGHLALTG KHAKAMAQRV IKAMPRGPGG
TKAFAAGVQT TSTIVLPDVV QTGRPAVSVL DVLPTRVVPP SYSFLRQSAR NNNASVVPVG
GTKPTTDPQV VGVENRLRVV AHVSTGLDHY LLSDAPNLER FVQDEMLYGL RVKLEQQILA
GAGEDISDDD MTGVLNTSGV VVQAFATNAL TSVRKAITTL EAAGYKPGLI VLSAADWEAV
ELLNATSGAT DVQGVPVDPV ARRLWGVPVV LNQGLGAKTG LVIGDGALTV DHDGQVEVKW
SDAVSDDFLK NQVRCRVEGR FGVSVNQPAA VVKVATAAA