Gene Mflv_0803 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMflv_0803 
Symbol 
ID4972131 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium gilvum PYR-GCK 
KingdomBacteria 
Replicon accessionNC_009338 
Strand
Start bp825137 
End bp828550 
Gene Length3414 bp 
Protein Length1137 aa 
Translation table11 
GC content62% 
IMG OID640454998 
Producttype III restriction enzyme, res subunit 
Protein accessionYP_001132075 
Protein GI145221397 
COG category[V] Defense mechanisms 
COG ID[COG4096] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.874877 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.710753 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGAACT TCGCCTTTCT GCAGGCCATC GGCTGGCCTG AGATGCACGC CGACTGCAGG 
CGGGCGGAGA GCTACGCCAC CAGCGATCCG CGGTCGGCAT GTTTCTATAG CCGCCGGACT
GTCGAGCTGC TGGTCGACTA TCTGTATGGC GTTCTCGCCC TGCCGATTCC GTACAAGAAC
GACCTGGCCG CCAAGATCAA TGACCCAAAG TTTAAGTCGA AGGTAGGCGT CGGGATCGCG
ACGAAGCTGA ACCTGATCCG CAAGCTCGGC AACACCGCCG TGCATGATAC GCAGCAGATC
CCACAGCGGG CGGCGTTGGA TGCGTTGCGT GAGTTGCACC ACGTGATGTT GTGGGCGGCC
TTCCGGTATT CGACCAAACC GCAAGCGGTT CCGATGAAGG TGCTTTTCGA TCCGAAGATC
GCCGCGAAGG CCGCTCCGCT GAGCCGGCAG GAGGTCGCCG AGTTGGCGGC GAAGTTCGCC
GCCCAGGACG AAGCGCACGT CAAGACGCTG GCCGATAAGG ATGAGCTCGC CGCGCAGAAG
GACACCGAGA TCGCCGCATT GCGGGAGGCG GTCAAGCAGG CGCAGGCCGC CAACCAGCAG
ACCGACGACC GCGACTACAA CGAGGCGGAC ACCCGGGATC GGTTCGTCGA TGTCATGCTC
GCCGAAGCCG GCTGGGGGCT AACTGAAACT AAAGACCGCG AGTACCCGGT CACCGGCATG
CCCAATGGCG ACGGCAAGGG GTTCGTCGAC TACGTCTTGT GGGGTGAAGA CGGTCTCCCG
TTGGCGATCG TGGAGGCCAA ACGAACGACC AAGAGCCCGC AGGTCGGCCA GCAGCAGGCG
AAGCTGTACG CGGACTGCCT GGAGAAGATG ACAGGGCGTC GGCCGGTCAT CTTCTACACC
AACGGTTTCG AGCACTGGAT CTGGGATGAC ACCGGCGGCT ACCCTCCGCG GGAGATCCAA
GGCTTCTACA CCCGGGACCA GCTGGAGCTC CTAATCCAGC GACGCGGCAC CCGCAAGCCG
TTGACCGACA TGCCGATCGA CTCGGCCATC GTAGAGCGGC ACTATCAACA CCGTGCGATC
CGCGCGATCG ACGACGCGTT CACCGCCAAG GAACGCGAAG CCTTACTCGT GATGGCCACC
GGGTCGGGGA AGACCCGCAC CGTTATCGCG CTGGTGAAGC AGCTTATGGA AGCCAACTGG
GTCAAACGGG TCCTCTTCCT GGCCGACCGC ACCGCACTGG TAACCCAAGC CGCCAATGCG
TTCAAGGCGC ATCTGCCCGA TGCCACCACC GTGAACCTGG TAACCGAGAA AATCACTGAC
GGCCGGGTGT ACGTGTGCAC TTACCCGACG ATGATGAACC TCATCAACGA CACCGACTCC
GGCATCAGGA AATTCGGACC CGGCTACTTC GACCTCGTCG TCATCGACGA AGCCCACCGC
TCCGTCTACC AAAAGTACCG GGCCATCTTC GACTGGTACG ACTCGCTGCT CGTCTGCCTG
ACCGCCACCC CTAAAGACGA AGTCGACCAC AACACCTACC GACTGTTCCA CCTCGAAGAC
GGCGTACCCA CGGACGCCTA TAGCCTCGAC GACGCCGTCA AGGAAGGCTT CCTGGTACCT
GCAGTCGGGA TCTCCGTCGG CACCAAATTC CTGCGCCAAG GGATCCGCTA CGCCGATCTG
TCCGAAGAGG AGAAGGACGA CTGGGACGCC CTGGACTGGG GCGATGACGA TCCGCCCGAT
GAGGTGAGCT CTGAGGAGAT TAACCGGTTC CTGTTTAACG AGGACACCGT CGACAAAGTG
CTCGCCGAAC TAATGAGCAA GGGCCACCGC GTCGCCGAGG GTGACCGGCT CGGCAAGACC
ATCATCTTCG CCAAGAACCA AGCCCACGCC GAGTTCATCG CACGACGCTT CGACGTTCAA
TATCCGCAAT ATGCCGGAAC ATTCGCCCGG GTGATCACCC ATAGCACTGC CTATGCCCAG
TCGCTGATCG ACAACTTCTC GGTCACCGAC AAGGCACCAC ACATCGCCAT CTCGGTCGAC
ATGCTCGACA CCGGCATCGA TGTCCCCGAC GTCGTCAACC TCGTCTTCTT CAAGCTCGTT
CGGTCCAAGA CCAAGTTCTG GCAAATGATC GGCCGCGGTA CCCGCCTACG CCCTGACCTC
TTAGGCCCCG GCAAGGACAA GCAGAACTTC TACGTCTTCG ACTTCTGCGG CAACCTCGAA
TTCTTCAGCC AGGACCTGCC CGGATCCGAA GGCTCATTTC AGAAGTCATT GAACCAGCGC
CTGTTCGAAG CACGCCTCGG GCTGATCACC GCGATTGACC ACGCGTGGCC GCCCTCGGAA
CCAGAACCCG AAGAGGGACA GGGCACCGAA ACCGAACGAG GATTGCGCGT CGATGTCGCG
TGGTCGCTCC ATCGCTCCGT CGCGGGGATG AACTTGGACA ACTTCCTGGT GCGCCCCCAC
CGCAGGCTGG TCGAGCAATA CTCGCAATGG CCGGCCTGGA CATCGCTGAC GCCCGAGGTC
GCCGGAGATG TAGCCGAACA CCTCGCCGGG CTGCCATCAC TCCATAAAGA CGACGACGAG
GACGCCAAGC GCTTCGACAT GCTCGTCTTG CGCCGTCAGC TAGCCCAACT CGAAGGCGAC
GCTGTCGCGG CCGAACGGCT ACGCGAACAG ATCCAGAACA TCGCGACTGG CCTGCTGAGC
CAGACCGCGA TCCCGTCGGT GAAGGCACAA GAGGCGCTTC TCGATGAGGT CGGCGGGGAT
GAGTGGTGGA TCGACGTCAC TCTGCCGATG CTCGAATCAG TACGGCGCAA GCTACGCGGT
CTGCTCCGGT TCCTAGAGAA GGCGAAACGG AACCAGGTCT ATACCGACTT CGCCGACGAA
CTCAGCGAAG CCTCCCTCGT TGATCTCCCG GGGATCACGC CCGGCACCAA CTGGGAACGC
TTCCAAGCCA AGGCCCGCGC CTATCTCAGA CGGCACCAAG ATCACGTTGC GCTGCAACGG
TTACGACGCA ATAAGCCCTT GACTCCCGAT GATCTCGCGT CGTTGGAGCA GATGCTCATC
GAGAGCGGTA CGGGAGAGCA AGCCGATATC GAGCTGGCCA AGGAGCAGTC GCATGGCCTG
GGGTTGTTCG TGCGGTCACT AGTCGGGCTG GACCGCGAAG CCGCCGCCGA AGCTTTTGGC
GCATACCTGG ACGGCACGAA GTTCAATGCC GACCAGATCC GCTTCGTGAA CCTCATCATC
ACCGAACTGA CCGCGAACGG GTTCATGGAA CCTGTGCGGC TCTACGAATC GCCCTACATC
GACCACGCGC CCACGGGGCC CGACGACGTG TTCGGAGACG CCGATGTCGA CACGATCGTG
GCCATCCTGA ACAGCGTGCG GGACAATGCC GCTCCGAAAG ATGGCGCTGC GTGA
 
Protein sequence
MSNFAFLQAI GWPEMHADCR RAESYATSDP RSACFYSRRT VELLVDYLYG VLALPIPYKN 
DLAAKINDPK FKSKVGVGIA TKLNLIRKLG NTAVHDTQQI PQRAALDALR ELHHVMLWAA
FRYSTKPQAV PMKVLFDPKI AAKAAPLSRQ EVAELAAKFA AQDEAHVKTL ADKDELAAQK
DTEIAALREA VKQAQAANQQ TDDRDYNEAD TRDRFVDVML AEAGWGLTET KDREYPVTGM
PNGDGKGFVD YVLWGEDGLP LAIVEAKRTT KSPQVGQQQA KLYADCLEKM TGRRPVIFYT
NGFEHWIWDD TGGYPPREIQ GFYTRDQLEL LIQRRGTRKP LTDMPIDSAI VERHYQHRAI
RAIDDAFTAK EREALLVMAT GSGKTRTVIA LVKQLMEANW VKRVLFLADR TALVTQAANA
FKAHLPDATT VNLVTEKITD GRVYVCTYPT MMNLINDTDS GIRKFGPGYF DLVVIDEAHR
SVYQKYRAIF DWYDSLLVCL TATPKDEVDH NTYRLFHLED GVPTDAYSLD DAVKEGFLVP
AVGISVGTKF LRQGIRYADL SEEEKDDWDA LDWGDDDPPD EVSSEEINRF LFNEDTVDKV
LAELMSKGHR VAEGDRLGKT IIFAKNQAHA EFIARRFDVQ YPQYAGTFAR VITHSTAYAQ
SLIDNFSVTD KAPHIAISVD MLDTGIDVPD VVNLVFFKLV RSKTKFWQMI GRGTRLRPDL
LGPGKDKQNF YVFDFCGNLE FFSQDLPGSE GSFQKSLNQR LFEARLGLIT AIDHAWPPSE
PEPEEGQGTE TERGLRVDVA WSLHRSVAGM NLDNFLVRPH RRLVEQYSQW PAWTSLTPEV
AGDVAEHLAG LPSLHKDDDE DAKRFDMLVL RRQLAQLEGD AVAAERLREQ IQNIATGLLS
QTAIPSVKAQ EALLDEVGGD EWWIDVTLPM LESVRRKLRG LLRFLEKAKR NQVYTDFADE
LSEASLVDLP GITPGTNWER FQAKARAYLR RHQDHVALQR LRRNKPLTPD DLASLEQMLI
ESGTGEQADI ELAKEQSHGL GLFVRSLVGL DREAAAEAFG AYLDGTKFNA DQIRFVNLII
TELTANGFME PVRLYESPYI DHAPTGPDDV FGDADVDTIV AILNSVRDNA APKDGAA