Gene Mflv_1722 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMflv_1722 
Symbol 
ID4973047 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium gilvum PYR-GCK 
KingdomBacteria 
Replicon accessionNC_009338 
Strand
Start bp1790275 
End bp1793079 
Gene Length2805 bp 
Protein Length934 aa 
Translation table11 
GC content69% 
IMG OID640455930 
Productglycoside hydrolase family protein 
Protein accessionYP_001132991 
Protein GI145222313 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.594892 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCGGGG GCACTTGGAG GAGCCAGAGC ACAGATCGTC ACCTCGCGTC GGATGCCGTC 
GGACGCGGCG CAGTGCGGCA AGTATGCCGC GTAGGCGCCG ACGGGGTCCG ACAAAATGAC
GGGTTTGGTC AATCCGCCCA CCGTGCGGCG CGGCGGCGGT ATTCCAGATT CATGGATTCT
GCGCAGTTCG TCGGTCGGGT CGGTGGTCTT GCGGTGGCGC TCGGTGTCGG GGCCGCGCTC
GTCACCGGCT CCGGCGTGGC CTGGGCCACC GAGGATTCGG GGTCGTCCGC GTCGGCGGGG
GCTGAATCAT CCACTGCGCG AACGTCATCG CAGGATCGCC AGGCGGCGTC CCCGTCCAAG
CCCGCGCGCG ATCGCGCCGG CGGTCGTGAC GACGCGGAGG GCGGGGGCGC CGAGGACCGG
GACGCGCAAG CGCAGGACGA CGCAACGCCC GAGACCGGCG GGGAGACCGG TGCACCCGAC
GACGCATCGG TCGACGCGTC CGGGCCGGAT ACCGGCGGGA CCACCGGCGA TGCGTCCGCC
GAGGACGACG ATCCCGACGA GGCCGAGGAG CGTGCCGCCT GGGCGGACAC CGGCGCGGAC
ACCGACGCAG ACCCCGACGC AGACCCCGAC GCGGACGCTT CTGCCGCAAC CGATGTCGCG
ACCGCCGCTG TGCCGTCCGG TGCCGGGCCC ACGGCCGTCG TCACCACAGA GGTGACCGTC
GACCCGCCCC GGGTGCCGTC CTGGCGTCCG TGGCCCACCG CATTCGATCT GCGGGGCGCG
GTGACCTACG TGGTGGATCT CGCCGTCAGC GTCGTCGACG CGCTGTTCCA CCCGTTCGCG
GCGGGCCCGC CCCCGCCGCC CGGTGATCCG TCGGCCTGGG GGCTGCTGGC GTGGGTGCGG
CGCGAGCTGT TCAACAGCAC GCCCGACGAA GTCGACAACC CGCTGCCCTA CACGCAGAGC
CTGGTCGACG GTGACGTCGT GATCACCGGC AACGTCGGCG TCGAGGACCG CGACGGCGAC
CCGCTCACCT ACGAGGTGAT CGGCAGGCCT CGTTTCGGCG GGGAGGTCAC GGTGGACGCC
GACGGGAACT TCGTCTACCG GCCGATGAAC GCGATGGCCG CGGTCGGTGG AACCGACGAG
TTCACCGTCG CCGTCACCGA CGATGCGGCG GGCCTGCACG TGCACGGGCT TCTCGGCTGG
CTGCAGTTCG TGCCGATCCT CGGGAGCTTT CTGAACCCCG GTGGGGGGCA CGGCATCACA
CGCACCATCA CCGTCACGGT CGAACCCGTC GACGGTATCG ACCTGTCGCT GCCCGACGGC
TTCAAGTGGG GCGTCGCGCA TTCGGGATTC CAGGCCGAGG GTGGCCCCGG CTCGCCGGTG
GACACCGGGT CGGACTGGTA CCGCTGGGTG CACGACCCGC TCAACAGGTT GCTCGGTCTG
GTGAAGGGAG TACCGGAGAA CGGGCCCGGC GCGTATGTCT CCTACGAGGA CGACGCCCGG
TTGGCCCGTG AGGAACTGGG CGTCAACACC TTCCGGATGG GCATCGAATG GAGTCGGATA
TTCCCGGATT CGACTGCCTC GGTGGATATC TCGGATGAAG GGGGGACGGT CAGCCTGGCC
GACCTGCAGG CACTCGACGC CCTCGCGAAC GCCGACGAGG TCGCGCACTA CCGGGATGTG
TTCGCTGCGC TGCGCTTCCA CGGACTCGAC CCGATGGTCA CCGTCAACCA CTTCACGCTG
CCGGTGTGGG TGCACGATCC CGTCCTCGCG CGCCCGTTGA TCCAACTCGG TCTGCCCGTC
GCCGCGGCCG GCTGGCTGTC CACGGAGACC GCCGTCGAGT TCGAGAAGTA CGCCGCGTAT
CTGGCGTGGA AGTACGGCGA TCAGGTCGAC AACTGGGCGA CGCTCAACGA GCCGTTCCCG
CCGGTGCTGA CCGAGTTCCT GGCGATCCCG TGGGTGGTGC CGAACTGGCC GCCGGGGGTG
CTGCGACCCG ATCTCGCCTC GACGTTCCTG GTGAACCAGG CGATCGGTCA CGTCGCGGCA
TACGACGCCA TCCACGCGTG GGACACCACC TCCGCGGTCG AGGGCGGCCC GGCGGCATTC
GTCGGATTCA CCCACAACAT GATCCCGGCG CGGCCCGCCA ACCCCGTCAA CGCACTCGAC
GTCGGGGCGG CCGAGGCCTG GAACCATTAC TACAACCACT GGTTCCCGAA CGCGGTGATC
GACGGATGGA TCGACCTGGA CTTCGACGGG ATCAAATCCG CCGACGAGAT CCGCCCCGAC
ATGGCCGACA AGGTCGACTT CCTCGGCGTG CAGTACTACG GATCGCAGCC GATGGTCGGT
TTCGGTGTCG CACCGCTGCC GGGATTCCCG TTCCTGCGCG GCTTCCCGAT CAGGTGCTCG
GCCGAGGAGA CCACCTGCAG CGACTTCAAC CAGCCGATCG ATCCCGGCGG TTTCCGGGAG
GTGCTCGAAG TCGCCGCCTC GTACGGGAAA CCGTTGTGGG TCACCGAGAA CGGCATCGCC
GACGCCGGGG ATGCCAAGCG GCCGCCGTAC CTGGTCAACC ACGTCGCGGT GGTTCAGGAT
CTGGTGGCGC ACGGGTTGGA CATCCGCGGC TACACCTACT GGTCGTTCGT CGACAACCTG
GAATGGTCGG AGGGCTACGA CCTGCAGTTC GGTCTGTACG GGTCGGACCC GGACACACCC
GAGCTCGAAC GCATCCCCAA GGTGGCGAGT ATCGCCGCGC TGAAGGGGAT CACGACCGCG
AACGGGCTGC CCGTGGCCCT GCTGCAGAAC TATCTGCCCG GTTAG
 
Protein sequence
MGGGTWRSQS TDRHLASDAV GRGAVRQVCR VGADGVRQND GFGQSAHRAA RRRYSRFMDS 
AQFVGRVGGL AVALGVGAAL VTGSGVAWAT EDSGSSASAG AESSTARTSS QDRQAASPSK
PARDRAGGRD DAEGGGAEDR DAQAQDDATP ETGGETGAPD DASVDASGPD TGGTTGDASA
EDDDPDEAEE RAAWADTGAD TDADPDADPD ADASAATDVA TAAVPSGAGP TAVVTTEVTV
DPPRVPSWRP WPTAFDLRGA VTYVVDLAVS VVDALFHPFA AGPPPPPGDP SAWGLLAWVR
RELFNSTPDE VDNPLPYTQS LVDGDVVITG NVGVEDRDGD PLTYEVIGRP RFGGEVTVDA
DGNFVYRPMN AMAAVGGTDE FTVAVTDDAA GLHVHGLLGW LQFVPILGSF LNPGGGHGIT
RTITVTVEPV DGIDLSLPDG FKWGVAHSGF QAEGGPGSPV DTGSDWYRWV HDPLNRLLGL
VKGVPENGPG AYVSYEDDAR LAREELGVNT FRMGIEWSRI FPDSTASVDI SDEGGTVSLA
DLQALDALAN ADEVAHYRDV FAALRFHGLD PMVTVNHFTL PVWVHDPVLA RPLIQLGLPV
AAAGWLSTET AVEFEKYAAY LAWKYGDQVD NWATLNEPFP PVLTEFLAIP WVVPNWPPGV
LRPDLASTFL VNQAIGHVAA YDAIHAWDTT SAVEGGPAAF VGFTHNMIPA RPANPVNALD
VGAAEAWNHY YNHWFPNAVI DGWIDLDFDG IKSADEIRPD MADKVDFLGV QYYGSQPMVG
FGVAPLPGFP FLRGFPIRCS AEETTCSDFN QPIDPGGFRE VLEVAASYGK PLWVTENGIA
DAGDAKRPPY LVNHVAVVQD LVAHGLDIRG YTYWSFVDNL EWSEGYDLQF GLYGSDPDTP
ELERIPKVAS IAALKGITTA NGLPVALLQN YLPG