Gene Mflv_4989 details

Gene Information

Locus tag: Mflv_4989
Symbol:
ID: 4976300
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium gilvum PYR-GCK
Kingdom: Bacteria
Replicon accession: NC_009338
Strand:
Start bp: 5308673
End bp: 5312233
Gene Length: 3561 bp
Protein Length: 1186 aa
Translation table: 11
GC content: 68%
IMG OID: 640459216
Product: HAD family hydrolase
Protein accession: YP_001136243
Protein GI: 145225565
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1554] Trehalose and maltose hydrolases (possible phosphorylases)
        [COG1877] Trehalose-6-phosphatase
TIGRFAM ID: [TIGR00685] trehalose-phosphatase
            [TIGR01484] HAD-superfamily hydrolase, subfamily IIB
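
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. Neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 5308673
end_bp = 5312233
reported_gene_length = 3561      # bp
reported_protein_length = 1186   # aa

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print("gene length matches:", gene_length == reported_gene_length)
print("protein length matches:", protein_length == reported_protein_length)
```

Under these assumptions both reported lengths agree with the coordinates.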


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.294557
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCCGGTGT TCATCGATCC GCGCCACCAC GACGCGGTGA TCTTCGAGGT CGACGATCCG 
GTGGCCGACG AAGTGGCGCT GTCCGCCTCG CAGGGGGATC TGGCGCGTCG GCTCACCGCA
GCCGGAATCG GCGTCGGCTG TGCCGAGTCG GCGATGCTTC GGGAGACCGC ACGACGTCTC
GGGGTGCACC CGGGGCGCTG CGTCGTCGTC GCAGGGGGAG AAGCGGGGGT CAGGACCGCG
CGCGCGGGTG GCTTCGCGCT GATCGTCGGC ATCGACCGGG CGGAGCGCGC CGAGGATCTC
CGGAGCTGTG GCGCCGACGT GGTGGTGCCC GATCTCGTCG CGGTGACGGT CCGTGACGGT
TACCGACGGA TGTCGGAGCT CGCCGATGCG CTGCAGTCCT ACGGCGAGAT CGTGCCGCTG
GCCGAGACAC GTACCCCTGT CGTACTGCTG GACTTCGACG GGACACTGTC CGACATCGTG
GGCGATCCCG ACACGGCCGC CCTGGTGCCC GGGGCGCGGT CGGTCCTCGA CGCGTTGGCG
GCGCGCTGCC CCGTCGCCGT GGTCAGCGGA CGTGCGCTGG CCGACATCCG GGATCGCATC
GGCGTCCCCG GTATCTGGTA CGCCGGCAGC CACGGCTTCG AACTCTGTTC GCCCGACGGC
GGGATCCAGG AGAACGAGGC GGGCCTGGAA ATAGTGCGCG TACTCGCCGG CGCGCTGGCC
GAGGTGCGTG AGCGGGTCGG CGCAGTAGAT GGTGTGCTGA TCGAGGACAA GCGGTTCTCG
ATCGCGGTGC ACTATCGCAA CGTCGCCGCG GAATCCGTCG ACGAGGTCGT CACCGCGGTC
CGGAACATCG CACAGTGCAA CGGATTACGG GCTGATGGTG GGCGCAGGGT GGTCGAGCTG
AAACCGGACA CCGGTTGGCA CAAGGGCCGG GCGGTCGAAT GGATCCTCGA CCGCATCGAC
GGCGACGAGC TTCTGCTACC CGTGTACATC GGGGACGACC TCACCGACGA GGACGGCTTC
GACGCCGTGC GGCTGCGCGG GATCGGAGTC GCGGTGCGCA GCGCCGAATC GGGAGATCGC
CGATCGGCCG CCCGGTTCGC CCTCGACAGT CCGGCGGCCG TCTGCGCCTT CCTTGCCCGG
TTGTCGGACC AGCTCGCCGT CGAGCAGGAC CTCACCAACG ATCCGTGGAC GCTGACGTTC
GGCGGTTATC TGCCCGAAGA CGAAAGGCTG CGGGAGGCGC TGTGCACGCT GGGTAACGGC
TATCTCGCGA CCCGGGGCGC CGCTCCCGAG TGCGACGCGA GCCGCCTGCA CTACCCGGCG
ACGTATGTGG CCGGCATCTA CAACCGGCTC TTCGACGAGA TCGCCGGGAC GACCGTCGAC
AACGAGAGCC TGGTGAATCT GCCCAACTGG CTACCGGTGA CCTTCCGCGT CGACGGCGGG
GCGTGGTTCG ACATCGATGC GGTGGAATTC ACGTCCTACG TCACCACGCT GGACCTGCGC
CGGGCTACGC TGACGCGCGA GTTCGTGATG CGGGATCAGG CCGGCCGCAT CACCCGGATC
CGGCAGCGCC GACTGGTGGC GATGCACCGG CCGCACGTGG CGGCGATGGC GACGACCGTG
CGTGCCGAGA ACTGGTCGGG GAGGCTGCAA TTGCGGTCCG TGCTCGACGG GGGTGTCGAG
AATCTGGGTG TCGAACGGTA CCGGGCGCTG TCGTCCCGGC ACCTGACCGT CGACGCCATG
CGCGAACTGT CCCGTGATGC CGTGCTTCTG CAGACCCACA CGGGGGAGTC GCAGATCCAG
ATCGCCGTCG CCGCACGCCA CCGCGTGACC GGCGGGGAAC CGGAGTCCGT GGACCAGAGG
GTATTCCGGG ACGACTGCCG GATCGGGCAC GACATCGAGG TCGTTGTCAC CGCGGGGCAG
GCGGTCACGC TGGAGAAGGT GATCGCGGTG TACACCGGTC GGGACCACGG CATGTCCGGG
CCCGTGACGG CTGCCGAGCG CGAGATCGCC GGCGCGGACA CCTTCGACCG GCTGGAGGAC
GGACATCGGC TGGCCTGGGC CCACCTGTGG GAGCGCTTCA ACATCGACAT GGGACACGAC
CCGAATCTGC TGCGTCTCGT CCGCCTACAC CAGTTGCACC TGCTGCAGAC CCTGTCGCCC
CACACCGCGG ACCTCGATGT CGGGGTGCCT GCGCGGGGAC TGCACGGTGA GGCCTACCGC
GGGCATGTCT TCTGGGACGA GCTGTTCGTC TTCCCCGTGC TGAACATGCG GTTGCCCAAG
GTCACCCGGT CCCTGCTGCT GTACCGCTAT CGGCGGCTCC CCGAGGCCCG GCGTGCCGCA
GCCGAGGCAG GATACGCCGG GGCGATGTTC CCGTGGCAGT CCGGTAGCGA CGGACGCGAG
GAAAGTCAAC GGCTGCACCT GAATCCGCGT TCGGGCCGGT GGAATCCCGA CGCCAGCGCC
CGCGCCCATC ACGTGGGTCT GGCCATCGCC TTCAACATCT GGCAGCACTA CCAGGTCACC
GGGGACATCG GCTTTCTCAT CGACTACGGC GCCGAGATGC TGGTGGAGAT CACCAAGTTC
TGGGTGAGTG CGGCGAACCT GGACCCGCAC CGCGACCGAT ATGTGATCCG CGGTGTCATC
GGACCGGATG AGTTCCATTC CGGTTATCCG GGAAGGGAAT ACGACGGAGT CGACAACAAC
GCCTACACCA ACCTGATGGC GGTGTGGGTG ATCGTGCGGA CACTGGAGGC CCTCGAGCGG
TTGCCGTTGT CCTACCGGCT GGCGCTTCTG GAGACGGTCG GCGTCGGAGA CGAGGACCTG
GCGCACTGGG AGGACGTCAG CCGCCGGATG TTCGTGCCGT TCCACGACGG GGTCATCACG
CAGTTCGAAG GGTATGACCG ACTGCGCGAA CTCGATTGGG AGGCCTACCG CAACCGCTAC
GACGATCTGC AGCGGCTGGA CCGCATCCTC GAAGCCGAGG GCGACAGCGT CAACAACTAT
CGGGCGGGCA AGCAGGCCGA CACGTTGATG CTGTTCTATC TGTTGTCGGC CGACGAACTC
TACGAGTTGT TCGACCGCCT CGGGTACAAC TTCGCACCGG AACAGATTCC GGCCACCATC
GACTACTACC AGAAGCGCAC CTCACACGGA TCGACCCTGA GCGCAGTGGT GCACTCCTGG
GTGCTGGCCC GTGGTGACCG CCGTGAAGCC ATGCACTACT TCCGCCGGGT GCTGGCCTCC
GACGTCGTCG ACATCCAGCG CGGCACCACC GCCGAGGGGA TCCACCTGGC GGCGATGGCC
GGCAGCATCG ACCTGCTGCA GCGGTGTTTC ACCGGGTTGG AGCTGCGCCG GGACCGGATC
GTGGTCGGGC CGATGTGGCC CGAACCGCTC GGCAGGCTGG CCTTCACGTT CCGCTACCGC
GGGCACCGGC TCCGGCTGAC CGTGTTGGGA CGGTCCTCGA CCCTGAGCGC CGAGCCCAGC
GAAGCCTCAC CGATCCTCGT CGAATGCCGT GGTCAGGCAC AGACTCTCGT CGCTGGTGGG
ACGGTCGAGT TCACCCGGTG A
 
Protein sequence
MPVFIDPRHH DAVIFEVDDP VADEVALSAS QGDLARRLTA AGIGVGCAES AMLRETARRL 
GVHPGRCVVV AGGEAGVRTA RAGGFALIVG IDRAERAEDL RSCGADVVVP DLVAVTVRDG
YRRMSELADA LQSYGEIVPL AETRTPVVLL DFDGTLSDIV GDPDTAALVP GARSVLDALA
ARCPVAVVSG RALADIRDRI GVPGIWYAGS HGFELCSPDG GIQENEAGLE IVRVLAGALA
EVRERVGAVD GVLIEDKRFS IAVHYRNVAA ESVDEVVTAV RNIAQCNGLR ADGGRRVVEL
KPDTGWHKGR AVEWILDRID GDELLLPVYI GDDLTDEDGF DAVRLRGIGV AVRSAESGDR
RSAARFALDS PAAVCAFLAR LSDQLAVEQD LTNDPWTLTF GGYLPEDERL REALCTLGNG
YLATRGAAPE CDASRLHYPA TYVAGIYNRL FDEIAGTTVD NESLVNLPNW LPVTFRVDGG
AWFDIDAVEF TSYVTTLDLR RATLTREFVM RDQAGRITRI RQRRLVAMHR PHVAAMATTV
RAENWSGRLQ LRSVLDGGVE NLGVERYRAL SSRHLTVDAM RELSRDAVLL QTHTGESQIQ
IAVAARHRVT GGEPESVDQR VFRDDCRIGH DIEVVVTAGQ AVTLEKVIAV YTGRDHGMSG
PVTAAEREIA GADTFDRLED GHRLAWAHLW ERFNIDMGHD PNLLRLVRLH QLHLLQTLSP
HTADLDVGVP ARGLHGEAYR GHVFWDELFV FPVLNMRLPK VTRSLLLYRY RRLPEARRAA
AEAGYAGAMF PWQSGSDGRE ESQRLHLNPR SGRWNPDASA RAHHVGLAIA FNIWQHYQVT
GDIGFLIDYG AEMLVEITKF WVSAANLDPH RDRYVIRGVI GPDEFHSGYP GREYDGVDNN
AYTNLMAVWV IVRTLEALER LPLSYRLALL ETVGVGDEDL AHWEDVSRRM FVPFHDGVIT
QFEGYDRLRE LDWEAYRNRY DDLQRLDRIL EAEGDSVNNY RAGKQADTLM LFYLLSADEL
YELFDRLGYN FAPEQIPATI DYYQKRTSHG STLSAVVHSW VLARGDRREA MHYFRRVLAS
DVVDIQRGTT AEGIHLAAMA GSIDLLQRCF TGLELRRDRI VVGPMWPEPL GRLAFTFRYR
GHRLRLTVLG RSSTLSAEPS EASPILVECR GQAQTLVAGG TVEFTR
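
The sequences above are printed in space-separated blocks of ten, so they need to be cleaned before use. The following is a minimal sketch, not part of the original record, that strips the spacing, translates a nucleotide string, and computes a GC fraction. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. The codon table is the standard amino-acid assignment, which translation table 11 shares; table 11 differs only in its permitted start codons, which this sketch does not model.

```python
# Codon table in TCAG order; amino-acid assignments are the same for
# the standard code and for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(block: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(block.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 1; a stop codon is written as '*'."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 21 bases of the gene sequence listed above.
first_bases = clean("ATGCCGGTGT TCATCGATCC GCGCCACCAC")
last_bases = clean("ACGGTCGAGT TCACCCGGTG A")

print(translate(first_bases))  # MPVFIDPRHH
print(translate(last_bases))   # TVEFTR*
```

The two printed fragments correspond to the first ten and last six residues of the protein sequence, followed by the stop codon. Passing the full cleaned gene sequence to `gc_fraction` would give a value to compare against the reported GC content.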