Gene Mvan_4107 details


Gene Information

Locus tag: Mvan_4107
Symbol:
ID: 4648866
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 4402566
End bp: 4406243
Gene Length: 3678 bp
Protein Length: 1225 aa
Translation table: 11
GC content: 66%
IMG OID: 639807574
Product: HAD family hydrolase
Protein accession: YP_954890
Protein GI: 120405061
COG category:
    [G] Carbohydrate transport and metabolism
    [R] General function prediction only
COG ID:
    [COG0637] Predicted phosphatase/phosphohexomutase
    [COG1554] Trehalose and maltose hydrolases (possible phosphorylases)
    [COG1877] Trehalose-6-phosphatase
TIGRFAM ID:
    [TIGR00685] trehalose-phosphatase
    [TIGR01484] HAD-superfamily hydrolase, subfamily IIB
    [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED
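
The coordinate and length fields in the Gene Information table above appear to be mutually consistent. The following is a minimal sketch of that arithmetic, assuming the Start bp and End bp coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 4402566
END_BP = 4406243


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature with 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming one terminal stop codon is excluded."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("gene length is not a multiple of 3")
    return codons - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, "bp")                  # 3678 bp
    print(protein_length(length), "aa")  # 1225 aa
```

Both results match the Gene Length and Protein Length fields listed in the record.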


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.130759
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.249958
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCTCC CTGTCACCAT CGATCCGCGG TACCACGACG CGGTGCTGCT CGACCTCGAC 
GGCGCGCTGA CCAGTGAGGC CCCGCTCTTC GGGCCTACGG TCGATCTGGC CAGGAAGTTG
CAGAGCATCG GGGTCGCCGC GGCGGCCTAC TCGTCGAATG GACAGTGCAG GCAGGCGCTG
AAGGCCGCCG GCATCGACGA CCTGTTCGGC GTCTGCATCG ACGGAATGGA GGGCGACCGC
GGAACGGCCG AGAAGCCCGA TCCCGCAGTG CTTCTGGAGG CCGCCCGGCG GATCGGAGCG
CGACCGCAGC GATGCGTCGT GGTCGAGAAC TCCGCGGCAG GCGTGGCGGC CGGCCGTGCC
GGCGGTTTCG CGCTCGTGGT CGGCATCGAC GGTACGGAGG TTGCCGACGA GTTGACCCGC
CACGGTGCCG ACGTCGTGCT CGCCGGTCTC GCCGACATCG CCGTCCGCAC GGGTGACAGG
CGGATCTCAG AACTGCCGAA CGCGCTGGCT TCCTACGGCC AGTTGATCGG CATCACCAGC
GCGCGCGAGT CCATGCTGTT CCTGGACTAC GACGGCACGC TGTCCCCCAT CGTCTCGGAC
CCCGCCGCGG CCAGGCTCGT CGAGGGTGCC GACGAGGCGC TCGAACTCGT CTCCAAGGTG
TGCCCGGTGG CCATCCTGAG CGGGCGCGAC CTCGCCGACG TCAGCGCCCG GGTCGGCACC
CCCGGCGTCT GGTATGCAGG CAGTCACGGC TTCGAGTTGA CCTCACCGGA CGGCGCTTAT
CACTGCAACG ATGCCGCTGC CGTGTTCGTG CCGGTCCTGG AGGAGGCCGC CGCCGAGTTG
AACAAGACCC TGGCCCAGAT CGCAGGAGTG CGCGTCGAGC ACAAGCGCTT TGCCGTCGCG
GTGCACTACC GCGAGGTGGC ACCGGAACAG GTCAGCGAGA TCGTCTCGGC CACACATCAA
CTCGGTGCGC GTCGCGGCCT GCGGGTGACC AGTGGGCGGA TGCTCGTCGA GCTGCGGCCC
GACCTCGACT GGGACAAGGG CACCACGCTG GCCTGGATCC GCGACCGCAT CGACCCGTCC
GGCTCACTGC TGCCGATCTA CATCGGCGAC GACCTGACCG ACGAGGACGC CTTCGATGCG
GTCCGGTTCG ACGGCATCGG GATCGTCGTC GGGCATGACG AGGACGGCGA CCGCAAGACC
GCCGCGAACT TCACCCTCCA AAGTCCGGAG CAGGTGCGTG AGTTCATCCA ACGCGGATCG
CGGTGGCTGG CCTACAAACA TCAGGTCTCG GGTGAGGCGT GGGATTACGT CTTCGACGGA
TACGACCCGC AGAACGAGAA GCTCCGCGAG GCGTTGTGCA CCGTGGGCAA CGGCTATTTC
GCCACCCGCG GTGCCGCGCC CGAGTCGAAG GCCGGGCAGG TGCACTACCC GGGCACCTAT
GCGGCCGGGG TGTACAACCG CCTCGTCGAC AACGTGTCGG GAACCGAGAT CGACAACGAG
AGTCTGGTCA ACCTGCCCAA CTGGCTCGCG TTGACCTTCC GTGTCGACCA CGGCGACTGG
TTCGACATCG ACGCGGTCAC CGTGCTGTCC TACCGGCAGA CGCTCGACCT TCGGGGAGCG
GTGCTGACCC GGCAGGTCCG CTTCCGCGAC CACGCTGGGC GGACGAGCTC GTTGACGCAA
CGACGGTTCG TGGCCATGCA CCTGCCCCAC GCCGGCGCTC TCGAGACGAC GGTCGTCGCG
GAAGACTGGT CAGGGACGAT CGAATTCCGC TCGACCGTCG ACGGCAACGT GACGAACTCG
CTGGTCGAGC GTTACCGCGA CCTGGCCAAC GAGCATCTCG ATTCGGCTGA CACGCAGGAA
CTCTCGAACA ACTCCGTCCT GCTGACCATG CAGACCAACC AATCCCGCAT TCCCATCGCG
ATGGCCGCAC GCAACACCGT GTGGCGCGAC GGCGAGCCCG TCCCGGCCGC CTTCGCGCTG
TTCGATCAGG GCGCCGAGAT CGGTCACGAC ATAGCGGTCC ACCTGTCGGC CGGGGATGCG
GTGACGCTCG AGAAGGTCGT CACCGTCTAC ACCGGGCGCG ACGTCGCGAT CTCCGAACCC
GGCGTGAATG CGCAGCGCTG GGTCACCCGG CTCGGCCGAT TCGACGAGTT GCTGGACGGA
CATCGAACCG CCTGGACGCA CCTGTGGGAA CGGCTGTCGA TCGATTTCGA CGATTTCACC
GACGAGTTGC GGATCCTGCG GCTGCATCTG CTGCATCTGC TGCAGACGGT GTCACTCAAC
ACCGCTGACC TCGATGCCGG GGTACCTGCA CGCGGACTGC ACGGTGAGGC GTATCGCGGG
CACATCTTCT GGGACGAACT GTTCATCTTC CCCGTCCTCA ACCTGCGGGC GCCGATGATC
ACCCGGTCCC TGCTGGGTTA CCGCTACCGT CGTCTGCCCG AGGCGCGGCA CGCCGCCCGC
GCGGCGGGCC ATTCCGGTGC GATGTTTCCC TGGCAGTCCG GCAGCGACGG GCGCGAGGAA
AGTCAACGGC TGCACCTCAA TCCGCGCAGC GGCCGCTGGA ACCCCGACGC CAGCGCCCGC
GCGCACCACA TCGGCATCGC CGTCGCCTAC AGTGCGTGGA AGTTCTACCA GGTCACCGGC
GACCTCGCGT ACCTGATCGA CTACGGCGCG GAACTGATCG CCGAGGTCGC ACGATTCTTC
GTCAGCCTGG CCAGCTACGA CGAAGACCGG GAACGGTTTG GGATCAAAGG CGTCATCGGC
CCCGACGAAT TCCACTCCGG TTACCCCGAA GCTCCCTATG ACGGCATCGA CAACAACGCG
TACACGAACG TCATGGCGGT ATGGGTGATC ATGCGCGCGA TCGACGCACT GAACCTGCTC
CCCCTGCCCA ACCGCCTCGA CCTGCTGGAG GCACTCGGGC TGCACAACGA GGAGCTGGCC
CACTGGGACG ACGTCAGCAG GCGGATGTAC GTCCCGTTCC ACGACGGCGT CATCAGCCAG
TTCGAGGGCT ACGGGGATCT GGCGGATCTC GACTGGGATC GTCTTCGCCG GCAGTACGGC
AACATCCAGC GTCTGGACCG CATCCTGGAG GCGGAGAACG ACGACGTGAA CCGCTACAAG
GCGTCGAAGC AGGCCGACGC GCTGATGCTG CTCTATCTAC TCTCGGCCGA CGAGTTGCGG
GAGATACTCG ACCGACTCGG CTACCGTTTC CTGCCCGAGC AGGTGCCGAA GATGGTGGAC
TACTACCTGG CCCGCACATC ACACGGGTCT ACGCTCAGCG GTGTCGTCCA CACCTGGGTA
CTCGCCAGGG CCAACCGCAA TCGCGCCCTG GAGTTCTTCC AGCAGGCGTT GAAGTCGGAC
GTCTCCGACA TCCAGGGCGG CACCACCTCC GAAGGCATCC ACCTGGCGGC CATGGCGGGC
ACCGTCGACC TAATGCAGCG CTGCTTCACC GGGCTGGAGA CTCGTTCCAA CCGCATCATC
CTGTCCCCGT ACTGGCCGGA ATCCCTTGGC GTGCTGGCGA TACCGATCCA CTATCGCGGC
CTGCACCTGC ACCTGAGGGT CAGCGGAAAA GGCGTGATCA TCAGCGTGGA TCCACGAGAC
GCCGCTGGAA TCGAGGTGGA GTGCCGCGGT CAGGTCGTTC AACTGATGCC GGGAACCACC
GTCCGGTTCC CGGGCTGA
 
Protein sequence
MKLPVTIDPR YHDAVLLDLD GALTSEAPLF GPTVDLARKL QSIGVAAAAY SSNGQCRQAL 
KAAGIDDLFG VCIDGMEGDR GTAEKPDPAV LLEAARRIGA RPQRCVVVEN SAAGVAAGRA
GGFALVVGID GTEVADELTR HGADVVLAGL ADIAVRTGDR RISELPNALA SYGQLIGITS
ARESMLFLDY DGTLSPIVSD PAAARLVEGA DEALELVSKV CPVAILSGRD LADVSARVGT
PGVWYAGSHG FELTSPDGAY HCNDAAAVFV PVLEEAAAEL NKTLAQIAGV RVEHKRFAVA
VHYREVAPEQ VSEIVSATHQ LGARRGLRVT SGRMLVELRP DLDWDKGTTL AWIRDRIDPS
GSLLPIYIGD DLTDEDAFDA VRFDGIGIVV GHDEDGDRKT AANFTLQSPE QVREFIQRGS
RWLAYKHQVS GEAWDYVFDG YDPQNEKLRE ALCTVGNGYF ATRGAAPESK AGQVHYPGTY
AAGVYNRLVD NVSGTEIDNE SLVNLPNWLA LTFRVDHGDW FDIDAVTVLS YRQTLDLRGA
VLTRQVRFRD HAGRTSSLTQ RRFVAMHLPH AGALETTVVA EDWSGTIEFR STVDGNVTNS
LVERYRDLAN EHLDSADTQE LSNNSVLLTM QTNQSRIPIA MAARNTVWRD GEPVPAAFAL
FDQGAEIGHD IAVHLSAGDA VTLEKVVTVY TGRDVAISEP GVNAQRWVTR LGRFDELLDG
HRTAWTHLWE RLSIDFDDFT DELRILRLHL LHLLQTVSLN TADLDAGVPA RGLHGEAYRG
HIFWDELFIF PVLNLRAPMI TRSLLGYRYR RLPEARHAAR AAGHSGAMFP WQSGSDGREE
SQRLHLNPRS GRWNPDASAR AHHIGIAVAY SAWKFYQVTG DLAYLIDYGA ELIAEVARFF
VSLASYDEDR ERFGIKGVIG PDEFHSGYPE APYDGIDNNA YTNVMAVWVI MRAIDALNLL
PLPNRLDLLE ALGLHNEELA HWDDVSRRMY VPFHDGVISQ FEGYGDLADL DWDRLRRQYG
NIQRLDRILE AENDDVNRYK ASKQADALML LYLLSADELR EILDRLGYRF LPEQVPKMVD
YYLARTSHGS TLSGVVHTWV LARANRNRAL EFFQQALKSD VSDIQGGTTS EGIHLAAMAG
TVDLMQRCFT GLETRSNRII LSPYWPESLG VLAIPIHYRG LHLHLRVSGK GVIISVDPRD
AAGIEVECRG QVVQLMPGTT VRFPG
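
As a spot check that the protein sequence corresponds to the gene sequence, the first and last codons of the gene can be translated by hand. The sketch below builds the codon table from the standard NCBI base ordering; translation table 11 assigns the same amino acids to codons as the standard table and differs only in the permitted start codons, which this sketch does not model. The `translate` helper is illustrative and is not taken from any particular library.

```python
from itertools import product

# Codon -> amino acid map in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace is ignored so that 10-base blocks can be pasted directly.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases and last 18 bases of the gene sequence.
    print(translate("ATGAAGCTCC CTGTCACCAT CGATCCGCGG"))  # MKLPVTIDPR
    print(translate("GTCCGGTTCC CGGGCTGA"))               # VRFPG
```

The two outputs agree with the first ten and last five residues of the protein sequence, and the final codon TGA is a stop codon.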