Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mvan_4107 |
Symbol | |
ID | 4648866 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium vanbaalenii PYR-1 |
Kingdom | Bacteria |
Replicon accession | NC_008726 |
Strand | + |
Start bp | 4402566 |
End bp | 4406243 |
Gene Length | 3678 bp |
Protein Length | 1225 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639807574 |
Product | HAD family hydrolase |
Protein accession | YP_954890 |
Protein GI | 120405061 |
COG category | [G] Carbohydrate transport and metabolism [R] General function prediction only |
COG ID | [COG0637] Predicted phosphatase/phosphohexomutase [COG1554] Trehalose and maltose hydrolases (possible phosphorylases) [COG1877] Trehalose-6-phosphatase |
TIGRFAM ID | [TIGR00685] trehalose-phosphatase [TIGR01484] HAD-superfamily hydrolase, subfamily IIB [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.130759 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.249958 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCTCC CTGTCACCAT CGATCCGCGG TACCACGACG CGGTGCTGCT CGACCTCGAC GGCGCGCTGA CCAGTGAGGC CCCGCTCTTC GGGCCTACGG TCGATCTGGC CAGGAAGTTG CAGAGCATCG GGGTCGCCGC GGCGGCCTAC TCGTCGAATG GACAGTGCAG GCAGGCGCTG AAGGCCGCCG GCATCGACGA CCTGTTCGGC GTCTGCATCG ACGGAATGGA GGGCGACCGC GGAACGGCCG AGAAGCCCGA TCCCGCAGTG CTTCTGGAGG CCGCCCGGCG GATCGGAGCG CGACCGCAGC GATGCGTCGT GGTCGAGAAC TCCGCGGCAG GCGTGGCGGC CGGCCGTGCC GGCGGTTTCG CGCTCGTGGT CGGCATCGAC GGTACGGAGG TTGCCGACGA GTTGACCCGC CACGGTGCCG ACGTCGTGCT CGCCGGTCTC GCCGACATCG CCGTCCGCAC GGGTGACAGG CGGATCTCAG AACTGCCGAA CGCGCTGGCT TCCTACGGCC AGTTGATCGG CATCACCAGC GCGCGCGAGT CCATGCTGTT CCTGGACTAC GACGGCACGC TGTCCCCCAT CGTCTCGGAC CCCGCCGCGG CCAGGCTCGT CGAGGGTGCC GACGAGGCGC TCGAACTCGT CTCCAAGGTG TGCCCGGTGG CCATCCTGAG CGGGCGCGAC CTCGCCGACG TCAGCGCCCG GGTCGGCACC CCCGGCGTCT GGTATGCAGG CAGTCACGGC TTCGAGTTGA CCTCACCGGA CGGCGCTTAT CACTGCAACG ATGCCGCTGC CGTGTTCGTG CCGGTCCTGG AGGAGGCCGC CGCCGAGTTG AACAAGACCC TGGCCCAGAT CGCAGGAGTG CGCGTCGAGC ACAAGCGCTT TGCCGTCGCG GTGCACTACC GCGAGGTGGC ACCGGAACAG GTCAGCGAGA TCGTCTCGGC CACACATCAA CTCGGTGCGC GTCGCGGCCT GCGGGTGACC AGTGGGCGGA TGCTCGTCGA GCTGCGGCCC GACCTCGACT GGGACAAGGG CACCACGCTG GCCTGGATCC GCGACCGCAT CGACCCGTCC GGCTCACTGC TGCCGATCTA CATCGGCGAC GACCTGACCG ACGAGGACGC CTTCGATGCG GTCCGGTTCG ACGGCATCGG GATCGTCGTC GGGCATGACG AGGACGGCGA CCGCAAGACC GCCGCGAACT TCACCCTCCA AAGTCCGGAG CAGGTGCGTG AGTTCATCCA ACGCGGATCG CGGTGGCTGG CCTACAAACA TCAGGTCTCG GGTGAGGCGT GGGATTACGT CTTCGACGGA TACGACCCGC AGAACGAGAA GCTCCGCGAG GCGTTGTGCA CCGTGGGCAA CGGCTATTTC GCCACCCGCG GTGCCGCGCC CGAGTCGAAG GCCGGGCAGG TGCACTACCC GGGCACCTAT GCGGCCGGGG TGTACAACCG CCTCGTCGAC AACGTGTCGG GAACCGAGAT CGACAACGAG AGTCTGGTCA ACCTGCCCAA CTGGCTCGCG TTGACCTTCC GTGTCGACCA CGGCGACTGG TTCGACATCG ACGCGGTCAC CGTGCTGTCC TACCGGCAGA CGCTCGACCT TCGGGGAGCG GTGCTGACCC GGCAGGTCCG CTTCCGCGAC CACGCTGGGC GGACGAGCTC GTTGACGCAA CGACGGTTCG TGGCCATGCA CCTGCCCCAC GCCGGCGCTC TCGAGACGAC GGTCGTCGCG GAAGACTGGT CAGGGACGAT CGAATTCCGC TCGACCGTCG ACGGCAACGT GACGAACTCG CTGGTCGAGC GTTACCGCGA CCTGGCCAAC GAGCATCTCG ATTCGGCTGA CACGCAGGAA CTCTCGAACA ACTCCGTCCT GCTGACCATG CAGACCAACC AATCCCGCAT TCCCATCGCG ATGGCCGCAC GCAACACCGT GTGGCGCGAC GGCGAGCCCG TCCCGGCCGC CTTCGCGCTG TTCGATCAGG GCGCCGAGAT CGGTCACGAC ATAGCGGTCC ACCTGTCGGC CGGGGATGCG GTGACGCTCG AGAAGGTCGT CACCGTCTAC ACCGGGCGCG ACGTCGCGAT CTCCGAACCC GGCGTGAATG CGCAGCGCTG GGTCACCCGG CTCGGCCGAT TCGACGAGTT GCTGGACGGA CATCGAACCG CCTGGACGCA CCTGTGGGAA CGGCTGTCGA TCGATTTCGA CGATTTCACC GACGAGTTGC GGATCCTGCG GCTGCATCTG CTGCATCTGC TGCAGACGGT GTCACTCAAC ACCGCTGACC TCGATGCCGG GGTACCTGCA CGCGGACTGC ACGGTGAGGC GTATCGCGGG CACATCTTCT GGGACGAACT GTTCATCTTC CCCGTCCTCA ACCTGCGGGC GCCGATGATC ACCCGGTCCC TGCTGGGTTA CCGCTACCGT CGTCTGCCCG AGGCGCGGCA CGCCGCCCGC GCGGCGGGCC ATTCCGGTGC GATGTTTCCC TGGCAGTCCG GCAGCGACGG GCGCGAGGAA AGTCAACGGC TGCACCTCAA TCCGCGCAGC GGCCGCTGGA ACCCCGACGC CAGCGCCCGC GCGCACCACA TCGGCATCGC CGTCGCCTAC AGTGCGTGGA AGTTCTACCA GGTCACCGGC GACCTCGCGT ACCTGATCGA CTACGGCGCG GAACTGATCG CCGAGGTCGC ACGATTCTTC GTCAGCCTGG CCAGCTACGA CGAAGACCGG GAACGGTTTG GGATCAAAGG CGTCATCGGC CCCGACGAAT TCCACTCCGG TTACCCCGAA GCTCCCTATG ACGGCATCGA CAACAACGCG TACACGAACG TCATGGCGGT ATGGGTGATC ATGCGCGCGA TCGACGCACT GAACCTGCTC CCCCTGCCCA ACCGCCTCGA CCTGCTGGAG GCACTCGGGC TGCACAACGA GGAGCTGGCC CACTGGGACG ACGTCAGCAG GCGGATGTAC GTCCCGTTCC ACGACGGCGT CATCAGCCAG TTCGAGGGCT ACGGGGATCT GGCGGATCTC GACTGGGATC GTCTTCGCCG GCAGTACGGC AACATCCAGC GTCTGGACCG CATCCTGGAG GCGGAGAACG ACGACGTGAA CCGCTACAAG GCGTCGAAGC AGGCCGACGC GCTGATGCTG CTCTATCTAC TCTCGGCCGA CGAGTTGCGG GAGATACTCG ACCGACTCGG CTACCGTTTC CTGCCCGAGC AGGTGCCGAA GATGGTGGAC TACTACCTGG CCCGCACATC ACACGGGTCT ACGCTCAGCG GTGTCGTCCA CACCTGGGTA CTCGCCAGGG CCAACCGCAA TCGCGCCCTG GAGTTCTTCC AGCAGGCGTT GAAGTCGGAC GTCTCCGACA TCCAGGGCGG CACCACCTCC GAAGGCATCC ACCTGGCGGC CATGGCGGGC ACCGTCGACC TAATGCAGCG CTGCTTCACC GGGCTGGAGA CTCGTTCCAA CCGCATCATC CTGTCCCCGT ACTGGCCGGA ATCCCTTGGC GTGCTGGCGA TACCGATCCA CTATCGCGGC CTGCACCTGC ACCTGAGGGT CAGCGGAAAA GGCGTGATCA TCAGCGTGGA TCCACGAGAC GCCGCTGGAA TCGAGGTGGA GTGCCGCGGT CAGGTCGTTC AACTGATGCC GGGAACCACC GTCCGGTTCC CGGGCTGA
|
Protein sequence | MKLPVTIDPR YHDAVLLDLD GALTSEAPLF GPTVDLARKL QSIGVAAAAY SSNGQCRQAL KAAGIDDLFG VCIDGMEGDR GTAEKPDPAV LLEAARRIGA RPQRCVVVEN SAAGVAAGRA GGFALVVGID GTEVADELTR HGADVVLAGL ADIAVRTGDR RISELPNALA SYGQLIGITS ARESMLFLDY DGTLSPIVSD PAAARLVEGA DEALELVSKV CPVAILSGRD LADVSARVGT PGVWYAGSHG FELTSPDGAY HCNDAAAVFV PVLEEAAAEL NKTLAQIAGV RVEHKRFAVA VHYREVAPEQ VSEIVSATHQ LGARRGLRVT SGRMLVELRP DLDWDKGTTL AWIRDRIDPS GSLLPIYIGD DLTDEDAFDA VRFDGIGIVV GHDEDGDRKT AANFTLQSPE QVREFIQRGS RWLAYKHQVS GEAWDYVFDG YDPQNEKLRE ALCTVGNGYF ATRGAAPESK AGQVHYPGTY AAGVYNRLVD NVSGTEIDNE SLVNLPNWLA LTFRVDHGDW FDIDAVTVLS YRQTLDLRGA VLTRQVRFRD HAGRTSSLTQ RRFVAMHLPH AGALETTVVA EDWSGTIEFR STVDGNVTNS LVERYRDLAN EHLDSADTQE LSNNSVLLTM QTNQSRIPIA MAARNTVWRD GEPVPAAFAL FDQGAEIGHD IAVHLSAGDA VTLEKVVTVY TGRDVAISEP GVNAQRWVTR LGRFDELLDG HRTAWTHLWE RLSIDFDDFT DELRILRLHL LHLLQTVSLN TADLDAGVPA RGLHGEAYRG HIFWDELFIF PVLNLRAPMI TRSLLGYRYR RLPEARHAAR AAGHSGAMFP WQSGSDGREE SQRLHLNPRS GRWNPDASAR AHHIGIAVAY SAWKFYQVTG DLAYLIDYGA ELIAEVARFF VSLASYDEDR ERFGIKGVIG PDEFHSGYPE APYDGIDNNA YTNVMAVWVI MRAIDALNLL PLPNRLDLLE ALGLHNEELA HWDDVSRRMY VPFHDGVISQ FEGYGDLADL DWDRLRRQYG NIQRLDRILE AENDDVNRYK ASKQADALML LYLLSADELR EILDRLGYRF LPEQVPKMVD YYLARTSHGS TLSGVVHTWV LARANRNRAL EFFQQALKSD VSDIQGGTTS EGIHLAAMAG TVDLMQRCFT GLETRSNRII LSPYWPESLG VLAIPIHYRG LHLHLRVSGK GVIISVDPRD AAGIEVECRG QVVQLMPGTT VRFPG
|
| |