Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmcs_3407 |
Symbol | |
ID | 4112239 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. MCS |
Kingdom | Bacteria |
Replicon accession | NC_008146 |
Strand | + |
Start bp | 3613581 |
End bp | 3617228 |
Gene Length | 3648 bp |
Protein Length | 1215 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 638032540 |
Product | HAD family hydrolase |
Protein accession | YP_640570 |
Protein GI | 108800373 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1554] Trehalose and maltose hydrolases (possible phosphorylases) [COG1877] Trehalose-6-phosphatase |
TIGRFAM ID | [TIGR00685] trehalose-phosphatase [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type [TIGR01484] HAD-superfamily hydrolase, subfamily IIB |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.377185 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGTGC CGGTCGTCAT CGATCCCCGC TACCACGACG CGGTGATCTT CGACCTCGAC GGCGTGCTCA GCGACCCGGC GCCGACGGTG ACGCTGGCGC GCGAACTGCA GGACGTCGGC GTCAAGACCG CGGCCTGTTC GTCGAACAGT CACGGCCGGG ACATGGTGAA AAGCATTGGG GCCGAGGGTT TCGATGTGTG CGTCGACGGC TCCGCCGAAG ACCGGGCCGG CGCCCCGTGT CTGGAGGCAG CGCGCCGACT CGGTGTGCGC CCGCAGCGCG CCGTGCTCAT CGAGGATTCG GCCGCGGGCC GGACACCGGG CCGCGACGGT GGGTTCGCCC TCGTGATCCG GGTCGACCGG ACCCGGCACC CCGACGAACC GATCGACGGC GATGCCGACC GGGTCGTCGC GGGTCTGGCC GGTATCGCGG TCCGCACCGG TGACAGGCGC GTCTCCGAGG TCCCCAACGC GTTGGAGTCC TACGGACAGC TCATCGGTAT CACCGGCGCC CGCGAGTCGG TGTTGTTCCT GGACTACGAC GGAACCCTGT CCCCCATCGT CTCCGAACCG GACGCCGCGG TGCTGGTCGA CGGCGCGGCC GAGGCGCTGG CGCTCGTCGC CGCGGTGTGC CCGGTCGCGA TCCTGAGCGG CCGCGACCTC GCGGATGTCC GCACCCGCGT GGGCGCACCG GGGCTGTGGT ACGCCGGCAG CCACGGATTC GAGCTGACCG GACCGGACGG CACCTACCAC CAGAACGAGG CGGCCGCCGC GTTCGTGCCC GTCCTCGAAC GGGCCGCCGG GGATCTGCGC GACCTGCTGG GGCACGTTCC CGGAATCTTC GTCGAGCACA AGCGCTTCGC CGTCGCGGTC CACTACCGGG AGGTCGGGTC CTCCGACGAG GTCGCCGAGA TCGTCTCGAC CACACACAGA CTCGGCAGGC AGGCCGGTCT CCGGGTGACG AGCGGCCGCA TGCTCGTCGA ACTGCGACCC GACATCGACT GGGACAAGGG CACCACCCTG AGGTGGATCC GGGACCGCAT CGACGCGGCC GGTTCGCTGA TGCCCATCTA CATCGGTGAC GACCTCACCG ACGAAGACGC GTTCGATGCG GTCCGCTTCG ACGGCGTCGG CATCGTCGTG CGCCACGACG AGGACAGCGA CCGCAAGACG GCGGCCCGAT TCAGCCTGCA GTCACCCGAC CAGGTGCGCG AATTCCTCCA ACGGGGGTCG CAGTGGCTGG CCTACAAGCA CCAGGTCGCA AGCGAGGCCT GGGATCTCAC CTACGAGGGG TACGACCCGC AGAGCGAGAA GCTGCGCGAG GCGTTGTGCA CCGTCGGGAA CGGTTACTTC GCCACGAGGG GCGCGGCACC GGAATCGAAG GCCGGCCAGG TGCACTACCC CGGCACCTAC GCCGCGGGCA TCTTCAACCG CCTCGTCGAC GACGTATCGG GTACCGCCGT CGACAACGAG AGCCTGGTGA ACCTGCCGGA CTGGCTCTCG CTGACCTTCC GGATCGACGG CGGTGACTGG TTCGACATCG ACGAGGTCGA AGTGCTCTCC TACCGCCAGA CTCTCGACCT GCGCGGGGCG ACGCTGACGC GGGAGGTCCG CTTTCGCGAC GACGCCGGGC GCACCACTTC GGTTACGCAG CAACGTTTTG TCGCGATGCA CCTGCCGCAC GTCGGCGCTC TGCAGACCAA GATCGTCGCC GAGGACTGGT CGGGCCGGAT CACGATCCGC TCGACGCTCG ACGGGAACGT GACGAACTCG CTGGTCGAGC GGTACCGCGA CCTCGGTAAA GAGCACCTCG AACTGATCGA GAAGCGGCAG CTCAGCGACG ATTCGGTGTT GTTGACGGTC CGCACGACCC AGTCGGGCAT TCCGATCGCC ATGGCGGCCC GCTGCATCGT CTGGCGTGAC GACGCCCCGG TCGGGGCGTC CTACCGACTC GTCGGGGACG GCGCCGAGAT CGGTCACGAG ATCACCGTCG AGCAGTCGGT CGGCGAGGCG CTGACGGTGG AGAAGCTGGT CACCCTCTTC ACCGGCCGCG ACGTCGCGAC CTCCGATCCC GCGGTGGACG CCGAGCGGTG GGTGGCGCGA CTCGGCCGGT ACGCCGAGGT ACGCGAGGGA CACCTCACCG ACTGGACACA CCTGTGGGAA CGCCTGTCCA TCGAATTCGA CGATTTCACC GACGAAGTGC GCATCCTGCG GCTGCATCTG CTGCATCTGC TGCAGACCGT CTCCCCCAAC AGCGCCGACC TCGACGTGGG TGTGCCCGCG CGTGGCCTGC ACGGTGAGGC ATACCGCGGC CACATCTTCT GGGATGAGCT GTTCATCTTC CCGGTGCTCA ATCTGCGGCT GCCCATGATC ACCCGATCGC TGCTGGCGTA CCGCTACCGG CGGCTGCCGG AGGCCCGGCA CGCCGCCAGG GAGGCCGGTT ACGCCGGCGC GATGTTCCCC TGGCAGTCCG GAAGCGACGG CCGCGAGGAA AGCCAACGGC TGCACCTGAA TCCGCGCAGC GGCAACTGGA ATCCCGATGC GAGCGCTCGC GCGCACCACA TCGGTATCGC CGTGGCCTAC AGCGCGTGGA AGTTCTACCA GGCCACCGGG GATCTCGCCT ACCTGATCGA CTACGGCGCC GAGATGCTGG CCGAGGTCGC GCGCTTCTGG GTGAGTCTGG CCAGCTACGA CGAAGACCGC GGCCGCTACA GCATCCGCGG GGTCATCGGG CCCGACGAAT TCCACTCCGG CTACCCCGAC GCACCCTACG ACGGCATCGA CAACAACGCG TACACGAACG TGATGGCGGT GTGGGTGATC ATGCGGGCGC TCGACGCGTT GGACCTGCTT CCGCTGCCGA ACCGGCTCGA CCTGCTGGAG ACGCTCGGGC TGACCAGCGG TGAACTGGCG CACTGGGACG ACGTGAGTCG CCGGATGTAC GTGCCGTTCC ACGACGGTGT CATCAGCCAG TTCGAGGGCT ACGGCGAGCT CGAGGAGCTG GACTGGGACC GCTACCGCGC CCACTACGGC AACATCCAGC GCCTCGATCG CATCCTCGAG GCGGAGAACG ACGACGTGAA CCGGTACAAG GCGTCCAAAC AGGCCGACGT GCTGATGTTG CTGTACCTGA TGTCGGTGCC CGAACTCGGC GAGATCCTGA ACCGGCTCGG TTATCACTTC CCCGCGGACC AGGTTCCGAG CATGGTCGAC TACTACCTGG CCCGCACGTC GCACGGGTCC ACGCTCAGCG GAGTCGTGCA CACCTGGGTC CTGGCGCGCG CCAACCGCGA CCGTGCGATG GAGTTCTTCG AACTGGCGCT CAAGTCCGAC GTCTCCGACA TCCAGGGCGG CACCACGTCC GAGGGCATCC ACCTGGCCGC CATGGCCGGC ACCGTCGATC TGATGCAACG CTGCTTCACC GGATTGGAGA CCCGGTCCAA CCGCCTCATC CTTTCCCCGT ACTGGCCGGA AAGCCTTGGG GTGCTGGCGG TCCCGATCCA CTACCGGGGC CTGCACCTGC ACCTGCGGGT CAGCGGTAAG GGTGTGATCA TCAGCGTCGA CCCACGCGAG GCCGCCGGGG TGGTGGTGGA ATGCCGGGGG CGGGTGGTGC AGCTGATGCC GGGCACCACC GTCCGCTTCC CCGGCTGA
|
Protein sequence | MTVPVVIDPR YHDAVIFDLD GVLSDPAPTV TLARELQDVG VKTAACSSNS HGRDMVKSIG AEGFDVCVDG SAEDRAGAPC LEAARRLGVR PQRAVLIEDS AAGRTPGRDG GFALVIRVDR TRHPDEPIDG DADRVVAGLA GIAVRTGDRR VSEVPNALES YGQLIGITGA RESVLFLDYD GTLSPIVSEP DAAVLVDGAA EALALVAAVC PVAILSGRDL ADVRTRVGAP GLWYAGSHGF ELTGPDGTYH QNEAAAAFVP VLERAAGDLR DLLGHVPGIF VEHKRFAVAV HYREVGSSDE VAEIVSTTHR LGRQAGLRVT SGRMLVELRP DIDWDKGTTL RWIRDRIDAA GSLMPIYIGD DLTDEDAFDA VRFDGVGIVV RHDEDSDRKT AARFSLQSPD QVREFLQRGS QWLAYKHQVA SEAWDLTYEG YDPQSEKLRE ALCTVGNGYF ATRGAAPESK AGQVHYPGTY AAGIFNRLVD DVSGTAVDNE SLVNLPDWLS LTFRIDGGDW FDIDEVEVLS YRQTLDLRGA TLTREVRFRD DAGRTTSVTQ QRFVAMHLPH VGALQTKIVA EDWSGRITIR STLDGNVTNS LVERYRDLGK EHLELIEKRQ LSDDSVLLTV RTTQSGIPIA MAARCIVWRD DAPVGASYRL VGDGAEIGHE ITVEQSVGEA LTVEKLVTLF TGRDVATSDP AVDAERWVAR LGRYAEVREG HLTDWTHLWE RLSIEFDDFT DEVRILRLHL LHLLQTVSPN SADLDVGVPA RGLHGEAYRG HIFWDELFIF PVLNLRLPMI TRSLLAYRYR RLPEARHAAR EAGYAGAMFP WQSGSDGREE SQRLHLNPRS GNWNPDASAR AHHIGIAVAY SAWKFYQATG DLAYLIDYGA EMLAEVARFW VSLASYDEDR GRYSIRGVIG PDEFHSGYPD APYDGIDNNA YTNVMAVWVI MRALDALDLL PLPNRLDLLE TLGLTSGELA HWDDVSRRMY VPFHDGVISQ FEGYGELEEL DWDRYRAHYG NIQRLDRILE AENDDVNRYK ASKQADVLML LYLMSVPELG EILNRLGYHF PADQVPSMVD YYLARTSHGS TLSGVVHTWV LARANRDRAM EFFELALKSD VSDIQGGTTS EGIHLAAMAG TVDLMQRCFT GLETRSNRLI LSPYWPESLG VLAVPIHYRG LHLHLRVSGK GVIISVDPRE AAGVVVECRG RVVQLMPGTT VRFPG
|
| |