Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mvan_4629 |
Symbol | |
ID | 4646646 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium vanbaalenii PYR-1 |
Kingdom | Bacteria |
Replicon accession | NC_008726 |
Strand | + |
Start bp | 4967651 |
End bp | 4969141 |
Gene Length | 1491 bp |
Protein Length | 496 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639808099 |
Product | sulfatase |
Protein accession | YP_955410 |
Protein GI | 120405581 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.284621 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCGACA AGCCGAACAT CCTCATCATC TGGGGCGACG ACATCGGCCA GAGCAACCTG AGCTGCTACA GCGACGGGCT GATGGGATAC CGCACGGCCA ACATCGACCG GATCGCCGCC GAAGGTGCAC GCTTCACCGA CTACTACGGC GAGCAGAGCT GCACAGCGGG CCGGGCCGCG TTCATCACCG GCCAGAACCC CTATCGGACC GGACTGACCA AGGTCGGTAT GCCCGGTGCG CCCATCGGGC TGCAGGCCGA GGACCCGACG ATCGCGACCG CACTCAAGGC GCAGGGGTAC GCCACCGGCC AGTTCGGCAA GAACCACCTG GGCGACCGCG ACGAATTCCT GCCCACCATG CACGGATTCG ACGAATTCTT CGGCAACCTG TACCACCTCA ACGCCGAAGA GGAACCGGAG AACGTCGACT ATCCGAAGGA CCCGGAGTTC CGCAAGAAGT TCGGACCCCG CGGCGTCATC CGGGCGTGGG CCAACGGCGA CGGCACCCAG CGCATCGAGG ACACCGGCCC GCTGACCAGG AAGCGCATGG AGACCTGCGA CGAAGAATTC CGCGACGCCG CAATCGATTT CATCAAGCGC CAGCACGAGG CCGACACCCC GTTCTTCGTG TGGTTCAACT CCACCCACAT GCACTTCCGG ACCCACCCCA AGCCGGAGAG CATCGGCCGG TCGGGGCGTT GGCAGTCGGA GTACCACGAC ACGATGCTCG ACCACGACGA CATCGTCGGC GACCTGCTGA ACACCCTCGA CGAGCTCGGC ATCGCCGATG GCACGATCGT CATGTACTCC ACCGACAACG GCCCGCACAT GAACAGCTGG CCGGACGCGG GCATGACGCC GTTCCGCAAC GAGAAGAACT CGAACTGGGA AGGCGCGTAC CGGGTTCCGG CAGTGGTGCG CTGGCCGGGA CGGATTCCGG CCGGCACCGT ACTGAACGGC ATCGTCAGCC ACAACGACTG GTTCGTCACC CTGCTGGCCG CGGCGGGAGA TCCCGACATC GCCGAAAGAC TGCGCGCCGG AACCGATCTC AACGGCACCA CCTACAAGGT GCACCTCGAC GGGCACAATC AGCTTGACTA TCTCACCGGA GCGACCGACG AAAGCCCGCG GCAGTATTTC TTCTACGTCT CCGACGACGG CGACCTGACG GCCCTGCGAT ACGACAACTG GAAGATCGTC TTCCTGGAGC AGCGGGCGGC GGGCACGTTG CAGGTGTGGC TCGAGCCCTA CACCGAGTTG CGCGCACCGA AGCTGTTCAA CCTGCGCACC GACCCGTACG AGCGGGCCGA CATCACGTCG AACACGTACT TCGACTGGGT ACTCGACCGG GTTTTCGTCT TCACTCCGGC GCAGGCGTTC GTCGCACAGA TGCTGCAGAC CCTCGTCGAG TTCCCGCAAC GTCAGGCTTC TGCGAGCTTC AACCTCGAGC AGGTGATGGC GAAGCTGCAG GCCGGCATCC CCGATTCCTG A
|
Protein sequence | MGDKPNILII WGDDIGQSNL SCYSDGLMGY RTANIDRIAA EGARFTDYYG EQSCTAGRAA FITGQNPYRT GLTKVGMPGA PIGLQAEDPT IATALKAQGY ATGQFGKNHL GDRDEFLPTM HGFDEFFGNL YHLNAEEEPE NVDYPKDPEF RKKFGPRGVI RAWANGDGTQ RIEDTGPLTR KRMETCDEEF RDAAIDFIKR QHEADTPFFV WFNSTHMHFR THPKPESIGR SGRWQSEYHD TMLDHDDIVG DLLNTLDELG IADGTIVMYS TDNGPHMNSW PDAGMTPFRN EKNSNWEGAY RVPAVVRWPG RIPAGTVLNG IVSHNDWFVT LLAAAGDPDI AERLRAGTDL NGTTYKVHLD GHNQLDYLTG ATDESPRQYF FYVSDDGDLT ALRYDNWKIV FLEQRAAGTL QVWLEPYTEL RAPKLFNLRT DPYERADITS NTYFDWVLDR VFVFTPAQAF VAQMLQTLVE FPQRQASASF NLEQVMAKLQ AGIPDS
|
| |