Gene Information |
Locus tag | Mvan_4274 |
Symbol | |
ID | 4648324 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium vanbaalenii PYR-1 |
Kingdom | Bacteria |
Replicon accession | NC_008726 |
Strand | + |
Start bp | 4583446 |
End bp | 4585269 |
Gene Length | 1824 bp |
Protein Length | 607 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639807741 |
Product | sulfatase |
Protein accession | YP_955057 |
Protein GI | 120405228 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
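The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes the terminal stop codon; the function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a gene length that
    includes the terminal stop codon (both are assumptions, not
    statements made by the record itself).
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and encodes no amino acid.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


gene_len, protein_len = cds_lengths(4583446, 4585269)
print(gene_len, protein_len)  # 1824 607
```

Under these assumptions the result agrees with the Gene Length (1824 bp) and Protein Length (607 aa) fields listed for Mvan_4274.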
| |
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.374649 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCACCG AACGGCCCGA CATCGTCGTC GTCATGACCG ACGAGGAGCG CGCGACACCT CCCTACGAAC CCGACACCGT TCGCGCATGG CGCAGTCGGA CACTGGGCGG CAGAAGGTGG TTCGACGAGA ACGGCGTCAG CTTCCTACGC CACTACACAG GCTCACTGGC CTGCGTGCCG AGTCGCCCGA CCATCTTCAC CGGCCAGTAC CCCGACCTGC ACGGGGTGAC CCAGACCGAC GGCATCGGCA AGGCACACGA CGATTCCCGG CTGCGCTGGC TGCGCCGCGG CGAGGTGCCC ACCCTGGGCA ACTGGTTGCG CGCCGCCGGC TACGACACCC ACTACGACGG CAAGTGGCAC ATCTCGCACG CCGATCTCAT CGACCCCGGC ACCGGCCGGT CGCTCGACAC CAACGACGAC GACGGCGTCG TCGACCCGGC CGCGGTGCAC AGGTATCTGG AGGCGGATCC GCTGTCCCCA TACGGCTTTT CGGGTTGGGT CGGCCCGGAA CCGCACGGCG CCAAGCTTTC CAACGCCGGG ATCCGCCGTG ACCCGCTGAT CGCCGACCGC GTGGTCGCCT GGCTGAAAGA CCGCTACGCG CGGCGACGCG CGGGTGACCC CGACGCGATG CGGCCGTTCC TGCTGGTGGC CAGCTTCGTC AATCCCCATG ACATCGTGCT GTTTCCGGCC TGGGCGCGCC GCAACCCGCT GCCCGCCTCA CCGCTGGACC CTCCCCCGGT TCCTGCCGCA CCGACCGCGG ACGAGGACCT GTCCACCAAA CCCGCCGCGC AGATCGCGTT CCGCGAGGCG TACTACTCCG GATACGGCCC GGCATGGTCG ATCGAGCGGA CCTACCGGCG CAACGCCCAG CGCTACCGCG ACCTGTACTA CCGACTGCAC GCCGAGGTCG ACACCCCCAT CGACCGGGTG CGCCGCGCGG TCACCGAGGG TGGATCCGGC GACGGACCTG ACGACACCGT GCTGGTGCGC ACCGCTGACC ACGGCGACCT GCTCGGCGCC CACGGCGGGT TGCACCAGAA ATGGTTCAAC CTCTACGACG AGGCCACCCG GGTCCCGTTC GTCATCGCCA GGGTGGGCGC CCGTCCGACC ACGGCGCGCA CCGTGACGGC GCCGACGTCG CACGTCGACC TGGTGCCGAC CCTGCTGTCG GCGGCGGGCG TCGACGTCGA CGCGGCCGCA ACCGTGCTGG CAGAGTCGTT CTCGGAGGTG CATCCGCTGC CGGGAAGCGA CCTGATGCCC GTGGTCGACG GCGCGCCGGC CGACGATCAT CGGTGCGTGT ACCTGATGAC GCGGGACAAC GTCCTCGAAG GCGACACCGG TGCGTCCGGG TTGGCCCGCG CATTGAAGCT GACCTCGAAA GTGCCTGCAC CCCTGCGCAT CCGGATTCCG GCCCACACGG CCGCCAACTT CGAGGGTCTG GTGATCCGGG TCGACGAAGA TGCCGCGCCG GGCGGACGCG GACACCTGTG GAAACTGGTG CGTTCGTTCG ACGATCCCGG CACCTGGACA GAACCGGGTG TGCGCCATCT GGCGGCCGAC GGCATCGGCG GACCCATGTA CCGCACCGAT CCGCTCGACG ATCAGTGGGA GCTCTACGAC CTCACCGATG ATCCCGTGGA GCAGCACAAC CGCTGGACCG ATCCCGACCT GCACGAACTG CGCGCGTATC TACGGGCCCA GCTCAAATCC GTGCGGGCGG AATCCATCCC GGAGCGCAAC CGCCCGTGGC CCTACGTCCG GCGTCAGCCC CCGCAGCCGG CACGATGGTC ACCGGGGCGC 
GCACTGCGCC GGCTGCGGGA CTGA
|
Protein sequence | MTTERPDIVV VMTDEERATP PYEPDTVRAW RSRTLGGRRW FDENGVSFLR HYTGSLACVP SRPTIFTGQY PDLHGVTQTD GIGKAHDDSR LRWLRRGEVP TLGNWLRAAG YDTHYDGKWH ISHADLIDPG TGRSLDTNDD DGVVDPAAVH RYLEADPLSP YGFSGWVGPE PHGAKLSNAG IRRDPLIADR VVAWLKDRYA RRRAGDPDAM RPFLLVASFV NPHDIVLFPA WARRNPLPAS PLDPPPVPAA PTADEDLSTK PAAQIAFREA YYSGYGPAWS IERTYRRNAQ RYRDLYYRLH AEVDTPIDRV RRAVTEGGSG DGPDDTVLVR TADHGDLLGA HGGLHQKWFN LYDEATRVPF VIARVGARPT TARTVTAPTS HVDLVPTLLS AAGVDVDAAA TVLAESFSEV HPLPGSDLMP VVDGAPADDH RCVYLMTRDN VLEGDTGASG LARALKLTSK VPAPLRIRIP AHTAANFEGL VIRVDEDAAP GGRGHLWKLV RSFDDPGTWT EPGVRHLAAD GIGGPMYRTD PLDDQWELYD LTDDPVEQHN RWTDPDLHEL RAYLRAQLKS VRAESIPERN RPWPYVRRQP PQPARWSPGR ALRRLRD
|
| |
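The Translation table and GC content fields relate the two sequences above. As a rough sketch (not the database's own method), the gene sequence can be translated and its GC fraction computed as follows. The helper names `translate` and `gc_content` are hypothetical. The codon-to-amino-acid assignments used here are those of NCBI translation table 11, which are the same as the standard code; the alternative start codons that table 11 permits are not handled, which does not matter for this gene because it starts with ATG.

```python
BASES = "TCAG"
# Amino acids in NCBI order for codons enumerated over T, C, A, G.
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = dict(zip(
    (a + b + c for a in BASES for b in BASES for c in BASES),
    AMINO,
))


def clean(seq):
    """Remove the whitespace used to group the sequence and uppercase it."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(seq):
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


# First 21 bases of the gene sequence listed above.
print(translate("ATGACCACCG AACGGCCCGA C"))  # MTTERPD
```

Passing the full Gene sequence value to `translate` should reproduce the Protein sequence value, and `gc_content` applied to it gives the fraction that the record rounds to a percentage.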