Gene Tmz1t_1480 details


Gene Information

Locus tag: Tmz1t_1480
Symbol:
ID: 7083563
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 1650141
End bp: 1652510
Gene Length: 2370 bp
Protein Length: 789 aa
Translation table: 11
GC content: 66%
IMG OID: 643698498
Product: sulfatase
Protein accession: YP_002355135
Protein GI: 217969901
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
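
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (an assumption; the record does not state its coordinate convention) and that the CDS ends in a single stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1650141
end_bp = 1652510
gene_length_bp = 2370
protein_length_aa = 789

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
computed_length = end_bp - start_bp + 1

# A CDS has one codon per residue plus one terminal stop codon.
computed_protein_length = computed_length // 3 - 1

print(computed_length, computed_protein_length)
```

Under these assumptions both computed values agree with the listed Gene Length and Protein Length.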


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCAAGC AGGACAACAA ACCCCGTACC CACCTGCCGA TGCGCAATAC CGCGCCGGCG 
CGCTTCGTTA ACTACGATGC CAAGGATCCG GACACGGTCA CGACCCCGAT CGAGCAGTTG
CGTCCGCCCG AAGGCGCGCC GAACGTGCTG ATCGTGCTCA TCGACGACTG CGGCTTCGGC
GCGCCCAGCA CCTTCGGCGG CCCCTGCGCC ACGCCCTTTG CCGACAAGCT GGCCGCGGGC
GGCCTGCGCT ACAACCGCTT CCACACCACC GCGATCTGCT CGGCCACGCG CCAGTCCCTG
CTCACCGGCC GCAACCACCA CAACGCCGGC ATGGCCGGCA TCACCGAGAT CGCCACCGGC
CAGCCGGGCT ACTCCTCGGT GCTGCCCAAC TCCATGGCGC CGCTGGCCAA GACGCTCAAG
CTCAACGGCT ACAACACCGC ACAGTTCGGC AAATGCCACG AGGTGCCGGT GTGGCAGTCG
AGCCCGGTCG GCCCCTTCGA CCAGTGGCCC ACGGGCGGGG GCGGCTTCGA GTACTTCTAC
GGCTTCATCG GTGGCGAGGC GCACCAGTGG TATCCCGCGC TCGTCGAGGG CACCACCCCG
GTCGCGCCGC CGAAGACGCC CGAGGAGGGT TACCACCTGA TGGAGGACAT GACCGACAAG
GCCATCGGCT GGATCCAGCA GCAGAAGGCC CTCTCGCCCG ACAAGCCCTT CTTCGCCTAT
TTCGCGCCCG GCGCGACCCA CGCGCCGCAC CATGTGCCCA AGGAATGGGC CGACAAGTAC
AAGGGCAAGT TCGACGCCGG CTGGGATGTG CTGCGCGAGG AGACCTTCGC CCGCCAGAAG
AAGCTCGGCG TCATTCCTGC CGACTGCGAA CTCACTGCCC GGCCCAAGGA CGTTCCCGGC
TGGGACGACA TGCCCGAGGC GTTCAAGCCG GTGCTGCGTC GGCAGATGGA GGTCTATGCG
GGCTTCCTCG AGTTCACCGA CCATCACGTG GGCCGCCTGT TCGAGGCGAT CGAGAATCTC
GGCATCGCCG ACGACACGCT GGTGTATTAC ATCATCGGCG ACAACGGCGC CTCGGCCGAG
GGCTCGTTCA ACGGCTGCTT CAACGAGATG AGCTACTTCA ACGGCCTGCA GTCCCTCGAG
ACCGCCGAGT ACCTGACGGA GCGGATCGAC AAGCTCGGCG GGCCGGAGTC GTACAACCAC
TTCGCGGTGG GTTGGGCGCA TGCGCTGAAC ACGCCCTACC AGTGGACCAA GCAGGTCGCC
TCGCACTTCG GCGGCACGCG CAACGGCACC ATCGTGCACT GGCCGAAGGG CATCCAGGCC
AAGGGCGAGT TGCGCACGCA TTTTGCCCAC GTGATCGACG TGGCGCCGAC CATCCTCGAG
GCGGCGGGCA TTCCCGAGCC GGTCTCGGTC GATGGCATCC AGCAGGACCC GATCGATGGG
GTCAGCATGC TGAAGACGTT CAACGACGCC AAGGCACCGG AAGCCCACGA GACGCAGTAC
TTCGAGGTCA TGGGCAACCG GGCCATCTAT CACAAGGGCT GGACCGCGGT GACCAAGCAC
TACACGCCGT GGATTCCGAA CGTGAAGCCC GCGCTCGACG ACGACGTCTG GGAGCTCTAC
GACACCACCA CGGACTGGGC GCAGTCCAAG GACCTGTCGA AGGAGATGCC GGACAAGCTG
CGCGAACTGC AGCGCCTGTG GATCATCGAG GCGACGCGCA ACAAGGTGCT GCCGATCGAT
GACCGGATGT TCGAGAAGAT AAATCCGGAC ACCGCCGGCC GGCCGACGCT GGTGAAGGGC
AAGACCCAGC TCCTGGCCGG TGGCATGAGC CACCTGGGCG AGAACTGTGT CCTCAACATC
AAGAACAAGT CGCACTCCGT CACCGCAGCG ATCGTGGTCC CGCAAGGCGG GGCAGAGGGC
GTCATCATCG CGCAGGGCGC CGATATCGGC GGCTGGAGCC TGTATGCCAA GGGCGGCAAG
CTCAAGTACT GCTACAACTG GGGCGGCTTC AAGCACTTCA TGATCGAAGG CGCTTCGGTC
ATGGCTCCGG GCGAGCATCA GGTGCGCATG GAGTTCGCCT ACGCCGGTGG CGGCCTCGGC
AAGGGCGGCA AGGTGACGCT GTACGTCGAT GGCCAGCAGG ACGGCGAGGG CGAAGTCGGC
GCAACGCTGG CGATGATCTT CTCGGCCGAC GACGGTCTGG ATGTCGGCAA GGACGGCGGC
TCGGCGGTAT CGCCGGACTA CAAGCCTGGC AACAATTCCT TCAACGGCAA GGTGAAGGGC
GTGCAGCTCG CGATCGACGA GGCCGCGGAA GACCTGGATC ACCTGCTCGA TCCGGCGGAC
GTCATCCGCA TGGCGATGGC CCGCCAGTAA
 
Protein sequence
MSKQDNKPRT HLPMRNTAPA RFVNYDAKDP DTVTTPIEQL RPPEGAPNVL IVLIDDCGFG 
APSTFGGPCA TPFADKLAAG GLRYNRFHTT AICSATRQSL LTGRNHHNAG MAGITEIATG
QPGYSSVLPN SMAPLAKTLK LNGYNTAQFG KCHEVPVWQS SPVGPFDQWP TGGGGFEYFY
GFIGGEAHQW YPALVEGTTP VAPPKTPEEG YHLMEDMTDK AIGWIQQQKA LSPDKPFFAY
FAPGATHAPH HVPKEWADKY KGKFDAGWDV LREETFARQK KLGVIPADCE LTARPKDVPG
WDDMPEAFKP VLRRQMEVYA GFLEFTDHHV GRLFEAIENL GIADDTLVYY IIGDNGASAE
GSFNGCFNEM SYFNGLQSLE TAEYLTERID KLGGPESYNH FAVGWAHALN TPYQWTKQVA
SHFGGTRNGT IVHWPKGIQA KGELRTHFAH VIDVAPTILE AAGIPEPVSV DGIQQDPIDG
VSMLKTFNDA KAPEAHETQY FEVMGNRAIY HKGWTAVTKH YTPWIPNVKP ALDDDVWELY
DTTTDWAQSK DLSKEMPDKL RELQRLWIIE ATRNKVLPID DRMFEKINPD TAGRPTLVKG
KTQLLAGGMS HLGENCVLNI KNKSHSVTAA IVVPQGGAEG VIIAQGADIG GWSLYAKGGK
LKYCYNWGGF KHFMIEGASV MAPGEHQVRM EFAYAGGGLG KGGKVTLYVD GQQDGEGEVG
ATLAMIFSAD DGLDVGKDGG SAVSPDYKPG NNSFNGKVKG VQLAIDEAAE DLDHLLDPAD
VIRMAMARQ
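
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the amino acid assignments of NCBI translation table 11 (the table listed in this record) and uses only the first 60 nucleotides and first 20 residues shown above as sample input. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical and not part of the source database. The sketch does not handle alternative start codons, which is not a concern here because this gene begins with ATG.

```python
# Hypothetical helpers for illustration; not part of the source database.
BASES = "TCAG"
# Amino acid assignments for NCBI translation table 11, in TCAG codon order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map each of the 64 codons to its one-letter amino acid ('*' marks a stop).
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks that group residues in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a cleaned DNA string codon by codon, stopping at the first stop."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Return the fraction of G and C bases in a cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence and first 20 residues of the protein sequence.
first_gene_line = "ATGAGCAAGC AGGACAACAA ACCCCGTACC CACCTGCCGA TGCGCAATAC CGCGCCGGCG"
first_protein_residues = "MSKQDNKPRT HLPMRNTAPA"

fragment = clean(first_gene_line)
print(translate(fragment) == clean(first_protein_residues))
```

Passing the full gene sequence through `clean` and `translate` in the same way would allow the complete protein sequence to be compared, and `gc_fraction` could be applied to the cleaned gene sequence to compare against the listed GC content.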