Gene Tmz1t_3990 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_3990 
Symbol 
ID7873636 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp4387593 
End bp4389341 
Gene Length1749 bp 
Protein Length582 aa 
Translation table11 
GC content69% 
IMG OID643700927 
Productsulfatase 
Protein accessionYP_002890950 
Protein GI237654636 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGCAG CGCTCGCGAC CCTGCTGATC CCGGTGGCAA CGCTCGCCAT CGGCGTCCAG 
GCCGGCCACG CCGCCTCGCC GGTCCAGCAC GACCGCCCCA ACATCCTGCT GATCATGGCC
GACGACCTCG GCTACACCGA CCTCGGCAGC TACGGCAGCG AAATCGCCAC TCCCAACCTC
GACACCCTGG CCGACACCGG GGTCAAGATG ACCCAGTTCT ACGCCTCGCC GTTCTGCTCG
CCGACGCGCG CGATGCTGAT GTCCGGCACC GACAACCACC TCGCCGGCTT CGGCGACATG
GCCGAGCTGA TGCTGCCCGA GCAGCGCGGC AAGCCGGGCT ACGAGGGCTA CCTCAACGAG
CGCGTGGTGC CGATGGCGCA GGTGCTGCGC GATGCCGGCT ATCGCACGCT GATGACCGGC
AAGTGGCACC TCGGCGTGCC CGAGCAGTAC AGCCCGGCCG CGCGCGGCTT CGACCAGTCG
TATGCGCTGG TGCATGGCGG CTCCAGCCAC TGGAGCGACG GCGCGGGCAT CGTCGCCGCC
GATCCGGCCA AGCCGCCGAA GGCCATCTAC CGCGAGAACG GCAAGGAGAC GACGCTGCCG
AAGGACTTCT TCTCGTCCGA CTTCTTCACC TCGCGGCTGA TCGAGTACAT CGACGCCGGC
AAGGGTTCGG GCAAGCCCTT CTTCGCCTAC CTCGCCTTCA CCGCGCCGCA CTGGCCGCTG
CACGCGCACG ACGCCGACAT CGCCAGGTAC GAGCAGCGCT ACAAGGACGG CTACGACAAG
CTGCGCCGCG AGCGCCTCGA GCGCATGAAG AAGCTCGGCC TGGTGGCCGC CGACACGCCG
GTGTTCGAAG GCCATCCGCT GTGGCCGAAG TGGGACAGCC TGAGCGCGGC GGAGAAGGAA
TCCGAGGCCA GGCGCATGGC GGTGTACGCC GCGATGGTCG ACAACATGGA CCAGAACATC
GGCCGCATGC TCGACTACCT GAAGAAGACC GGGCAGCTCG ACAACACCTT CATCTTCTTC
CTGTCCGACA ACGGCGCCGA CGGCAACTCG GTGTACGACG TGGCGCGCAC CCGCGAGTGG
ATCCACAAGG ACATGGACAA CAGCATCGCG CACATCGGCA AGTCCGGCTC CTACGCCGAG
TACGGGCCGG GCTGGGCGCA GGTGGGTTCG ACGCCGTTCC GCATGTTCAA GTCCTTCATG
TACGAGGGCG GCATCGCCGT GCCGGCGATC GCCTGGGGCC CGGGCGTCAA GGGCGGCAAG
CTCGAGTCGG CGATGGCCCA CGTGACCGAC ATCGCGCCCA CGCTGTTCGA GCTCGCCGGC
GCGAAGCACC CCGGCACCGA GTACCAGGGC AGGCCCGTGC TGCCGCTGCG CGGCGCCTCG
ATGCTGCCGC TGCTGCAGGG GCGCGGGCAG GCCGTGCATG GCGCGGACAA GGCGATCGGC
TGGGAGCTGG GCGGGCGCAA GGCGCTGCGC AAGGGCGACT GGAAGATCGT GTCGGCGAAC
CAGCCCTGGG GCACCGGCGA CTGGGAGCTC TTCAACGTCG CGCAGGACCG CAGCGAGAGC
CGCAACCTCG CCGCCGCCAA CCCGCAGAAG CTGGGCGAGA TGCTGGTGGC CTGGCGCGAC
TACGTGCGCG AGACCGGCAC GCTGGAGATC CCCAACCTCG CCAACCGCCC CGGCTACAGC
AACGGCGCGA AGTACTACGA GGACCTGAAG TACGAGGCCA CCCTCGTCCC GCGCACGGCC
AAGCCCTGA
 
Protein sequence
MKAALATLLI PVATLAIGVQ AGHAASPVQH DRPNILLIMA DDLGYTDLGS YGSEIATPNL 
DTLADTGVKM TQFYASPFCS PTRAMLMSGT DNHLAGFGDM AELMLPEQRG KPGYEGYLNE
RVVPMAQVLR DAGYRTLMTG KWHLGVPEQY SPAARGFDQS YALVHGGSSH WSDGAGIVAA
DPAKPPKAIY RENGKETTLP KDFFSSDFFT SRLIEYIDAG KGSGKPFFAY LAFTAPHWPL
HAHDADIARY EQRYKDGYDK LRRERLERMK KLGLVAADTP VFEGHPLWPK WDSLSAAEKE
SEARRMAVYA AMVDNMDQNI GRMLDYLKKT GQLDNTFIFF LSDNGADGNS VYDVARTREW
IHKDMDNSIA HIGKSGSYAE YGPGWAQVGS TPFRMFKSFM YEGGIAVPAI AWGPGVKGGK
LESAMAHVTD IAPTLFELAG AKHPGTEYQG RPVLPLRGAS MLPLLQGRGQ AVHGADKAIG
WELGGRKALR KGDWKIVSAN QPWGTGDWEL FNVAQDRSES RNLAAANPQK LGEMLVAWRD
YVRETGTLEI PNLANRPGYS NGAKYYEDLK YEATLVPRTA KP