Gene Information |
Locus tag | Tmz1t_3990 |
Symbol | |
ID | 7873636 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 4387593 |
End bp | 4389341 |
Gene Length | 1749 bp |
Protein Length | 582 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643700927 |
Product | sulfatase |
Protein accession | YP_002890950 |
Protein GI | 237654636 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
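The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 4387593
END_BP = 4389341


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int, stop_codons: int = 1) -> int:
    """Amino acids encoded by an unspliced CDS.

    Assumes the CDS length is a whole number of codons and that
    `stop_codons` trailing codons are not translated.
    """
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - stop_codons


if __name__ == "__main__":
    n = gene_length(START_BP, END_BP)
    print(n, protein_length(n))  # prints: 1749 582
```

Under these assumptions the result matches the listed Gene Length (1749 bp) and Protein Length (582 aa).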
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGCAG CGCTCGCGAC CCTGCTGATC CCGGTGGCAA CGCTCGCCAT CGGCGTCCAG GCCGGCCACG CCGCCTCGCC GGTCCAGCAC GACCGCCCCA ACATCCTGCT GATCATGGCC GACGACCTCG GCTACACCGA CCTCGGCAGC TACGGCAGCG AAATCGCCAC TCCCAACCTC GACACCCTGG CCGACACCGG GGTCAAGATG ACCCAGTTCT ACGCCTCGCC GTTCTGCTCG CCGACGCGCG CGATGCTGAT GTCCGGCACC GACAACCACC TCGCCGGCTT CGGCGACATG GCCGAGCTGA TGCTGCCCGA GCAGCGCGGC AAGCCGGGCT ACGAGGGCTA CCTCAACGAG CGCGTGGTGC CGATGGCGCA GGTGCTGCGC GATGCCGGCT ATCGCACGCT GATGACCGGC AAGTGGCACC TCGGCGTGCC CGAGCAGTAC AGCCCGGCCG CGCGCGGCTT CGACCAGTCG TATGCGCTGG TGCATGGCGG CTCCAGCCAC TGGAGCGACG GCGCGGGCAT CGTCGCCGCC GATCCGGCCA AGCCGCCGAA GGCCATCTAC CGCGAGAACG GCAAGGAGAC GACGCTGCCG AAGGACTTCT TCTCGTCCGA CTTCTTCACC TCGCGGCTGA TCGAGTACAT CGACGCCGGC AAGGGTTCGG GCAAGCCCTT CTTCGCCTAC CTCGCCTTCA CCGCGCCGCA CTGGCCGCTG CACGCGCACG ACGCCGACAT CGCCAGGTAC GAGCAGCGCT ACAAGGACGG CTACGACAAG CTGCGCCGCG AGCGCCTCGA GCGCATGAAG AAGCTCGGCC TGGTGGCCGC CGACACGCCG GTGTTCGAAG GCCATCCGCT GTGGCCGAAG TGGGACAGCC TGAGCGCGGC GGAGAAGGAA TCCGAGGCCA GGCGCATGGC GGTGTACGCC GCGATGGTCG ACAACATGGA CCAGAACATC GGCCGCATGC TCGACTACCT GAAGAAGACC GGGCAGCTCG ACAACACCTT CATCTTCTTC CTGTCCGACA ACGGCGCCGA CGGCAACTCG GTGTACGACG TGGCGCGCAC CCGCGAGTGG ATCCACAAGG ACATGGACAA CAGCATCGCG CACATCGGCA AGTCCGGCTC CTACGCCGAG TACGGGCCGG GCTGGGCGCA GGTGGGTTCG ACGCCGTTCC GCATGTTCAA GTCCTTCATG TACGAGGGCG GCATCGCCGT GCCGGCGATC GCCTGGGGCC CGGGCGTCAA GGGCGGCAAG CTCGAGTCGG CGATGGCCCA CGTGACCGAC ATCGCGCCCA CGCTGTTCGA GCTCGCCGGC GCGAAGCACC CCGGCACCGA GTACCAGGGC AGGCCCGTGC TGCCGCTGCG CGGCGCCTCG ATGCTGCCGC TGCTGCAGGG GCGCGGGCAG GCCGTGCATG GCGCGGACAA GGCGATCGGC TGGGAGCTGG GCGGGCGCAA GGCGCTGCGC AAGGGCGACT GGAAGATCGT GTCGGCGAAC CAGCCCTGGG GCACCGGCGA CTGGGAGCTC TTCAACGTCG CGCAGGACCG CAGCGAGAGC CGCAACCTCG CCGCCGCCAA CCCGCAGAAG CTGGGCGAGA TGCTGGTGGC CTGGCGCGAC TACGTGCGCG AGACCGGCAC GCTGGAGATC CCCAACCTCG CCAACCGCCC CGGCTACAGC AACGGCGCGA AGTACTACGA GGACCTGAAG TACGAGGCCA CCCTCGTCCC GCGCACGGCC AAGCCCTGA
|
Protein sequence | MKAALATLLI PVATLAIGVQ AGHAASPVQH DRPNILLIMA DDLGYTDLGS YGSEIATPNL DTLADTGVKM TQFYASPFCS PTRAMLMSGT DNHLAGFGDM AELMLPEQRG KPGYEGYLNE RVVPMAQVLR DAGYRTLMTG KWHLGVPEQY SPAARGFDQS YALVHGGSSH WSDGAGIVAA DPAKPPKAIY RENGKETTLP KDFFSSDFFT SRLIEYIDAG KGSGKPFFAY LAFTAPHWPL HAHDADIARY EQRYKDGYDK LRRERLERMK KLGLVAADTP VFEGHPLWPK WDSLSAAEKE SEARRMAVYA AMVDNMDQNI GRMLDYLKKT GQLDNTFIFF LSDNGADGNS VYDVARTREW IHKDMDNSIA HIGKSGSYAE YGPGWAQVGS TPFRMFKSFM YEGGIAVPAI AWGPGVKGGK LESAMAHVTD IAPTLFELAG AKHPGTEYQG RPVLPLRGAS MLPLLQGRGQ AVHGADKAIG WELGGRKALR KGDWKIVSAN QPWGTGDWEL FNVAQDRSES RNLAAANPQK LGEMLVAWRD YVRETGTLEI PNLANRPGYS NGAKYYEDLK YEATLVPRTA KP