Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1853 |
Symbol | |
ID | 7084276 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 2086792 |
End bp | 2087856 |
Gene Length | 1065 bp |
Protein Length | 354 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643698876 |
Product | arsenical-resistance protein |
Protein accession | YP_002355501 |
Protein GI | 217970267 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0798] Arsenite efflux pump ACR3 and related permeases |
TIGRFAM ID | [TIGR00832] arsenical-resistance protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00716983 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGACTCT TCGAGCGCTA CCTCACCGTC TGGGTGGGGC TGGGCATCCT CGCCGGGGTG GGCCTCGGGC TGCTCGCGCC CGGGGCCTTC CAGGCCATCG CCGGGCTCGA GTTCGCGCAG GTGAACCTGG TGGTGGCGGT GTTCATCTGG GTGATGATCT ACCCGATGAT GATCCAGATC GACTGGCACG CCGTTCGCGA TGTCGGCAAG AAGCCGCAGG GACTCGTCCT GACCCTGGTC GTGAACTGGC TGATCAAGCC CTTCACGATG GCGGCGCTCG GCGTGCTGTT CTTCCAGCAC CTGTTCGCGC CCTGGGTGGA TCCGGCTTCG GCCAGCGAGT ACATCGCCGG CATGATCCTG CTCGGGGTGG CGCCGTGCAC GGCGATGGTG TTCGTATGGA GCCAGCTGGT GAAGGGCGAC CCCAACTACA CCCTGGTGCA GGTGTCGGTG AACGATCTCG TCATGGTGTT CGCCTTCGCC CCCATCGCCG CCTTCCTGCT CGGCGTCACC GATCTCGTCG TGCCGTGGGA GACCCTGCTG CTGTCGACCG TGCTCTACGT GGTGCTGCCG CTGCTCGCCG GCATGGCGAC GCGGCATGTG CTGGAGCGTC GCTCGGCGCA GGCGGTGGCC GGGTTCGTCG CTCGCCTCAA GCCCTGGTCC ATCGTCGGTC TGATCGCCAC CGTGGTGCTG CTGTTCGGTT TCCAGGCGCG CACCCTCGTG GAACAGCCGC TGGTGATCGG CCTGATCGCC GTGCCGCTGC TGGTGCAGAG CTACGGCATC TTCCTCATCG CCTATCTCGC CGCGAAGGTG ATGAGGCTGC CGCACGAGGT CGCCGGCCCG GCCTGCCTGA TCGGCACCTC GAATTTCTTC GAGCTCGCCG TGGCGGTGGC GATCTCGCTG TTCGGGCTGA ATTCGGGGGC GGCGCTGGCG ACCGTGGTCG GTGTGCTGGT GGAGGTGCCG GTGATGCTGT CGCTGGTGGC GATCGTCAAT CGAACGCGGC ATTGGTTTCC GTCCGCGGCA GGGCCGCAGG CGGCTCTCGA CCGACCGGGT GGCGCGCGTG GCTGA
|
Protein sequence | MGLFERYLTV WVGLGILAGV GLGLLAPGAF QAIAGLEFAQ VNLVVAVFIW VMIYPMMIQI DWHAVRDVGK KPQGLVLTLV VNWLIKPFTM AALGVLFFQH LFAPWVDPAS ASEYIAGMIL LGVAPCTAMV FVWSQLVKGD PNYTLVQVSV NDLVMVFAFA PIAAFLLGVT DLVVPWETLL LSTVLYVVLP LLAGMATRHV LERRSAQAVA GFVARLKPWS IVGLIATVVL LFGFQARTLV EQPLVIGLIA VPLLVQSYGI FLIAYLAAKV MRLPHEVAGP ACLIGTSNFF ELAVAVAISL FGLNSGAALA TVVGVLVEVP VMLSLVAIVN RTRHWFPSAA GPQAALDRPG GARG
|
| |