Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0647 |
Symbol | |
ID | 7084585 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 735070 |
End bp | 736425 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643697673 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_002354315 |
Protein GI | 217969081 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00895] benzoate transport |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGAAAAG TAGATGTGCA TCAAACAATC GACGAGGCGA AGTTCAACAA ATTTCACCTG AACGTGCTGT TCTGGTGTGC GCTCGTCATC ATCTTCGACG GCTACGACCT GGTGATCTAC GGGGTCGTGC TGCCGGTGCT GATGAAGGAG TGGGACCTCA CGCCGCTGCA GGCGGGCTCG CTCGGCAGCG CCGCGCTGTT CGGCATGATG TTCGGCGCGC TGATCTTCGG GCCGCTGTCC GACCGCATCG GCCGCAAGAA GGTGATCATG ATCACGGTGA TCATCTTCAG CGGCGTGACC TTCCTCAATG GCTTCGCCGA GACGCCCACC CAGTTCGCCG CGATGCGCTT CATCGCCGGC CTCGGCATCG GCGGCGTGAT GCCCAACGTG GTCGCGCTGA TGACCGAGTA CATGCCCAGG AAGATCCGCA GCACCATGGT CGCGGTGATG TTCAGCGGCT ACTCGGTCGG CGGCATGACC TCGGCCTTCC TCGGCATGTA CCTGATGCCC AGCCACGGCT GGCAGTCGGT GTTCTTCGTC GCCGGCATCC CGCTGCTGCT GCTGCCGCTG ATCTGGATCT TCCTCCCCGA GGCCGTCGGC TTCCTGCTCA AGGAAGGTCG CAAGGAAGAG GCTGGCCGCC TGCTCGCCCG CGTCGAGCCG TCCTACTCGC CCGCGGCCGG CGACAACTTC CACCTGGTCA CCGGCCAGGG CAGGAGCGTC GCCCTGGTCG CGCTGTTCCA GAACGGTCGC CTGGTCAGCA CGCTGATGTT CTGGGTCGCC TTCTTCAGCT GCCTGCTGAT GGTGTATGCG CTCGGCTCCT GGCTGCCGAA GCTGATGAAC AAGGCCGGCT ACGAGCTCGG CTCCAGCCTG AGCTTCCTGC TGGTGCTCAA CTTCGGTGCG ATCTTCGGCG CGATCGGCGG CGGCTGGCTG GCGGACCGCT TCCACCTGCG CCGGGTGCTG ACCATCATGT TCGTGATCGC GGCGCTGTCG ATCAGCGCGC TCGCGATCAA GAACGACATG ATCGTCCTCT ACCTGCTGGT GGCGATCGCC GGTGCGACCA CGATCGGCTC GCAGATCCTG CTCTACGCCT ACGTCGCGCA GTACTATCCG CTGGCGATCC GCTCCACCGG CATCGGCTGG GCCTCGGGCG TGGGGCGCCT GGGGGCGATT TCGGGGCCGA TGCTGGGCGG TGCGCTGATC TCGCTGCCGC TGGAGATGAA CTTCCTCGCC TTCGCCATCC CGGGTGCGGT CGCCGGCCTG GCGATCTCCA TGGTCGGGCG CAGCGCGGTG CATGCGCCGG AAGAGGAGAA GAACCTCGCC TTCGTGGGCG AGACCGGCAA GCTCGAGGTG GATTGA
|
Protein sequence | MRKVDVHQTI DEAKFNKFHL NVLFWCALVI IFDGYDLVIY GVVLPVLMKE WDLTPLQAGS LGSAALFGMM FGALIFGPLS DRIGRKKVIM ITVIIFSGVT FLNGFAETPT QFAAMRFIAG LGIGGVMPNV VALMTEYMPR KIRSTMVAVM FSGYSVGGMT SAFLGMYLMP SHGWQSVFFV AGIPLLLLPL IWIFLPEAVG FLLKEGRKEE AGRLLARVEP SYSPAAGDNF HLVTGQGRSV ALVALFQNGR LVSTLMFWVA FFSCLLMVYA LGSWLPKLMN KAGYELGSSL SFLLVLNFGA IFGAIGGGWL ADRFHLRRVL TIMFVIAALS ISALAIKNDM IVLYLLVAIA GATTIGSQIL LYAYVAQYYP LAIRSTGIGW ASGVGRLGAI SGPMLGGALI SLPLEMNFLA FAIPGAVAGL AISMVGRSAV HAPEEEKNLA FVGETGKLEV D
|
| |