Gene Information |
Locus tag | Tmz1t_3308 |
Symbol | |
ID | 7874206 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3624736 |
End bp | 3625941 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643700242 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_002890280 |
Protein GI | 237653966 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
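The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the protein length excludes a single terminal stop codon. The record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length = gene_length(3624736, 3625941)
print(length, protein_length(length))
```

Under these assumptions the result agrees with the Gene Length (1206 bp) and Protein Length (401 aa) fields listed in the record.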
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0575835 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGTCCGTTC CCGCACGCGA CCCCATGACG CGCGAAGAAC GCCGCGCCGG CATGGGCCTG GCGGCGATCT TCGCGCTGCG CATGCTCGGC CTGTTCCTGA TCCTGCCGGT GTTCGCGGTG CACGCCGCCA CCATCCCCGG GGGCGGCGAC CTCACCTTGG TGGGCCTGGC GATCGGCGCC TACGGCCTCA CCCAGGCCTG CCTGCAGATC GCCTACGGCG CCGCCTCCGA CCGCTTCGGC CGCAAGCCGG TGATCGTCTT CGGCCTGGTG CTGTTCGTGC TCGGCAGCGT GGTCGCGGCG CTCGCCGACG GCATCCACAT GATCATCGTC GGCCGCGTGC TGCAGGGTGC GGGGGCGATC TCCGCCGCGG TCACCGCGCT CGCCGCCGAC CTCACCCGCG ACCAGCACCG CACCAAGGTC ATGGCGATGA TCGGCTCCTC GATCGGCCTG GTGTTCGCGC TCTCGATGGT GGCGGCACCG CTGCTCTACG CCGCCGTCGG CATGGATGGC ATCTTCTGGC TCACCGCCGT GCTTGCCGCG GGTGCGATCG GGGTTCTGCT GTGGGCGGTG CCGGTCGCTC CGCCGGTGCC GCGCGCCACC GGCCGGCTCA TCGACGTGCT GCGCGACGGC CAGCTCATGC GGCTCAACTT CGGCGTGTTC GCGCTCCACC TCATCCAGAC CACGATGTGG GTGATGGTGC CCTCGGCCCT GGTCTCCGGC GGCGGCCTGC CGGTGCCCGA GCACTGGAAG GTCTACCTGC CGGCGGTGCT GCTGTCCTTC GTGGTCATGG TGCCGGCGGT GATCGTCGCC GAGCGCCACG ACAAGCTCAA GCCGGTGTTC AATGCCGCCA TCGTCCTGCT CGCCGTCGTG CAGATCGGGC TGTGGCTTTT CGGTGACGGA TTGATACCAT TGGCATTGTT GCTGACGCTC TTCTTTATCT CCTTCAACGT GCTCGAGGCC ACCCAACCGT CGTGGATCTC GCGTATCGCG CCGCCCGGCT CCAAGGGCAC GGCGCTGGGC GTCTACAACA CCCTGCAGTC GATCGGCCTG TTCCTCGGCG GCGTGCTCGG CGGCTGGCTG GGGCAGACCT TCGGCCCGGC CGCGGTGAGC CTGTCGTGCG CGGCGCTCGC GCTGCTCTGG CTCGCACTGG CGAGCACGAT GAATCCCCCG CCGCTGCGCG CGGCTCCGGC CGGCGCGCCG CGCTGA
Protein sequence | MSVPARDPMT REERRAGMGL AAIFALRMLG LFLILPVFAV HAATIPGGGD LTLVGLAIGA YGLTQACLQI AYGAASDRFG RKPVIVFGLV LFVLGSVVAA LADGIHMIIV GRVLQGAGAI SAAVTALAAD LTRDQHRTKV MAMIGSSIGL VFALSMVAAP LLYAAVGMDG IFWLTAVLAA GAIGVLLWAV PVAPPVPRAT GRLIDVLRDG QLMRLNFGVF ALHLIQTTMW VMVPSALVSG GGLPVPEHWK VYLPAVLLSF VVMVPAVIVA ERHDKLKPVF NAAIVLLAVV QIGLWLFGDG LIPLALLLTL FFISFNVLEA TQPSWISRIA PPGSKGTALG VYNTLQSIGL FLGGVLGGWL GQTFGPAAVS LSCAALALLW LALASTMNPP PLRAAPAGAP R
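The sequences above are printed in space-separated blocks of ten, so they need light cleanup before use. The following is a minimal sketch, not the database's own method: it strips the spacing, computes GC content, and translates a coding sequence with the amino-acid assignments of translation table 11 (which are identical to the standard code; only the set of permitted start codons differs). The helper names are hypothetical, and the translation assumes the sequence is given 5' to 3' on the coding strand and begins with ATG, as the gene sequence above does.

```python
# Amino-acid assignments for NCBI translation table 11, codons ordered
# by base in the order T, C, A, G (same assignments as the standard code).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a cleaned, non-empty sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Alternative start codons (e.g. GTG, TTG) are NOT remapped to M here.
    """
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed in this record.
prefix = clean("ATGTCCGTTC CCGCACGCGA CCCCATGACG")
print(translate(prefix), round(gc_content(prefix), 3))
```

Applied to the first three blocks of the gene sequence, the translation reproduces the first ten residues of the protein sequence (MSVPARDPMT). Passing the full Gene sequence field through `clean` first allows the same functions to be checked against the GC content and Protein Length fields.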