Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3073 |
Symbol | |
ID | 7874543 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3327070 |
End bp | 3328239 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643699996 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_002890048 |
Protein GI | 237653734 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTCCCT ACTGGCGCCT GTCGGCGTAC TACTTCTTCT ACTTCGCCTT CGTCGGCGCG TTCTCGCCCT ACTTCACGCT CTACCTGCAG TCGATCGCGC TGTCGGCCAC CGACATCGCG CTGCTGATGT CGCTGATGCA GCTGATGCGC GTGCTGGCTC CCAACCTGTG GGGCTGGCTG GCGGAACGCC TGGGGATGCG TATCGCGATC GTGCGTCTGT CGGCGCTCGC CAGCCTGGCG GGCTTCTCGG TGTTTTTCCT GACTACCGAG TTCGCCGGGC TGTTCGCCGC GATGGCGCTG ATGGCCTTCT TCTGGAGCGC GGTGCTGCCG CTGATCGAGG GTCTCGCCTT CGCCCACCTC GGCGAGGCTT CGCACCGCTA TGGCCGCATC CGCGTATGGG GCTCGGTCGG CTTCATCGTC GCCGTGCTCG CGCTCGGGCA TTCGCTCGAC CGCCTGCCGA TCGAGGCGGT GCTGTGGATC ACCATGTCCA TCCTCGTCGG CATCGTGCTG TGCAGCTTCA TCGTCCCCGA GGCGCCGCGC CCGCCGCTGC AGCGCGACGC CGCGAGCTTC GGCGACACCC TGCGCCGTCC CGAGGTGCGT GCCCTGCTCG GCGCCTGCTT CCTGATGTCG GCGGCACACG GCGCGCTCTA CGTGTTCTAT TCGATCCACC TGGTCGGCAT GGGTTACGAC AAGGGCGTGG TGGGCTGGAT GTGGACGCTG GGGGTGCTCG CCGAGATCGG GGTCTTCATG TGGATGCCGC GCATCTCGGT GCGCTTTTCG CTGCGTGCGA TCCTGCTGTT CTCCTTCGCC TGCGCGGTGG CGCGCTTCCT GATGATCGGC TGGGGCGCGC ACAGCCTGGC GCTGCTGCTG CTCGCCCAGG TTCTGCACGG CGCCACCTTC GGCGCCTACC ACGCCGCGGC GATCGCGGTG GTCAACGAGT GGTTTCCGGG CCGCCTGCAG TCGCGCGGCC AGGCGCTCTA CGGCAGCATC TCCTTCGGGG CCGGCGGCAT GCTGGGCGGC TTGCTGAGCG GTTACACCTG GGAGGGCATC GGGCCGGCGT GGACGTATAC GATCGGCTCC GGCTTCGCGC TCGCCGGCCT GCTCTGGTTG CTGCTGGGCT GGAAAGGGGA ACCCCGCCCC GCGCACGCAC TCCACGAACC GAGGAGGTAA
|
Protein sequence | MIPYWRLSAY YFFYFAFVGA FSPYFTLYLQ SIALSATDIA LLMSLMQLMR VLAPNLWGWL AERLGMRIAI VRLSALASLA GFSVFFLTTE FAGLFAAMAL MAFFWSAVLP LIEGLAFAHL GEASHRYGRI RVWGSVGFIV AVLALGHSLD RLPIEAVLWI TMSILVGIVL CSFIVPEAPR PPLQRDAASF GDTLRRPEVR ALLGACFLMS AAHGALYVFY SIHLVGMGYD KGVVGWMWTL GVLAEIGVFM WMPRISVRFS LRAILLFSFA CAVARFLMIG WGAHSLALLL LAQVLHGATF GAYHAAAIAV VNEWFPGRLQ SRGQALYGSI SFGAGGMLGG LLSGYTWEGI GPAWTYTIGS GFALAGLLWL LLGWKGEPRP AHALHEPRR
|
| |