Gene Information |
Locus tag | Tmz1t_1810 |
Symbol | |
ID | 7084232 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 2032887 |
End bp | 2034089 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643698832 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_002355458 |
Protein GI | 217970224 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
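The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon. The function names `gene_length` and `protein_length` are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return abs(end_bp - start_bp) + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming its length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(2032887, 2034089)
print(length, protein_length(length))  # prints "1203 400"
```

Under these assumptions the result agrees with the listed Gene Length (1203 bp) and Protein Length (400 aa).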
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.390541 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGGAC CTCATCTCCT GCCGCTGCTG CTGCTCTACG CCCTGATGCT GCCGGTCACC GGCATGGTGC CGGTGCTGCC CGAGTTCACC GCGCAGCGCT TTCCCGGTCT GGGCCAGTTC GCCAGCCACT TCTTCATGTC GATCAACATG ATCGGCGCGC TGCTCGGCGC GCCGATCGCC GGCCTGCTCT CCGACCGCCT CGGCAAGCGC CGCCTGCTCG CGGTAGGCGC GCTCGCCGTG AACGGCATCG CGCTGCTGGG CATCGCCTGG GCCTGGCGCA GCACGGAGAG CTACGCGCTG CTCCTCGCGC TGCGCCTCGT CGAGGGCTTC GCCCACATGT CGGCGCTGTC GCTGCTGATG GCGCTCGCCG CCGACCACGC CGGCAAGGCC GGGCTGGGCG CGCGCATGGG CGCGGTGGGC GCCTCGATCA GCCTGGGGGT GGCCACCGGC GCGCCGCTCG GGGGCATCAT CGGCGACATC GACCCCTTCT GGGTGCCCCT CGGCGGCGGC CTGCTCTCGC TGGCGATGGC CGCGCTCGGC TTCGTCCGCC TCGCCGAGGG CGGCGGCACG CGGCCGCGCA TGACGGCATC CGAGATCGTC GACACCCTGC GCAACCGTCG CCAGTTGCTG ATCCCGCTCG CGTTCTCCTT CGCCGACCGC CTCACCGTAG GCTTCATCGT GTCGACGCTG TCGCTCTACC TCGGCCTGGT GATCGGTTTC GATGCGCGCC AGATCGGCAT CGCGATGGCG GCCTTCCTGA TCCCCTTCTC GGTGCTCACC TGGCCCGCCG GCCACCTGTC GCGGCATTGG GATCCGTTGT GGATGATGGT GATCGGCAGC GTGCTCTACG GCGTCTTCCT CGCCGTGCTC GGCTTCGTCC CGGGCGATCG GGTGGTGGCG ACGATGGCCG CGGGCGGCGT GATCGCGGCG CTGATGTATG CGCCCTCACT GGTGCTGGCC GCGCAATATG GCGGCAGCGA CTGCCGCGCC AGCGCGCTGG CCGCCTTCAA CATGGCGGGC TCGCTCGGCT TTGCCGCCGG CCCGCTGCTC AGCAGCGCGC TGCTCGCCTT CTTCGGCCTG GTGCTGGAAC GCCCCTACCC GCCAGTCTTC GTGGCGATCG GGCTGATCGA GGTCGTGCTC GCCCTCGCGG TGCTGCTGCT GGTGCGCCGC GGCCGGCTGC AGGCCGGCGC CGCGACGGCC TGA
|
Protein sequence | MTGPHLLPLL LLYALMLPVT GMVPVLPEFT AQRFPGLGQF ASHFFMSINM IGALLGAPIA GLLSDRLGKR RLLAVGALAV NGIALLGIAW AWRSTESYAL LLALRLVEGF AHMSALSLLM ALAADHAGKA GLGARMGAVG ASISLGVATG APLGGIIGDI DPFWVPLGGG LLSLAMAALG FVRLAEGGGT RPRMTASEIV DTLRNRRQLL IPLAFSFADR LTVGFIVSTL SLYLGLVIGF DARQIGIAMA AFLIPFSVLT WPAGHLSRHW DPLWMMVIGS VLYGVFLAVL GFVPGDRVVA TMAAGGVIAA LMYAPSLVLA AQYGGSDCRA SALAAFNMAG SLGFAAGPLL SSALLAFFGL VLERPYPPVF VAIGLIEVVL ALAVLLLVRR GRLQAGAATA
|
| |
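The sequences above are displayed in space-separated blocks of 10 characters, so the spaces have to be removed before any computation. As a minimal sketch, the helper names `clean_sequence` and `gc_percent` below are hypothetical; they show one way to strip the block formatting and compute a GC percentage comparable to the GC content field.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Only the first two blocks of the gene sequence are used here for brevity;
# paste the full Gene sequence field to compare against the reported GC content.
first_blocks = "ATGACCGGAC CTCATCTCCT"
seq = clean_sequence(first_blocks)
print(len(seq), gc_percent(seq))
```

Because the record is on the minus strand but the displayed sequence already begins with ATG and ends with TGA, this sketch assumes the field is shown in coding orientation and does no reverse-complementing.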