Gene Information |
Locus tag | Tmz1t_1945 |
Symbol | |
ID | 7084413 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 2187037 |
End bp | 2188218 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643698970 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_002355592 |
Protein GI | 217970358 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
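The coordinate and length fields above are linked by simple arithmetic, which the following Python sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length counts the terminal stop codon; neither convention is stated on this page, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 2187037
end_bp = 2188218
gene_length_bp = 1182
protein_length_aa = 393

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS length includes the stop codon, which encodes no residue.
total_codons = gene_length_bp // 3
coding_codons = total_codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a whole number of codons:", gene_length_bp % 3 == 0)
print("codons minus stop match protein length:", coding_codons == protein_length_aa)
```

Under those assumptions all three checks are consistent with the values in the record.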
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.932651 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAGA ATCTCTGGCT GCTCGCGATC GCGCAGGGCC TGTTCCTGAC CAACAACGTG GTGTTCATCG CCATCAACGG CCTGGTGGGC TTCAGCCTGG CGCCGGTGGG CTGGATGGCG ACGCTGCCGG TGATGGGCTA CGTGGTGGGC GGGGCGTTGT CGACCGGGCT GGTGGCGCGC AGCCAGCGGC GCTGGGGGCG GCAGGTCTCC TTTCAGCTCG GCCTGCTGGT GGCCTTCCTG ACCGCGCTGC TGTGCGCCTT CGCGCTGTCG CTCGGCAGCT TCTGGCTGCT CGTGGCGGGC ACGGTGGTCG CCGGCTACTA CAGCGCCAAC GGCCAGCTCT ACCGTTTCGC CGCCGGCGAG CTGGCGCCGT CGTCCTTCCG CGAGAAGGCG GTGTCGCTGG TGCTGGCAGG CGGCCTCATC GGTGCGGTGA TCGGTCCCAA CCTCGCCAAC CACACGCGCG AGGCGATGGC CGTGCCCTTC CTCGGCTCCT ATCTCGCGCT CGCGGTGGTG GCGCTGGTGT CGATGGCGGT GATGGCCGGA ATCCGGTTTC CGCCGCAGCC GGTGAAGAAG GCGGGTGCCG ACGGCGGACG GCCGCTGGGC GAGATCCTGC GCCAGCCGGT GTTCATCGTG TCCGTGCTGG CCGCGGCGAT GAGCTACGGC GTGATGAACC TGCTGATGGC GGCCACGCCG CTGGCGATGG ACGTGTGCGG CATGCCCTTC TCGGACGCGG CGCTGGTGCT GGAGTGGCAC GTGATCGGCA TGTTCGCGCC CGGCTTCTTC ACCGGCCACC TGATCAAGCG CTTCGGCGTG CTGCGCATCA TGGGCGTGGG CGTGGTGCTG AACTTCGCCT GCGTGGCGGT GGCGCTCTCG GGCCAGGACC TGCACCAGTT CCTGCTCGCG CTCTTCCTGC TCGGCGTGGG CTGGAACTTC CTGTTCACCG GCGCGACCAC GCTGTCGATG GAAGCCTACC GGCCGGAGGA GAAGGACAAG GCGCAGGCGG CGATCAATTT CGTGGTGTTC GCGGTGATGG CGTTTACTTC TTTCGCCTCG GGCGCGCTGG TGACGACGCA GGGCTGGAAT CTGCTCAACG TGGGCTCGAT CGCGCCGCTC GCGCTTACCG CCGCGGCGCT GGGGTGGCTT GCGCTGCTGC GGCGGCGTGC GGCGGACGGG CAGGGCGCAT GA
|
Protein sequence | MKKNLWLLAI AQGLFLTNNV VFIAINGLVG FSLAPVGWMA TLPVMGYVVG GALSTGLVAR SQRRWGRQVS FQLGLLVAFL TALLCAFALS LGSFWLLVAG TVVAGYYSAN GQLYRFAAGE LAPSSFREKA VSLVLAGGLI GAVIGPNLAN HTREAMAVPF LGSYLALAVV ALVSMAVMAG IRFPPQPVKK AGADGGRPLG EILRQPVFIV SVLAAAMSYG VMNLLMAATP LAMDVCGMPF SDAALVLEWH VIGMFAPGFF TGHLIKRFGV LRIMGVGVVL NFACVAVALS GQDLHQFLLA LFLLGVGWNF LFTGATTLSM EAYRPEEKDK AQAAINFVVF AVMAFTSFAS GALVTTQGWN LLNVGSIAPL ALTAAALGWL ALLRRRAADG QGA
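Both sequences above are displayed in space-separated blocks of ten characters. The following is a minimal Python sketch for joining such blocks and computing a length and GC percentage. The function names `join_blocks` and `gc_percent` are hypothetical, and the input is a toy example (only the first three blocks of the gene sequence), not the full record.

```python
def join_blocks(blocked: str) -> str:
    """Remove the whitespace separating the 10-character display blocks."""
    return "".join(blocked.split())


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Toy input: the first three display blocks of the gene sequence.
example = "ATGAAGAAGA ATCTCTGGCT GCTCGCGATC"
seq = join_blocks(example)

print(len(seq))
print(round(gc_percent(seq), 1))
```

The same two functions could be applied to the complete Gene sequence field to compare against the Gene Length and GC content fields of the record.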