Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_2555 |
Symbol | |
ID | 7873994 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 2755949 |
End bp | 2757253 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643699477 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_002889534 |
Protein GI | 237653220 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0238485 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGACGATG GGTTCGGCGC CATCCTAAGA TCGCTGCAGA TCCGCTCGGC CTATCGCCAC CGCTACGCCA TGCCCGCACA CTCCGCCGCT CGTCCCGGCC CCTCGCCCGA CCAGCTCGAG GCCGGTACGC CCGGCTACCG CCGCGCCAAC CTGGCGATGT TCGTCGGCGG CTTCGCGACC TTCGCCATGG TCTATAGCAC CCAGCCCCTG CTGCCCCTGC TCGCTGCCGA ATTCGGTGTG GGTGCGGCGA GTGCCAGCCT GACGGTGTCG GCCACCACTG CCGGACTCGC GCTGATGCTG ATTCCGGCCA GCGTGCTCGC CGACCGCCTC GGCCGCCAGC AGGTCATGAA GGCGGCGCTT GTGATCGCCG CCCTGTTCGC GCTGGCCACC GCCTTTGCGC CCGACTACGG CTCGCTGCTG GTCTTGCGCA CACTCCTCGG CGGGGTCATC GCCGGGCTCC CCGCCGCCGC CATGGCCTAC ATCGGCGAGG AGATCGCCCC CGGCGCGCAG GCACGCGCGA TGGGGCTCTA CATTGCCGGC AACGCCCTCG GCGGCATGAG CGGACGCTTC GTCGCCGCGC TGCTGACCGA CTGGAGCTCC TGGCGGGTCG CGCTCGGGAT GATCGGCGTG CTGGGGGCGG TCTCCGCACT GGTGTTCTGG CGCCGCCTGC CGGCCTCGCG CAACTTCCGC GCACGCGCCG CCACGCCGGC GCGGATCTTC GCGGACACTC TGGCGATCTA CCGCGATCCC GGCCTGCCCG CGCTCTTCCT GGTCGCCTTC CTCGCCATGG GCGGCTTCGT CGGGCTGTAC AACTTTCTCG GCTTCCGCCT GCTCGAGCCG CCCTACGGGC TGGGCCAGAG CGCGATCGGC GCGATCTTCC TGCTCTACCT GGTCGGCACC TGGGCCTCGG CGGCGAGCGG CCGCCTCGCC GAGCGCCGCG GCCGGCGCAA GGTGCTGTGG CCGATGGTCG CGGTGATGGG CGCCGGGCTC GCGCTCACCC TGGCGCGGCC GCTGTGGCTG ATCATCGCCG GCGTCGGCGT GTTCACCTTC GGCTTCTTCG CGGCGCACGC CCTCGCCAGC GGCTGGGTGG GCCGGCGCGC CGGCGAACGG CGCGCACTCG CCTCGGCGCT TTATCTGTCG AGCTACTACC TCGGCGCCAG CGTGCTCGGC AGCCTTGCCG GCACGGCCTG GGCCGATTGG CGATGGGCGG GCGTGGCCGC GCTGGTGGGG CTGTGCGTGC TGTTCACCGG AGCCTGCGCG CTGTGGCTGC GGCGACTGCC GGCGCCCGCG GGCGGTACGG ACTGA
|
Protein sequence | MDDGFGAILR SLQIRSAYRH RYAMPAHSAA RPGPSPDQLE AGTPGYRRAN LAMFVGGFAT FAMVYSTQPL LPLLAAEFGV GAASASLTVS ATTAGLALML IPASVLADRL GRQQVMKAAL VIAALFALAT AFAPDYGSLL VLRTLLGGVI AGLPAAAMAY IGEEIAPGAQ ARAMGLYIAG NALGGMSGRF VAALLTDWSS WRVALGMIGV LGAVSALVFW RRLPASRNFR ARAATPARIF ADTLAIYRDP GLPALFLVAF LAMGGFVGLY NFLGFRLLEP PYGLGQSAIG AIFLLYLVGT WASAASGRLA ERRGRRKVLW PMVAVMGAGL ALTLARPLWL IIAGVGVFTF GFFAAHALAS GWVGRRAGER RALASALYLS SYYLGASVLG SLAGTAWADW RWAGVAALVG LCVLFTGACA LWLRRLPAPA GGTD
|
| |