Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3931 |
Symbol | |
ID | 7873577 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 4327373 |
End bp | 4328626 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643700868 |
Product | urea ABC transporter, urea binding protein |
Protein accession | YP_002890891 |
Protein GI | 237654577 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence [TIGR03407] urea ABC transporter, urea binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.606906 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACCGTC GCAACTTCGT CAAAGCCCTC ACGCTTTCGG CTTCCATCGC CGCGATCGGC CTGCCCGCCG GCGCGCACGC CGCCGACACC ATCAAGGTCG GCATCCTGCA TTCGCTGTCG GGCACGATGG CGATCTCCGA GACCGCGCTC AAGAACGTGG CGCTGATGAC CATCGAGGAG ATCAACGCCG GCGGCGGCGT GCTCGGCAGG AAGCTCGAGC CGGTGGTGGT CGACCCGGCC TCGAACTGGC CGCTGTTCGC CGAGCGCGCG CGCCAGCTGC TGGCGCAGGA CAAGGTCGCG GCGGTGTTCG GCTGCTGGAC CTCGGTGTCG CGCAAGTCGG TGCTGCCGGT GTTCAAGGAG TTGAACGGCC TGCTCTTCTA CCCGGTGCAG TACGAGGGCG AGGAGCTCGA GAAGAACGTC TTCTACACCG GCGCCGCGCC CAACCAGCAG GCGATTCCCG CGGTCGAGTA CCTGATGAGC GAGGAAGGCG GCGGCGCGAA GCGCTTCGTG CTGCTCGGCA CCGACTACGT GTATCCGCGC ACGACCAACA AGATCCTGCG CGCCTTCCTG AAGAGCAAGG GGGTGAGCGA TGCGGACATC CTGGAGGACT ACACGCCCTT CGGCCACGCC GACTACCAGA CCATCATCGC GCGCATCAAG CAGTTCGCCT CCGAGGGCAA GAAGACGGCC GTGGTGTCGA CCATCAACGG CGACTCCAAC GTGCCCTTCT ACAAGGAACT GGGCAACGCC GGACTGAAGG CGACGGACGT GCCGGTGGTG GCCTTCTCCG TCGGTGAGGA GGAGCTGCGC GGCGTCGACA CCAAGCCCCT GCTCGGCCAC CTCGCGGCGT GGAACTACTT CATGTCGGTC GACAACCCGC AGAACAAGGC CTTCATCGAC AAGTACCGCG CGTGGGCGAA GAAGAACGGC GTGCCCAACG CCGACACCGT GGTCACCAAC GACCCGATGG AGGCCACCTA CGTCGGCCTG CACATGTGGA AGCAGGCGGT CGAGAAGGCC GCCAGCACGG ACGTCGACAA GGTCATCGCG GCGATGGGCG GGCAGAGCTT CAAGGCGCCG TCGGGCTTCA CGCTGACCAT GGACGCGACC AATCACCACC TGCACAAGCC GGTGCTGATC GGCGAGGTGC GCGCGGACGG CCAGTTCGAC GTGGTATGGC AGACCAAGGG GCCGATCCGC GCCCAGCCGT GGAGCCCGTT CATCGAGGGC AACGAGGGCA AGCAGGGGCT GTGA
|
Protein sequence | MNRRNFVKAL TLSASIAAIG LPAGAHAADT IKVGILHSLS GTMAISETAL KNVALMTIEE INAGGGVLGR KLEPVVVDPA SNWPLFAERA RQLLAQDKVA AVFGCWTSVS RKSVLPVFKE LNGLLFYPVQ YEGEELEKNV FYTGAAPNQQ AIPAVEYLMS EEGGGAKRFV LLGTDYVYPR TTNKILRAFL KSKGVSDADI LEDYTPFGHA DYQTIIARIK QFASEGKKTA VVSTINGDSN VPFYKELGNA GLKATDVPVV AFSVGEEELR GVDTKPLLGH LAAWNYFMSV DNPQNKAFID KYRAWAKKNG VPNADTVVTN DPMEATYVGL HMWKQAVEKA ASTDVDKVIA AMGGQSFKAP SGFTLTMDAT NHHLHKPVLI GEVRADGQFD VVWQTKGPIR AQPWSPFIEG NEGKQGL
|
| |