Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1517 |
Symbol | |
ID | 7083599 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 1695128 |
End bp | 1696276 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643698534 |
Product | Extracellular ligand-binding receptor |
Protein accession | YP_002355171 |
Protein GI | 217969937 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.702531 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGTTGC GTAACACCCT GCTGGCCGCC CTCGGCCTCG CCTTCGCCGC CACTGCCGCC CATGCCGAGA TCAAGGTCGG TGTCGTGCTC TCGGCCACCG GCCCCGCCGC TTCGCTGGGT ATCCCGGAAA AGAACACGAT CGCGCTGCTG CCGGCCACCA TCGGCGGCGA AAAGGTGAGC TACATCGTGC TCGACGACGC CTCCGACACC ACCACGGCGG TCAAGAACGC CCGCAAGCTG ACCGTCGAGG ACGGCGTCGA TGTGATCATC GGCTCCACCA CCAGCCCCGC TTCGCTGGCC ATGGTCGACG TCGCCGCCGA GACGAAGACG CCGATGATCT CGATGGCCGC GTCGGCGCGC ATCGTCGCGC CGATGGACGA CAAGAAGCGC TGGGTCTTCA AGACCCCCCA GAACGACCAG CAGATGGCGT CGGCGATCGT CGAGCACATG GTCGCCAACA AGGTGAAGAA GGTGTCCTTC ATCGGCTTCG CCAACGCCTA CGGCGAGGGC TGGTACGAGC AGTTCAAGAA GCTGGCCGAA GCCAAGGGCA TCGAGATCGC CGCCAGCGAG CGCTTCAACC CGGCCGACAC CTCGGTGACC GGCCAGGCGC TCAAGCTGAT GTCGGTGAAG CCGGACGCGG TGTTCATCGC CGGCTCGGGC ACGCCCTCGG CGCTGCCGCA GAAGACGCTG CGCGAGCGCG GCTACAAGGG GCCGATCTAC CAGACCCACG GCGTGGCCAA CAACGACTTC CTGCGCATCT GCGGCAAGGA CTGCGAAGGC ACGCTGCTGC CGGTCGGCCC GGTGCAGATG GCGCGCAGCC TGCCCGACAG CCACCCCGTC AAGGCGAGCG CGCTGGCCTA CGTGGAGAAG TACGAGGCCG CCAACGGCGC GGGCTCGGTG TCGAGCTTCG GGGCCTACGC GTGGGACGCC GGGGTGTTGC TGCAGGCGGC CGTCCCTGCC GCGCTCAAGG CGGCCAAGCC GGGCTCGGCG GAGTTCCGCA CGGCGCTGCG CGATGCGCTC GAGGGCGTGA AGGAAGTCGC CGGCGCCACC GGCATCTACA CGATGAGCCC CGACGATCAC CTCGGCCTGG ACGACCGCTC GCGCGTGATG ATCGAGATCC GCAACGGCAC CTGGTCGCTG CTGAAGTAA
|
Protein sequence | MKLRNTLLAA LGLAFAATAA HAEIKVGVVL SATGPAASLG IPEKNTIALL PATIGGEKVS YIVLDDASDT TTAVKNARKL TVEDGVDVII GSTTSPASLA MVDVAAETKT PMISMAASAR IVAPMDDKKR WVFKTPQNDQ QMASAIVEHM VANKVKKVSF IGFANAYGEG WYEQFKKLAE AKGIEIAASE RFNPADTSVT GQALKLMSVK PDAVFIAGSG TPSALPQKTL RERGYKGPIY QTHGVANNDF LRICGKDCEG TLLPVGPVQM ARSLPDSHPV KASALAYVEK YEAANGAGSV SSFGAYAWDA GVLLQAAVPA ALKAAKPGSA EFRTALRDAL EGVKEVAGAT GIYTMSPDDH LGLDDRSRVM IEIRNGTWSL LK
|
| |