Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_2471 |
Symbol | |
ID | 7874154 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 2668379 |
End bp | 2669494 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643699393 |
Product | Extracellular ligand-binding receptor |
Protein accession | YP_002889450 |
Protein GI | 237653136 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAGAA CCGTCGTCGC AGTCGCCCTG ATGAGCATCG GCCTCAGTGC CGCCCAAGCG CAGGAAGTGG TCGTCAAGCT GGGCGGGGCC GCCCCGCTCA CCGGTAACCA GTCGCACCTT GGCAAGGATC TCGAGAACGG CACCCGCCTC GCCATCGAGG AGGCCAACGC CAAGGGCGTG ACGATCGGCG GCAAGAAGGT CAAGTTCGAG CTGGTCGGCG AGGACGACCA GGCCGACCCG CGCACCGGCA CCACGGTGGC GCAACGCCTG GTCGATGCCG GGGTGAAGGG GGTGATCGGG CATCTGAACT CGGGCACCTC GATTCCGGCC TCGCGCATCT ACGACCAGGC GGGCATTCCG CAGGTGTCGC CGGCCTCGAC CAACCCGAAG CTGACGCTGC AGAACTTCAG CGGCGTGTTC CGCACCATCG CCAACGACGT GCAGCAAGGT ACCGGGCTCG GCAAGTACGC CACCGGCGCG CTCGGCGCCA AGCGCGTCGC GATCATCGAC GACCGCACCG CCTACGGCCA GGGCCTGGCC GACGAGACCG CGAAGGCGGT GAAGGAGACG GGCGGCGAGG TCGTCGCGCG CGAATTCACC ACCGACAAGG CCACCGACTT CAACGCCATC CTCACCAAGA TCCGCGCCAC CAACCCCGAG GTCGTCTTCT ACGGCGGCAT GGACGCCCAG GCCGGCCCGA TGGTGCGGCA GATGAAGCAG CTCGGCATCA CCGCCAAGTT CCTCACCGGC GACGGCGGCT GCAGCCCCGA GATGATCAAG CTCGCCGGCG ACGGCATGTC GTCCAGCGCC TACTGCTCGA TGCCGGGCCT GCCGCTGGAG AAGATGCCGG GCGGCGCCGA CTTCCGCGAG CGTTACAAGA AGCGCTTCAA CGCCGACGTG CAGGTGTATG CCCCCTACGC CTACGACGCC GCGATGGCCA TCATCACCGC GATGCAGAAG GCCGACTCGG TCGAGCCGTC CAAGTACCTG CCCGAGCTCA AGAAGAGCAA CTTCCCGGGC GTGATCGGCA ACGTCTCCTT CGACGCCAAC GGCGACATCA AGGAAGGCGC GGTGACGATC TACAACTTCA AGGACGGCGC CTGGGTGCCG CTGTAA
|
Protein sequence | MKRTVVAVAL MSIGLSAAQA QEVVVKLGGA APLTGNQSHL GKDLENGTRL AIEEANAKGV TIGGKKVKFE LVGEDDQADP RTGTTVAQRL VDAGVKGVIG HLNSGTSIPA SRIYDQAGIP QVSPASTNPK LTLQNFSGVF RTIANDVQQG TGLGKYATGA LGAKRVAIID DRTAYGQGLA DETAKAVKET GGEVVAREFT TDKATDFNAI LTKIRATNPE VVFYGGMDAQ AGPMVRQMKQ LGITAKFLTG DGGCSPEMIK LAGDGMSSSA YCSMPGLPLE KMPGGADFRE RYKKRFNADV QVYAPYAYDA AMAIITAMQK ADSVEPSKYL PELKKSNFPG VIGNVSFDAN GDIKEGAVTI YNFKDGAWVP L
|
| |