Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3135 |
Symbol | |
ID | 7874277 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3391230 |
End bp | 3392417 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643700063 |
Product | Extracellular ligand-binding receptor |
Protein accession | YP_002890109 |
Protein GI | 237653795 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTCGGAA AGGACAGGAA GCTGCGCCTC GCAGTGAGCG CGCTCGGGCT CGGCCTGGGC ATGTTGCTGG CGCAGGGCGC GGCGGAGGCG GCGGACAAGG TCAAGGTGGG GCTGATGCTG CCCTACACGG GCACCTACGC CTCGCTCGGC AACGCGATCA CCAACGGCTT CAAGCAGTAC GTGGCGGAGC AGGGCGGCAA GCTCGGCGGT CGCGAGGTCG AGTATTTCGT CGTCGATGAC GAGTCCGATC CGGCCAAGGC GACCGAGAAT GCCAACAAGC TGGTCAAACG CGACAACGTC GACGTGCTGG TCGGCACCGT GCATTCGGGC GTGGCGCTGG CGATGGCCAA GGTCGCGCGT GACTCCAAGA CCCTGATGAT CATCCCGAAC GCGGGCGCGG ACGAGCTTAC CGGTCCGCTG TGCGCGCCGA ATGTGTTCCG CACCTCGTTC TCGGCCTGGC AGCCGGCGAA TGCGATGGGC AAGGTGGTCG CCGAACGCGG CCACAAGAAC GTGGTCACCC TGGCCTGGAA GTACTCCTTC GGCGAACAGT CGGTCGCCGG CTTCAAGGAG TCCTTCGAGC AGGCCGGCGG CAAGGTGGTC AAGGAGCTCT ACCTGCCCTT CCCGAACGTG GAGTTCCAGC CCTTCCTGAC CGAGATCGCC AACCTCAGGC CGGATGCGGT GTTCGTATTC TTCTCCGGCG CCGGGGCGGC GAAGTTCGTC AAGGACTACG AGGCGGCAGG CCTGAAGGCG GGCCTCCCGC TCTACGCCCC GGGCTTCCTC ACCGACGGCA CGCTGGAGGC CATGGGCGGC GCCGGCGAGG GCGTGCTGAC CACGCTGCAC TACGCCGACG GCCTCGACAA CGCCAGGGAG AAGTCCTTCC GCAGCGGCTA TGTCGCGGCC TACAAGGCGC AACCCGACGT CTTCGCGGTG CAGGGCTACG ACAGTGCGCA GCTGCTCGCC GCGGGGCTCT CGGGCGCGCC GGCCGGAGCC TTCGACAAGG AGGCGGTCAT GAAGGCGATG AGCGCCGCCA CCATCGACAG TCCGCGCGGT AGCTTCACCC TGTCCAAGGC CAACAACCCG GTGCAGGACA TCTACCTGCG CAAGGTCGAG GGCGGGCAGA ACAAGGTGAT CGGCGTCGCC GCACCCAAGC TCGCCGACCC GGCGCGCGGC TGCAAGCTCA TGAACTGA
|
Protein sequence | MVGKDRKLRL AVSALGLGLG MLLAQGAAEA ADKVKVGLML PYTGTYASLG NAITNGFKQY VAEQGGKLGG REVEYFVVDD ESDPAKATEN ANKLVKRDNV DVLVGTVHSG VALAMAKVAR DSKTLMIIPN AGADELTGPL CAPNVFRTSF SAWQPANAMG KVVAERGHKN VVTLAWKYSF GEQSVAGFKE SFEQAGGKVV KELYLPFPNV EFQPFLTEIA NLRPDAVFVF FSGAGAAKFV KDYEAAGLKA GLPLYAPGFL TDGTLEAMGG AGEGVLTTLH YADGLDNARE KSFRSGYVAA YKAQPDVFAV QGYDSAQLLA AGLSGAPAGA FDKEAVMKAM SAATIDSPRG SFTLSKANNP VQDIYLRKVE GGQNKVIGVA APKLADPARG CKLMN
|
| |