Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_2561 |
Symbol | |
ID | 7874000 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 2761899 |
End bp | 2762690 |
Gene Length | 792 bp |
Protein Length | 263 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643699483 |
Product | extracellular solute-binding protein family 3 |
Protein accession | YP_002889540 |
Protein GI | 237653226 |
COG category | [E] Amino acid transport and metabolism [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain |
TIGRFAM ID | [TIGR01096] lysine-arginine-ornithine-binding periplasmic protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.52541 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAT ACGCCCTTCT GTGCGCGGTG CTCGCCCTTC ACGCCGGCGC CGGTTTTGCC AAGGACTGGA ACGAGATCCG CCTGGCGTCC GAGGGCGCCT ACCCTCCCTT CAACCTCATC GCCGCCGACG GAAGCCTGCA AGGTTTCGAT ATCGACATCG GCAATGCCTT GTGCGAGGAG ATGAAGGCCA AGTGCACCTG GGTGAAGCAG GAGTGGGACG GCATGATCCC GGCGCTGGTC TCGCGCAAGT TCGATGCGAT CATTGCCTCG ATGTCGATCA CCGACGAGCG CAAGGCCAAG GTGGATTTCA CCGAGAAGTA CTATGCGTCG CCGCTGGCCC TGATCGCGAA GAAGGGCTCG ACCCTGCGGC CCGATCTCCC CTCCCTGGCC GGGAAGAAGG TCGGCGTGCA GCGCGGCACG GTATCGGACA ACTTCGCCAC CAAGTATTGG GACGGCAAAG GCATGCAGAT CATCCGTTAC GCCAAGCAGG ACGAGGCCTA CCTCGACCTG CGCGCCGGAC GTCTCGATGC GGCCTTCTCC GATTACCTCG AGGCCTACGG CGGTTTCCTG ACCAAACCCG AGGGCGCGGG CTACGACGTC GCCGGCGAAC GCCTCTTCGG CAAGGACGCC GACGAGAAGG CGGTGATCGG CGAGGGCATC GGCATCGCGG TGCGCAAGCG CGACAAGGAG CTCACCGAGA AGCTGAACCA GGCACTCGGC GCGATCCGCG CCAACGGCAA GTACGATGCG ATCCAGAAGA AGTACTTCCC GATGGACATC TACGGCAACT GA
|
Protein sequence | MKKYALLCAV LALHAGAGFA KDWNEIRLAS EGAYPPFNLI AADGSLQGFD IDIGNALCEE MKAKCTWVKQ EWDGMIPALV SRKFDAIIAS MSITDERKAK VDFTEKYYAS PLALIAKKGS TLRPDLPSLA GKKVGVQRGT VSDNFATKYW DGKGMQIIRY AKQDEAYLDL RAGRLDAAFS DYLEAYGGFL TKPEGAGYDV AGERLFGKDA DEKAVIGEGI GIAVRKRDKE LTEKLNQALG AIRANGKYDA IQKKYFPMDI YGN
|
| |