Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1650 |
Symbol | |
ID | 7084069 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 1850733 |
End bp | 1851995 |
Gene Length | 1263 bp |
Protein Length | 420 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643698670 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002355301 |
Protein GI | 217970067 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0169608 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAAGA TGCGTTTCAA GGCCTTGGCC GCAGCGGTCG CTGCCGGCGG TCTCTTCTTC GCGCCCGGCT CGGTGGTCCA GGCCAGCGCC CAGCAGCAGG TCCAGCTCTT CCATCGCCTG CCCGATGCCA AGGCGGGGGC GCTGAAGGAC TTGGTCGAGC GCTTCAACGC GCAGTCCAAG GACGTGCAGG TCGTGATGTC CGCGGCGGAC TGGCGCTCGG GCGCGCCCCA CCTGATGATC CTCGAGGGCG ACGACGAGGA GGAGTTCGTC GCCGGCAAGC CGCGCTTCAA GCCGCTCTTC CAGCTCATGA AGGAAAGTGG CGTGCCGCTG CAGACCCTGC GTCCGCCCGC GATGATGACG CGCACCCCGG TCGATGCCAA GGGACAGCTG CTGGCGCTGC CGGTCGGCCT GTCGACCCCG GTGCTGTTCC TCAACCGCGA CGCGCTGCGC CAGGCCGGCC TCAACCCGGA GACCACCCGG ATCAACACCT GGTTCGATCT GCAGGAAACC CTCGGGCGCC TCGCCGATAC CGGTCACACC TGCCCCTATA CCGTGGCCGA GCCCGGCCGC GTGATGGTGG AGAACCTCTC GGCCTGGCAT AACGAGCCGG TGGCCGCGCA GAGCGGCAAG ACCACCGTAC CGAGCTTCAA CGGCATGTTT CAGGTCAAGC ATGTGGCGAT GATGGCGAGC TGGACGCGTG CGCGCTACCT GCACGTGTTC GACCAGCAGG CCGAGGCCGA GCAGCGCTTC GCGCGCGGCG AGTGCGCGGT GATCGCCGCG CCCTCGGCGA GCTGGACCGA CTTCCGCCGT GCCGGCAAGG TCGATGTGGC GGTATCCAAG CTGCCCTACT ACGACGACTT CCCCGGTGCG CCGCAGAACA CCATCGCCGA CGGTCCCGCA CTGTGGGCCT CGGCAGGCAA GAAACCGGCC GAATACAAGG CCGTGGCGCG CTTCGTGAGT TTCTGGCTGC AGCCCGACAA CCAGGTCGCG TGGCAGCGCG AGACCGGCTA CCTGCCGCTC AACCGCGCCG GCCTGCTGGC CTCGCGCAGC GAGCTGCTCG GCAACGACCT CGAGAACATC CAGGTCGCGG TCGACCAGCT CGGCGGCAAG CCCGCCACGC CGCAGTCGTC GGCGCAGCCC GTGGTCGAGC GCCAGAAGGT GCGCCGCATC CTCGATGAGG AACTCGCCGG CGTATGGGCC GACGAGAAGG CCGCGAAGGA AGCGCTCGAC AACGCGGTGA CGCGCGCCCG CAACGGCAAC TGA
|
Protein sequence | MNKMRFKALA AAVAAGGLFF APGSVVQASA QQQVQLFHRL PDAKAGALKD LVERFNAQSK DVQVVMSAAD WRSGAPHLMI LEGDDEEEFV AGKPRFKPLF QLMKESGVPL QTLRPPAMMT RTPVDAKGQL LALPVGLSTP VLFLNRDALR QAGLNPETTR INTWFDLQET LGRLADTGHT CPYTVAEPGR VMVENLSAWH NEPVAAQSGK TTVPSFNGMF QVKHVAMMAS WTRARYLHVF DQQAEAEQRF ARGECAVIAA PSASWTDFRR AGKVDVAVSK LPYYDDFPGA PQNTIADGPA LWASAGKKPA EYKAVARFVS FWLQPDNQVA WQRETGYLPL NRAGLLASRS ELLGNDLENI QVAVDQLGGK PATPQSSAQP VVERQKVRRI LDEELAGVWA DEKAAKEALD NAVTRARNGN
|
| |