Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0161 |
Symbol | |
ID | 7085258 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 186140 |
End bp | 187246 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643697203 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002353852 |
Protein GI | 217968618 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1840] ABC-type Fe3+ transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGATCA GGGGTGGGAG GGCCGCGCTG AAGACGGTGC TGGCGGTCGT GGCGGCGGTG GCGATCGCCC CGGCGCGGGC CGAGCCGATC GCCGATCGCT TCGATCAGGC ACGCCTGGAG GGCGTGGTGG TCGTCTATGC GGCGACCGAC CTCGCGGTGG TCAAGCCGGT CATCGACGAC TTCGAGGCCC TCCATCCCGG CGTTCGGGTG CAGTACCACG ACATGCACTC GGCCGAACTC CATGCGCGCG TGGTCGACGA GGCCCGGCGC GGGCTGGCCG GTGCCGACGT GGTGTGGAGC TCGGCGATGG ACCTGCAGGT GAAGCTGGTC AACGACGGCC ACGCCCAGCC GCACCGCTCC GCCGAGACCG CGGCGCTGCC GCGCTGGGCG GTGTGGAAGG ACGAGGCCTT CGGCACCACC TACGAGCCGG CGGTGATCGT CTACAACAAG CATCTGCTCG GCACGACCGA GGTGCCCGAC AGCCATGCGG AGCTGATCCG CCTGCTCGAT CGCGACCCGG CGCCGTTGCG CGGGCGCATC GCCACCTACG ACCCCGAGCG CTCCGGCCTC GGCCTGCTGC TGCACACGCA GGACGCGCAG GCCAACCCGA TCGTGTTCTG GCAGCTCGCG CGCGGCATGG GCCGGCAGGG CCTGGAGCAG CACGCGGCGA GCAGCGAGAT GCTCGACCGC GTCGCCGCGG GCAAGCTGGT GCTCGCCTAC AACGTGCTGG GCTCGTACGC GCACCGGCGG GCGCGCAGCG ATCCGGCGCT CGGGGTGGCG CTGCCGCGGG ACTACACGCT GGTGCTGAGC CGGGTCGCCT TCATCGTGCG TGGCGCGCGT CATCCGGCGG CGGCGCGCCT GTGGCTCGAT CATCTGCTGT CGACCCGAGG CCAGGCCCTG CTCGCCGCCA ACCTCGGCCT GCTGCCGGTG CGCACCGACG CCGGCACCGC GGGCGCGGAC AGCGCTGCCG CGCTGCTCCA CCACAACCTG CAGCATGCCT TCCGCCCGAT CCGCATCGGC TCCGGGCTGC TCGCCTACCA GGACCAGGCC AAGAAGCAGG CCTTCCTGCG CCAGTGGGAT GCGGCGACGC GGCCGGCTTC CGAGTGA
|
Protein sequence | MRIRGGRAAL KTVLAVVAAV AIAPARAEPI ADRFDQARLE GVVVVYAATD LAVVKPVIDD FEALHPGVRV QYHDMHSAEL HARVVDEARR GLAGADVVWS SAMDLQVKLV NDGHAQPHRS AETAALPRWA VWKDEAFGTT YEPAVIVYNK HLLGTTEVPD SHAELIRLLD RDPAPLRGRI ATYDPERSGL GLLLHTQDAQ ANPIVFWQLA RGMGRQGLEQ HAASSEMLDR VAAGKLVLAY NVLGSYAHRR ARSDPALGVA LPRDYTLVLS RVAFIVRGAR HPAAARLWLD HLLSTRGQAL LAANLGLLPV RTDAGTAGAD SAAALLHHNL QHAFRPIRIG SGLLAYQDQA KKQAFLRQWD AATRPASE
|
| |