Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0446 |
Symbol | |
ID | 7084956 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 507855 |
End bp | 509021 |
Gene Length | 1167 bp |
Protein Length | 388 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643697478 |
Product | Extracellular ligand-binding receptor |
Protein accession | YP_002354121 |
Protein GI | 217968887 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCCCT TGCGTCGGCT TGCGCTCGCG CTCGCCTCCT CGCTGTCCCT CGCCTGCGCC CTGCCCGCCA CGGCGGCGGG GGTGAGCGCC GATGAAATCG TCGTCGGCAC CGTGTCCGAC CTCTCCGGCC CGATCGCCAT GCTCGGCGTG CCCGTGCGCG ACGGCATGCT GATGCGCTTC GACGAGGCCA ACGCCGGCGG TGGCGTGCAT GGGCGCAAGA TCCGCCTTGC GGTCGAGGAC GCCGGCTACG ACCCCAAGCG CGCGGTGCTG GCCGCGCGCA AGCTGGTGCA GCACGATCAG GCCTTCGCCT TCATCGCCAA CATGGGCACG CCGGTGGTCA TGGCGAGCAT GCCGATCATC GTCGATGCCG GCCGCCTGCA CTTGTTCCCC TTCTCGCCCC ACCGTGCGAC CTACGAGCCG CTGCACCCGC TCAAGTTCCA GAACTTCGCG CCCTATCAGG ACTACATGGA AGCCGCCACC CGCCACATGG TGCGCGAGCG CGGCTACCAG CGCAGCTGCC TGCTCTACCA GGACGACGAC TACGGCCTGG AAGTCATGAA GGGCGTCGAA AAGGCGCTGG CCGGGCTCGG CACCGAGCTC GTCGAGCGCA CCAGCTACAA GCGCGGCGCC ACCGATTTCT CCAGCCAGAT CGCGCGCCTG CGCGCCGCGC GCTGCGACCT CGTGGTGCTC GCCACCGTGG TGCGCGAGAC CGTCGCCGCC ATGTCCGAAG CGCGCAAGAT CGGCTGGGAC GTCGACATGC TCGTCACCGC CTCCGGCTAT TCGGCGCAGA CCCACGAGCT CGGCGGCGCC GCGGTCGAGG GCCTGTACGG CGTCTCGGTG CTGCCGCACC CCTACGCCGA GGGCGCGAAC AGCCAGCTCG CCGCCTGGAT CGAGCGCTAT CGCGCGCGCT TCAACACCGA ACCCAACGTG TGGAGCGTGA TGGGCTACAC CCTCGCCGAC CTGTTCGTGC GCACCGCCGA GGCCACCGGG CGCGAGCTCA CCCCCGAGCG CTTCGCGCAC ACGCTCGAAG GCATGGCGTT CACCCGCGAC TACTTCGGCA GTCCCGCCTA CCGCTTCAGC GCCGACGACC ACCTCGGCAA CCGCAAGGGC CGCCTCGCCC AGATCCGCAA CGGACGCTGG GAGCTGATCA CCGACTACCT GGAGTGA
|
Protein sequence | MSPLRRLALA LASSLSLACA LPATAAGVSA DEIVVGTVSD LSGPIAMLGV PVRDGMLMRF DEANAGGGVH GRKIRLAVED AGYDPKRAVL AARKLVQHDQ AFAFIANMGT PVVMASMPII VDAGRLHLFP FSPHRATYEP LHPLKFQNFA PYQDYMEAAT RHMVRERGYQ RSCLLYQDDD YGLEVMKGVE KALAGLGTEL VERTSYKRGA TDFSSQIARL RAARCDLVVL ATVVRETVAA MSEARKIGWD VDMLVTASGY SAQTHELGGA AVEGLYGVSV LPHPYAEGAN SQLAAWIERY RARFNTEPNV WSVMGYTLAD LFVRTAEATG RELTPERFAH TLEGMAFTRD YFGSPAYRFS ADDHLGNRKG RLAQIRNGRW ELITDYLE
|
| |