Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3551 |
Symbol | |
ID | 7873057 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3891671 |
End bp | 3893266 |
Gene Length | 1596 bp |
Protein Length | 531 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643700492 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002890522 |
Protein GI | 237654208 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.186732 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAAGA CCCGACTTGC CCTCGCCCTG CCTTTCGCCC TCGCGACGCT CGCCAGCGTG GCGGCCGCCC CCGCGGCTGC ACAGACCGTG CGCTGGGCGG CGGCCGGGGA CGCGCTGACC ATGGACCCGC ATTCTCAGAA CGAGGGCCCG ACCCACGTCA TGAACCACCA GGTCTACGAC TCGCTGGTGT TCCGCGACCA GGCCATGAAG CTGGCGCCGC GGCTGGCGAC CGCGTGGAAG ATCACCGAGG ACCCCAACGT GTGGGAGTTC AAGCTGCGCG AGGGCGTGAA GTACCACAAC GGCAACCCCT TCACGGCCGA CGACGTGGTG TTCTCCATCC AGCGTGCCAA GCACGAGAAC TCGGACATGA AGGGCCTGCT CACCAGCGTG GTCGAGGTGG TGAAGGTCGA CGAGCACACC GTGCGCATGC GCACCGACGG CCCCAACCCG CTGCTGCCCA ACAACCTGAC CAACCTCTTC ATCATGGACC GCGAGTGGTC CGAGGCGAAC AAGGTCATGC TGCCGCAGAA CTACAAGGCC GGCGACGAGA CCTTCGCGGT GCGCAATGCC AACGGCACCG GTCCCTTCCG GCTCGTGAAG CGCGAGCCCG ACGTGCGCAC CGAACTCGAA CGCAACGAGG ACTACTGGGG CAAGGGGACC TATCCCATGG AGGTGGCCAA GGTGGTGTTC ACCCCGGTGC GCTCGGCCGC CACCCGGGTC GCGGCGCTGC TCTCGGGCGA GGTCGATTTC CTCCTCGACC CGCCGGTGCA GGACCTCGAG CGCCTGTCCG CGGCCAAGGG TATCGTGGTG CGCTCCGGGC CGGAGAACCG CACGATCTTC CTCGGCATGA ACCAGGGCGC GGCGGAGCTG CGCAGCGCGG ACGTGAAAGG CAGGAACCCC TTCGCGGACA AGCGCGTGCG CGCGGCGATG AACATCGCGA TCAACCGCGA CGCGGTGAAG CGCGTGGTGA TGCGCGGCCA GTCGGTGCCC GCCGGCATCG TCGCGCCGCC CTTCATCGAC GGCTACGACA AGGCGATGGA CGTCGTGCCG GCGCCCGACG TCGCGCGCGC CAAGGCGCTG CTCGCCGAGG CCGGCTACCC GAACGGCTTC GCGGTGACGC TGTCCTGCCC CAACGACCGC TATGTGAACG ACGAGGCCAT CTGCCAGGCG GTGACCGGCA TGTTCGGCCA GATCGGCGTC AAGGCGCGGC TGGACGCACG GCCCAAGAGC ATCCACTTCG CCGAGCTGCC CAAGGGCGAG CTCGACCTCT ACATGCTCGG CTGGGGTGTG CCGACCATGG ACTCGCACTA CGTCTTCCAT TACCTCTACG AGACCAGGAC CGACAAGGGC GGCTCGTGGA ACGTGACCGG CTATTCCAGC GCGAAGGTGG ACGAGCTGAC CAAGGCGATG AACCGCGAGA TCGACCTCGG CAAGCGCGCC GGCATGGTCG CCGAAGTGTG GAAGACGGTG CAGGACGACG TCGTCTACCT GCCGATCCAC CATCAGATGC TGAACTGGGC GATGAAGGAC GACATCGACT TCCCGGTGCA GTCGGAGAAC TATCCCTACT TCAAGCTGTT GAAGTACAGG AAGTGA
|
Protein sequence | MNKTRLALAL PFALATLASV AAAPAAAQTV RWAAAGDALT MDPHSQNEGP THVMNHQVYD SLVFRDQAMK LAPRLATAWK ITEDPNVWEF KLREGVKYHN GNPFTADDVV FSIQRAKHEN SDMKGLLTSV VEVVKVDEHT VRMRTDGPNP LLPNNLTNLF IMDREWSEAN KVMLPQNYKA GDETFAVRNA NGTGPFRLVK REPDVRTELE RNEDYWGKGT YPMEVAKVVF TPVRSAATRV AALLSGEVDF LLDPPVQDLE RLSAAKGIVV RSGPENRTIF LGMNQGAAEL RSADVKGRNP FADKRVRAAM NIAINRDAVK RVVMRGQSVP AGIVAPPFID GYDKAMDVVP APDVARAKAL LAEAGYPNGF AVTLSCPNDR YVNDEAICQA VTGMFGQIGV KARLDARPKS IHFAELPKGE LDLYMLGWGV PTMDSHYVFH YLYETRTDKG GSWNVTGYSS AKVDELTKAM NREIDLGKRA GMVAEVWKTV QDDVVYLPIH HQMLNWAMKD DIDFPVQSEN YPYFKLLKYR K
|
| |