Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0425 |
Symbol | |
ID | 7084936 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 485591 |
End bp | 486631 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643697457 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002354100 |
Protein GI | 217968866 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0687] Spermidine/putrescine-binding periplasmic protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCACG CCGCATCCTT CCTTCGCGCC GCCGTCCTCG TCGCCCTCGG GGGCGCCGTC GGCACCGCAT CGGCCGACAT CACCGTCATC TCCTTCGGCG GCGCCTCGCA GAAGGCGCAG GACAAGGCCT ACTACGCGCC CTTCACCAAG GCCACCGGGG TCAAGGTCGT CGCCGGCGAG TACAACGGCG AGCAGGCCAA GGTGAAGGCC ATGGTCGAGG CCGGCAACGT GACCTGGGAC GTGCTCGAGG TCGAGTCGCC CGAGCTGGTG CGCGGCTGCG AGGAAGGCCT GTTCGAGAAG ATCGACTTCG CGCAGGTGGG CGACAAGGCC GATTTCGTCC CTGCCGCCGT GAGCGAATGC GGCGTCGGCA TCTTCGTGTG GTCGACCGCG CTCGCCTACA ACGCCGACCG CCTCAAGGAG GCGCCGACCT CCTGGGCCGA CTTCTGGAAC GTGGACAAAT TCCCCGGCAA GCGCGGGCTG CGCAAGGGCG CGAAGTACAC CCTCGAGTTC GCCCTGCTCG CCGATGGCGT CCCGACCAGC GATGTATACA AGGTGCTCGC CACCCCGGCC GGCGTCGACC GCGCCTTCGC CAAGCTCGAC AAGCTCAAGG CGAACATCCA GTGGTGGGAA GCCGGCGCGC AGCCGCCGCA GCTGCTTGCC TCGGGCGACC TGGTCATGAG CGCCGCCTAC AACGGTCGCA TCTCCGCAGC GCAGGTCGAA GGCAAGAACC TGAAGGTGGT GTGGAACGGC AGCATCTACG ACGTCGATTC GTGGACCATC CCCAAGGGCT CGCCGAACAA GGCGGAGGCG CTCAAGTTCA TCCGCTTCGC CAGCCAGCCG GAGAACCAGG CGGTGTTCTC GGGCGAGATC GCCTACGGCC CGGTCAACCT GAAGGCGGTG CCGGCGATCG ACGCCAAGGT CGCCGCGGGC CTGCCGACTG CGCCCGCGAA CATGAAGGGC GCGATGGCGA CCGATACCGA GTTCTGGGTC GAGCACGGCG AGAACCTCGA GCAGCGCTTC AACGCCTGGG CCGCGCGCTG A
|
Protein sequence | MKHAASFLRA AVLVALGGAV GTASADITVI SFGGASQKAQ DKAYYAPFTK ATGVKVVAGE YNGEQAKVKA MVEAGNVTWD VLEVESPELV RGCEEGLFEK IDFAQVGDKA DFVPAAVSEC GVGIFVWSTA LAYNADRLKE APTSWADFWN VDKFPGKRGL RKGAKYTLEF ALLADGVPTS DVYKVLATPA GVDRAFAKLD KLKANIQWWE AGAQPPQLLA SGDLVMSAAY NGRISAAQVE GKNLKVVWNG SIYDVDSWTI PKGSPNKAEA LKFIRFASQP ENQAVFSGEI AYGPVNLKAV PAIDAKVAAG LPTAPANMKG AMATDTEFWV EHGENLEQRF NAWAAR
|
| |