Gene Information |
Locus tag | Tmz1t_3565 |
Symbol | |
ID | 7873071 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 3910221 |
End bp | 3911666 |
Gene Length | 1446 bp |
Protein Length | 481 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643700506 |
Product | amino acid carrier protein |
Protein accession | YP_002890536 |
Protein GI | 237654222 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1115] Na+/alanine symporter |
TIGRFAM ID | [TIGR00835] amino acid carrier protein |
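Note: the coordinate and length fields above are mutually consistent. The sketch below shows the arithmetic. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The record does not state either convention, but both match the values shown. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 3910221
END_BP = 3911666


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1446 481
```

Under these assumptions, the computed values agree with the Gene Length (1446 bp) and Protein Length (481 aa) fields.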
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0685949 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAGCAT TCGCCAACCA GGTCGTCGAC CTGCTCAACG GCCTGCTCTG GGGCAAGATC CTGATCTGGC TGCTGGTCGG CACCGGCATC TATTTCACCC TGCGCCTGGG CTTCATCCAG TTGCGCCACT TCGGCCACAC CTTCACGGTG CTGCGCGGCA GCCGCCGGTC CGACGCCTCC GGCATCTCGT CCTTCCAGGC CCTGTGCACC TCGCTCGCTG CGCGCGTGGG CACCGGCAAC ATCGCCGGCG TGGCGGTGGC GATCACGCTC GGCGGCCCGG GCGCGATCTT CTGGATGTGG GTGATCGCGC TCGTGGGCAT GGCCACCGGC TTCGCCGAGG CCACGCTCGC GCAGCTCTTC AAGCAGCGCG ACGACAAGGG GCAGTTCCGC GGCGGCCCCG CCTACTACAT GGAGAAAGGT CTCGGCCAGC GCTGGGCGGG CACGATGTTC TCGATCTTCC TGATCATCGC CTTCGGCTTC GCCTTCAACT CGGTGCAGTC GAACACGATC GCCGGCGCGA TGGTCGGCGC CTTCGAGCTC GACATGGGCA GCTTCACGCT GGGCGGCAAC GAGGTCGCCG TCGCGGCGCT GGTGGTCGGC GTGGCGGTCA CCGTGCTCAC CGCGGTGATC ATCTTCGGCG GCCTGCGCTC GATCGCGCGC TTCTCCGAGC TCGCCGTGCC CTTCATGGCG GTCGGTTACC TGGTGGTGGC GGTCGGCATT CTCCTCGTCA ACCTCGGCGA GGTGCCGGGC GTGCTGGCGA TGATCGTCAA GAGCGCCTTC GGCTTCCACG AAGCCGCCGC CGGCGGCATC GGCGCGGCCA TCCTCAACGG CTTCAAGCGC GGCCTGTTCT CCAACGAGGC CGGCATGGGC TCGGCGCCCA ACGCCGCCGC CGCGGCCACG CCCTACCCGC CGCACCCGGC CTCGCAGGGC TACGTGCAGA TGGCCGGCGT GTTCATCGAC ACGATCCTGA TCTGCACCGC GTCCGCCGCC ATCATCCTGC TCGCCGGTCC GGTCGAAGGC ACCGGCGTCG GCCTGGTGCA GAACGCCCTG ACCAGCGAAG TGGGCGGCTG GGGCAAGTAC TTCCTCGCCG TCGTGGTGCT GTTCTTCGCC TTCACCTCGA TCGTCGCCAA CTACTTCTAC GCCGAGAACT GCCTGGTGTT CATCGAGCAC AACCACCCCG CGGGCCTGCT GATCTTCCGC CTGATCGTGC TGGCCATGGT GATGTTCGGC GCGATCGGCT CGCTGCCCTT CGTGTGGAAC TTCGCCGACG TGGCGATGGG CCTGATGGCG ATCACCAACC TCGTCGCCAT CCTGTTGCTC TCGGGCCTGG TGGTGAAGCT GGCGAAGGAC TACAACGCCC AGCGCGCCGC CGGCAAGCTG CCCACCTTCG ACGCCCGCGC GTACCCCGAG ATCCGCAGCA AGCTCACGCC GGGCATCTGG GACTGA
|
Protein sequence | MEAFANQVVD LLNGLLWGKI LIWLLVGTGI YFTLRLGFIQ LRHFGHTFTV LRGSRRSDAS GISSFQALCT SLAARVGTGN IAGVAVAITL GGPGAIFWMW VIALVGMATG FAEATLAQLF KQRDDKGQFR GGPAYYMEKG LGQRWAGTMF SIFLIIAFGF AFNSVQSNTI AGAMVGAFEL DMGSFTLGGN EVAVAALVVG VAVTVLTAVI IFGGLRSIAR FSELAVPFMA VGYLVVAVGI LLVNLGEVPG VLAMIVKSAF GFHEAAAGGI GAAILNGFKR GLFSNEAGMG SAPNAAAAAT PYPPHPASQG YVQMAGVFID TILICTASAA IILLAGPVEG TGVGLVQNAL TSEVGGWGKY FLAVVVLFFA FTSIVANYFY AENCLVFIEH NHPAGLLIFR LIVLAMVMFG AIGSLPFVWN FADVAMGLMA ITNLVAILLL SGLVVKLAKD YNAQRAAGKL PTFDARAYPE IRSKLTPGIW D
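Note: the sequences above are displayed in space-separated blocks of 10 characters, so the spaces must be stripped before analysis. The sketch below shows one way to clean a sequence, compute its GC fraction, and translate it. The helper names are hypothetical and not part of any database API. The codon table is the standard one; translation table 11 assigns the same amino acids to codons and differs only in the start codons it permits, which does not matter here because this gene starts with ATG.

```python
from itertools import product

BASES = "TCAG"
# Standard codon table, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence, copied from the record.
prefix = clean_sequence("ATGGAAGCAT TCGCCAACCA GGTCGTCGAC")
print(translate(prefix))  # MEAFANQVVD
```

The translated prefix matches the first block of the protein sequence. Applying `gc_content` and `translate` to the full cleaned gene sequence should let the GC content and Protein sequence fields be checked in the same way.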
|
| |