Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0420 |
Symbol | |
ID | 7084931 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 479769 |
End bp | 480992 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643697452 |
Product | L-carnitine dehydratase/bile acid-inducible protein F |
Protein accession | YP_002354095 |
Protein GI | 217968861 |
COG category | [C] Energy production and conversion |
COG ID | [COG1804] Predicted acyl-CoA transferases/carnitine dehydratase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCGGCG CCCTGTCGCA CGTCCGCGTC CTCGACCTCT CGCGCATCCT GGCCGGCCCC TGGGCCGGGC AGATGCTGGC CGACCTCGGC GCGGACGTCA TCAAGGTCGA GCGCCCCGGC GCCGGCGACG ACACCCGCGG CTGGGGGCCG CCCTGGCTCA AGGACGAGCA GGGCGCCGAC ACCGACGTGG CGGCGTACTA CCTGTGCGCC AACCGCAACA AGCGCTCGAT CACCATCGAC ATCACCCAGG CCGAAGGCCA GGCGCTGGTG CGCCGGCTCG CCGCCGAGTC CGACGTGGTG CTGGAGAACT TCAAGGTCGG CGGCCTGGCG CAGTATGGCC TCGACTACGA CAGCCTGAAG GCGGTCAATC CGCGCCTGGT GTATTGCTCG ATCACCGGCT TCGGCCAGGA CGGACCGTAC GCGCCGCGCG CCGGCTACGA CTTCCTCATC CAGGGCCTGG GCGGGCTGAT GAGCATCACC GGCCGGCCCG ACGGCGAGGA GGGCGGCGGT CCGATGAAGG TGGGCGTGGC ACTCACCGAC ATCCTAACCG GGCTGTACGC TACCAACGCG GTACTCGCTG CGCTGGCCTG GCGCGAGCGC AGCGGCGAGG GCCAGTACAT CGATATGGCC CTGCTCGACG TGCAGGTGGC CTGCCTCGCC AACCAGGCGG GCAACTATCT CGCCACCGGG CAGAGCCCGC AGCGGCTGGG CAATGCGCAC CCCAACATCG TGCCCTACCA GGACTTCCCC ACCGCCGACG GCTACATGAT CCTCGCCATC GGCAACGACG GGCAGTTCGC GCGCTTCTGC GCGGCCGCCG GTGCGCCGCA GCTCGCGACC GACGAACGCT TCGCCACCAA CCGCGCGCGC GTGGTCAATC GCGCCACGCT GATCCCGCTG CTGAAGAAGC TCACCGTCGA GCGCGGCACC GCGGAGTGGA TCGCGAAGCT CGAGGCGCTC GCCGTGCCCT GCGGTCCGAT CAACACCCTG GCCGACGTCT TCGCCGACCC GCAGGTGCAG GCGCGCGAGA TGAAGGTGAC GATGCCGCAT CCGGTCGCCG GCCAGGTACC GCTCGTCGCC AGCCCGATGA AGCTCTCGGC CACGCCGGTG GACTACCGCC TGCCGCCGCC GATGCTCGGC GAGCACACCG ACGAGATCCT CGCCGCAACG CTCGGTCTCG ATGCCGGGGC GATCGCCCGC CTGCGTGCGG ACGGCGTGGT CTGA
|
Protein sequence | MPGALSHVRV LDLSRILAGP WAGQMLADLG ADVIKVERPG AGDDTRGWGP PWLKDEQGAD TDVAAYYLCA NRNKRSITID ITQAEGQALV RRLAAESDVV LENFKVGGLA QYGLDYDSLK AVNPRLVYCS ITGFGQDGPY APRAGYDFLI QGLGGLMSIT GRPDGEEGGG PMKVGVALTD ILTGLYATNA VLAALAWRER SGEGQYIDMA LLDVQVACLA NQAGNYLATG QSPQRLGNAH PNIVPYQDFP TADGYMILAI GNDGQFARFC AAAGAPQLAT DERFATNRAR VVNRATLIPL LKKLTVERGT AEWIAKLEAL AVPCGPINTL ADVFADPQVQ AREMKVTMPH PVAGQVPLVA SPMKLSATPV DYRLPPPMLG EHTDEILAAT LGLDAGAIAR LRADGVV
|
| |