Gene Information |
Locus tag | Tmz1t_3358 |
Symbol | |
ID | 7873850 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3667486 |
End bp | 3668676 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643700296 |
Product | L-carnitine dehydratase/bile acid-inducible protein F |
Protein accession | YP_002890330 |
Protein GI | 237654016 |
COG category | [C] Energy production and conversion |
COG ID | [COG1804] Predicted acyl-CoA transferases/carnitine dehydratase |
TIGRFAM ID | |
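The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the Gene Length includes the stop codon while the Protein Length does not. Both assumptions agree with the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive start/end coordinates (assumed)."""
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(3667486, 3668676)
print(length)                     # 1191
print(protein_length_aa(length))  # 396
```

Both results match the Gene Length (1191 bp) and Protein Length (396 aa) fields of the record.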
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAACCCCG CATCGAACGC TCCCGCACGC GCGCCGCTCG CCGGCGTGCG CATTCTCGAC CTCACCCGGC TGCTGCCCGG TCCGCTCGCC ACCCAGCACC TGGCCGACTA CGGCGCCGAG GTGATCAAGG TGGAAGACAC CGGTGCCGGC GACTACGCGC GCACGATGGG CGCGATGGGG GGCGACACCA GCTGGTTCTA CCAACTGGTG AACCGCAACA AGCGCAGCCT GCGGCTCGAC CTCAAGCAGG AGGCGGGGCG TGCGCTGTTC CTGCGCCTGG TGGAAAGCGC CGATGTGGTG GTCGAGGGCT TCCGCCCGGG GGTGATGGAG CGCCTCGGCC TCGGCTACGC CGTGCTCGCC GAGCGCAATC CGCGCGTAGT GCTTTGCAGC ATCAGCGGCT ACGGACAGAC CGGGCCGTAC GCACTGCGCG CCGGCCACGA CATCAATTAT CTCGGCTACG CCGGCGTGCT CGACCAGATC GGCTGCGCCG GCGGCGCGCC GGCGCTGTCG AACCTGCAGA TCGGCGACCT GCTCGGCGGC ACCATGGGCG CGGTGATGGG CATCTTGGTG GCGCTGTTCG ACGCGCGCCG CAGCGGCCGC GGCCGCGAGG TCGACGTGTC GATGACCGAC GCCGCGTTCG CGCACATGAT CTTCCCGCTC GCCGAGGTGC TTGCCCACGG TGGCGTGCGG CCGCGCGGCG AGGACCTGCT CACCGGCGGG GTGCCCTGCT ACGGCGTGTA TGAGACCGCG GACGGCCGCT ACATGGCGGT GGGTGCGCTG GAGGAGAAGT TCTGGGGGCT GCTGTGCACT ACGCTCGGGC GTGCGGACCT GATCCCGGGG CATCTCGCCA CCGGGGCGGA CGGAGCACGC GTGCGGGGCG AGCTTGCGAC GATCTTCCGC GGACGCAGCC AGCGCGAGTG GGTGGAGGTG TTCGACCCGG TCGATTGCTG CGTGACGCCC GTGTTGCGCC TGGAGGAGAG CCTGCGCGAC CCGCAGCTGC GCGCACGCGG CATGGTGGTG GAGACGAACG GCCTGTGCCA CGCCGGCGCG GCGGTGCGTC TGGGCGGGAT GCCGCCGATA CCCGACGCGC CGGCACCGGC GTGTGCGGGG GCGGATACGG ATGCGATCCT GGAGGCGCTC GGCGTGGATG CCGAGGAGCT TGCCCGGCTG CGGGGCGCGG GCGTGGTGTA G
Protein sequence | MNPASNAPAR APLAGVRILD LTRLLPGPLA TQHLADYGAE VIKVEDTGAG DYARTMGAMG GDTSWFYQLV NRNKRSLRLD LKQEAGRALF LRLVESADVV VEGFRPGVME RLGLGYAVLA ERNPRVVLCS ISGYGQTGPY ALRAGHDINY LGYAGVLDQI GCAGGAPALS NLQIGDLLGG TMGAVMGILV ALFDARRSGR GREVDVSMTD AAFAHMIFPL AEVLAHGGVR PRGEDLLTGG VPCYGVYETA DGRYMAVGAL EEKFWGLLCT TLGRADLIPG HLATGADGAR VRGELATIFR GRSQREWVEV FDPVDCCVTP VLRLEESLRD PQLRARGMVV ETNGLCHAGA AVRLGGMPPI PDAPAPACAG ADTDAILEAL GVDAEELARL RGAGVV
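The Gene sequence and Protein sequence fields are related by translation table 11, and the GC content field can be recomputed from the gene sequence. The following is a minimal sketch, not the method the database itself uses. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons), and it assumes the sequence is given in coding orientation and starts with ATG, as it does here. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; amino-acid assignments
# are shared by the standard code and bacterial translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
prefix = "ATGAACCCCG CATCGAACGC TCCCGCACGC"
print(translate(prefix))  # MNPASNAPAR
```

The output matches the first ten residues of the Protein sequence field. Passing the full Gene sequence to `translate` and `gc_content` should allow comparison with the complete protein and with the reported GC content of 72%.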