Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1936 |
Symbol | |
ID | 7084404 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 2178429 |
End bp | 2179949 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 643698961 |
Product | AMP-dependent synthetase and ligase |
Protein accession | YP_002355583 |
Protein GI | 217970349 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1022] Long-chain acyl-CoA synthetases (AMP-forming) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCCGGATG CGTCTGAGTC CTTCCTGGCC GGCCTCGCCC GCCATGGCGG CCGGCCCGCG CTGCGTGGCG AGAGTGGGGA GATTGCCTTC GACCGCCTGC CCGAGGCCGT CGCCGCGTGC AGGGGTGTGC TCGAGTCGGT GGCTGCGCGC CGTGTCGCGC TCGCGCTCGA CAACGGCCTA CAGTGGGCGC TCTGGGATCT CGCGCTGCTT GCCGACGGGC GGGTGTGCGT GCCGCTGCCG GGCTTCTTCT CGCCGGCGCA GCAGGCTCAC GTGCTCGACA GTGCCGGCGT CGATACGCTG ATCGTCGACC CAGTGGCCGC CAGCGCGAGC GCGGCGGTGT TCCCGGGCTT CGTGCCGGTC GCACCCGGGA TCCTCCGGCG CGTGCCGGCG CAGGTTCCGG CGCTGCCCAC GGGCACGGTG AAGATCACCT ACACCTCCGG CACCACCGGC CAGCCCAAGG GGGTTTGCCT GAGCGCGGCG GCGCAGCTCG CGGTGGCGCG CAGCGTGGCG CGGATCGGCG AGGCTGGCGC GGTCGAGCGT CATCTGGGCG TGCTGCCGCT GGCGACGCTG CTCGAGAACA TCGCCGGCTT GTACGCCGGC CTGCTCGCTG GCGCCTGCGT CGAGTTGCTG CCGATGCGCA CGATCGGCTT CAGCGGGGGC GGGGGCTTCG ATCCGGCGCG CTTCCTGCAG ACCCTGCATG CCCGTCGACC GCACAGTCTG ATCCTGTTGC CGCAGATGCT GCTCGCCTTG GTCGGGGCCG CCGAGCAGGG CCACGCGCCG CCCGCCGGGC TGCGCTTCGT CGCGGTCGGT GGCGGGCAGG TGTCGGCGCG GCTGCTGCAG CGCGCCGAAG CGCTCGGTCT GCCTGTGTAC GAGGGCTACG GCCTGTCCGA GTGCGCCTCG GTGGTGTGCC TCAACACCCC GGCTGCGCGG CGCGTCGGCA CGGTGGGGCG GCCCTTGCCC CATGCCGCGC TGCGCATCGC CGACGATGGC GAGGTGCAGG TGCGCGGCGC GCACATGCTG GGCTACCTGG GCGAGGCGCC GCTGGCAGAC CAGCTGGTGG CAGGCGAGTG GCTGGGCACC GGCGACCTCG GCCATTTCGA CGATGGTTTC CTCGTTCTGC ACGGGCGCAG GAAGCACCAG TTCATCACCG CCTACGGGCG CAACGTGAAT CCCGAGTGGG TCGAGGCCGA ACTCGTCCAG CAGGGCCCGA TCGCCCAGGC CTGGGTGCAT GGCGAGGCGC TCGCCGAGAA CCTCGCCGTG CTCGTGCCGC GCCGCGCCGA CTGCAGCGAC GCCGAGCTCG ACGCGGCGGT GGCCGCCGCC AACGCGGGTT TGCCCGACTA CGCGCGCGCC GGGCGCTGGC TGCGCGCCGA CGCGCCCTTC ACCCCGGCAA ACGGCCTCCT CACCGCCAAC GGCCGCCTGC GCCGCGCCGC GCTCGCGGCG CACTACCTGC CACGCAGCGG CGCCGACGGC GCCGCCTCCT GCGCGACCTT CACCTCCTTC AGCGACGAGA TCGACACATG A
|
Protein sequence | MPDASESFLA GLARHGGRPA LRGESGEIAF DRLPEAVAAC RGVLESVAAR RVALALDNGL QWALWDLALL ADGRVCVPLP GFFSPAQQAH VLDSAGVDTL IVDPVAASAS AAVFPGFVPV APGILRRVPA QVPALPTGTV KITYTSGTTG QPKGVCLSAA AQLAVARSVA RIGEAGAVER HLGVLPLATL LENIAGLYAG LLAGACVELL PMRTIGFSGG GGFDPARFLQ TLHARRPHSL ILLPQMLLAL VGAAEQGHAP PAGLRFVAVG GGQVSARLLQ RAEALGLPVY EGYGLSECAS VVCLNTPAAR RVGTVGRPLP HAALRIADDG EVQVRGAHML GYLGEAPLAD QLVAGEWLGT GDLGHFDDGF LVLHGRRKHQ FITAYGRNVN PEWVEAELVQ QGPIAQAWVH GEALAENLAV LVPRRADCSD AELDAAVAAA NAGLPDYARA GRWLRADAPF TPANGLLTAN GRLRRAALAA HYLPRSGADG AASCATFTSF SDEIDT
|
| |