Gene Information |
Locus tag | Tmz1t_1149 |
Symbol | |
ID | 7084678 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 1269380 |
End bp | 1271326 |
Gene Length | 1947 bp |
Protein Length | 648 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643698164 |
Product | Poly-beta-hydroxybutyrate polymerase domain protein |
Protein accession | YP_002354804 |
Protein GI | 217969570 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG3243] Poly(3-hydroxyalkanoate) synthetase |
TIGRFAM ID | |
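The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The function names are hypothetical and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the "Start bp" and "End bp" fields of this record.
length = gene_length_bp(1269380, 1271326)
print(length, protein_length_aa(length))  # prints "1947 648"
```

Under these assumptions the result matches the "Gene Length" and "Protein Length" fields listed in the record.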
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000277322 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCCAT CCGCCCCCAA GCCGCCCTCG GGCGAGGTCG CTCCGGCGCC CACGCGCGCC CGGCGCAGAT CGGCGACCAC CGCGCCCCCC GGCACAGCCG GCAAACCCGC ACGCGCCGCG CGGTCGGACA CTGCGAGCGC GAACACGTCC AGCGCGCAGG CCGGAAAGTC CCCCGCCCGG CCCGCAGCCT CCGCTGCCCG CAGCTTTGCC GAACCCTTCG CCCCGCCGCC GCCGCGCAAC CCGCTCGATC GCCGGGTGCA TGCGGCAATC GCACGCGCCA CCGCCTCGGT GTCGCCGATC GCGCTGCTGC TGGCCACGGT CGACTGGGCC GGCCACCTCG CGGTCTCGCC AGGCAAGCGC ATGGAGCTCG CCGACCTCGC CATCGTGCAG ACGCGCCGCC TGCTGCGCTA CGCGCAGCAG CTCGCCCTCG CCCCGGCGGG CAGCGCCGCG CACGAGTGCA TCGAGCCGCC CGCCCAGGAC CGCCGCTTCC AGGCGCCCGA GTGGCACGCC TGGCCCTTCA ACCTGATGCA CCAGTCCTTC CTGCTCGGCC AGGAATGGTG GGACGCCGCC ACCCACGGCG TGGCCGGCGT CTCGCGCCAT CACGAGCAGG TCGTCGCCTT CGCCGCGCGC CAACTGCTCG ACCTGTTCTC GCCGGGCAAC TTCCTGCCCA CCAACCCGGT CGTGCTGCGA CAGACCCTGG CGAGCGGCGG CATGAACCTC GCGCGCGGCA CGATCCACGC CATCGAGGAC TTCGAGCGCC TCGTCAGCGG CGCGCCCCCG GCGGGCACCG AAGACTTCGT CGTCGGCCGC GACGTCGCGG TCACCGCCGG CAAGGTGGTG CTGCGCAACC GCCTCGTCGA GCTCATCCAG TACGCGCCCG CCACCGCCGA CGTGTACGCC GAGCCGGTGC TGATCGTGCC GGCCTGGATC ATGAAGTACT ACATCCTCGA CCTCTCGCCG CACAACTCGC TGGTCAAGTA CCTGGTCGAG CGCGGCCACA CCGTGTTCTG CATCTCGTGG AAGAACCCCG GGACAGAGGA GCGCGAGCTC GACATGGACG ACTACCTGCA GCTCGGCTTC TTCGCCGCGC TCGACGCGAT CAATGCCATC GTCCCCGGGC AGAAGGTGCA CGCAACCGGC TACTGCCTCG GCGGCACCCT GCTGGCGATC GCCGCGGCAG CGATGGACCG CGACGGCGAT GCGCGCCTCG GCTCGATGAC GCTGTTCACC GCGCAGACCG ACTTCACCGA ACCGGGCGAG CTCGCGCTCT TCATCGACGA CAGCGAGGTC AGCCTGCTCG AGGCGCAGAT GGAGGAGACC GGCTTCCTCA CCGCCGGCCA GATGGCGGGC GCGTTCCAGA TCCTGCGCTC CAACGACCTG CTGTGGTCGC GCGTCGTCGG CGAATACCTC ATGGGCGAGC GCACGCCGAT GAACGACCTC ATGGCCTGGA ACGCCGACGC CACGCGCATG CCCGCGCGCA TGCACAGCCA GTACCTGCGC CGGCTCTTCC TCAACGACGA CCTCAGCGAG GGGCGCTACC CGGTGGGCGG CAAGCCGGTG GCGCTGTCCG ACATCGAGCT GCCGGTGTTC TGCGTCGCCA CCCTCACCGA CCACGTCGCG CCCTGGCGCT CGGTGTACAA GCTGCACTAC CTGGTGCCGA CCGAGATCAG CTTCGTGCTC ACCAGCGGCG GCCACAACGC CGGCATCGTC AGCGAGCCCG GCCGCCCGCG GCGCAGCTTC CGCATGCACA CCCGCCCCGC GGGCGGCAAC TACGTCCCGC CGGACGACTG GCTCGAGCGC 
ACGCCCGAGC AGGAAGGCTC GTGGTGGCCG GCCTGGAGCG ACTGGCTCGC CGCGCGCAGC GGCGCGCGCG TGGCGCCGCC GGCGATCGGT GCCCCAGACT ATCCCGCGCT CGACGACGCG CCGGGCTGCT ACGTGCATGA GAAGTGA
|
Protein sequence | MSPSAPKPPS GEVAPAPTRA RRRSATTAPP GTAGKPARAA RSDTASANTS SAQAGKSPAR PAASAARSFA EPFAPPPPRN PLDRRVHAAI ARATASVSPI ALLLATVDWA GHLAVSPGKR MELADLAIVQ TRRLLRYAQQ LALAPAGSAA HECIEPPAQD RRFQAPEWHA WPFNLMHQSF LLGQEWWDAA THGVAGVSRH HEQVVAFAAR QLLDLFSPGN FLPTNPVVLR QTLASGGMNL ARGTIHAIED FERLVSGAPP AGTEDFVVGR DVAVTAGKVV LRNRLVELIQ YAPATADVYA EPVLIVPAWI MKYYILDLSP HNSLVKYLVE RGHTVFCISW KNPGTEEREL DMDDYLQLGF FAALDAINAI VPGQKVHATG YCLGGTLLAI AAAAMDRDGD ARLGSMTLFT AQTDFTEPGE LALFIDDSEV SLLEAQMEET GFLTAGQMAG AFQILRSNDL LWSRVVGEYL MGERTPMNDL MAWNADATRM PARMHSQYLR RLFLNDDLSE GRYPVGGKPV ALSDIELPVF CVATLTDHVA PWRSVYKLHY LVPTEISFVL TSGGHNAGIV SEPGRPRRSF RMHTRPAGGN YVPPDDWLER TPEQEGSWWP AWSDWLAARS GARVAPPAIG APDYPALDDA PGCYVHEK
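As a quick sanity check on the two sequences above, the first codons of the gene sequence can be translated and compared with the start of the protein sequence. The following is a minimal sketch. It assumes the gene sequence is listed in coding (5' to 3') orientation, which is consistent with it beginning with ATG even though the gene lies on the minus strand. It uses the amino acid assignments of the standard genetic code, which translation table 11 shares. `translate` is a hypothetical helper, not a database function.

```python
from itertools import product

# Standard codon order (T, C, A, G at each position) and the matching
# one-letter amino acid string; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace is ignored so the 10-base blocks used in this record can be
    pasted directly. Alternative start codons allowed by table 11 are not
    handled; this record starts with ATG, so that case does not arise here.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGAGCCCAT CCGCCCCCAA GCCGCCCTCG"))  # prints "MSPSAPKPPS"
```

The printed residues match the first block of the protein sequence, and the last three codons of the gene sequence (GAG AAG TGA) translate to EK followed by a stop, matching the end of the protein.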