Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3948 |
Symbol | |
ID | 7873594 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 4344472 |
End bp | 4345836 |
Gene Length | 1365 bp |
Protein Length | 454 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643700885 |
Product | ABC-1 domain protein |
Protein accession | YP_002890908 |
Protein GI | 237654594 |
COG category | [R] General function prediction only |
COG ID | [COG0661] Predicted unusual protein kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.234521 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACACCG CCGCCAGCGC CATCGACCCG AGCATTCCCG TCGCGGATGC CGCGCGCAGC GCCGTGGTAC CGGGCGGCCG CCTGTCCCGT CTTGCCCGCC TGGGCAGTCT CGCCACCGGG GTGGCGGGCG GCATGCTCGC CGAGGGTGCG CGCCAGCTCG CCGCCGGCAA ACGACCCAAG GTGAGCGAGC TGGTGCTCAC CCCGGCCAAC GCCCGCCGCG TCGCCGAACA GCTCGCGCAG CTGCGCGGTG CGGCGATGAA GGTCGGCCAG CTGATGTCGA TGGACGCCGG CAGCCTGCTG CCGCCCGAAC TCGCCGACAT CCTCGCCCGC CTGCGCGAGG ACGCGCGCAC GATGCCGATG AGCCAGGTGG TCGAGGTGCT GGAGACGCAC TGGGGCAAGG GCTGGGAGCA GGGCTTCGAG CGCTTTTCCT TCACCCCGTG CGCGGCGGCC TCGATCGGCC AGGTGCATCG CGCGCGCACC CGCGATGGCG AGGAACTGGC GATCAAGCTG CAATATCCCG GCGTGCGGCG CAGCATCGAC AGCGACGTGG ACAACGTCGC CACCCTGCTG CGCGTCTCCG GGCTGCTGCC CAAGGCGCTC GATCTCGCGC CGCTGCTCGC CGAGGCCAAG CGCCAGCTCC ACGAGGAGGC CGACTACCGT CGCGAGGCCG AGAGCCTGCA CCGCTTCGGC GGCCTGCTCG GCGATGCGGA ACACTTCGTG CTGCCGCGCG CGGTCGATGC GCTCACCCGC AGCGACATCC TGGCGATGAG CTGGGTGGAG GGCGTGGCGG TGGAGACGCT CGCCGACCCG CAGGCGGCCG ACCAGGCGCT GCGCGACCGC GTGGCGAGCC TCTTGATCGG CCTGCTGTTC CGCGAGCTGT TCGAGTTCCG CCTCATCCAG ACCGACCCCA ACTTCGCCAA CTACCGCTTC GACGCCGCCA GCGGCCGTGT GGTCCTGCTC GACTTCGGCG CCACCCGGCC CTACGCCGAG CCGGTGGTCG AGGCCTACCG CCGCCTGATG GCGGGCTCGG TGCGTGGCGA CCGCGTCACG ATGGGCGAGG CGGCGCAGGC GATCGGCTAC TTCCAGGACA ATATCCATGC CCATCAGCGC GATGCGGTGA TCGACCTCTT CGAGATCGCT TGCGAGCCGG TGCGCCACCC CGGCGCCTAC GATTTCGGTA CCAGCGACCT GCCGCTGCGC CTGCGCGATG CCGGGCTGAA GCTGAGCATG GAGCGCGACT TCTGGCACAC CCCGCCCGCC GATGCGCTCT TCCTGCACCG CAAGCTCGGC GGCCTGTACC TGCTCGCGGC GCGGCTGCGC GCGCGCGTGG ACGTGGCGGC GCTGGCGGCG CCCTGGCTGG GCTGA
|
Protein sequence | MNTAASAIDP SIPVADAARS AVVPGGRLSR LARLGSLATG VAGGMLAEGA RQLAAGKRPK VSELVLTPAN ARRVAEQLAQ LRGAAMKVGQ LMSMDAGSLL PPELADILAR LREDARTMPM SQVVEVLETH WGKGWEQGFE RFSFTPCAAA SIGQVHRART RDGEELAIKL QYPGVRRSID SDVDNVATLL RVSGLLPKAL DLAPLLAEAK RQLHEEADYR REAESLHRFG GLLGDAEHFV LPRAVDALTR SDILAMSWVE GVAVETLADP QAADQALRDR VASLLIGLLF RELFEFRLIQ TDPNFANYRF DAASGRVVLL DFGATRPYAE PVVEAYRRLM AGSVRGDRVT MGEAAQAIGY FQDNIHAHQR DAVIDLFEIA CEPVRHPGAY DFGTSDLPLR LRDAGLKLSM ERDFWHTPPA DALFLHRKLG GLYLLAARLR ARVDVAALAA PWLG
|
| |