Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0324 |
Symbol | |
ID | 7085625 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 369884 |
End bp | 371629 |
Gene Length | 1746 bp |
Protein Length | 581 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643697361 |
Product | ABC-1 domain protein |
Protein accession | YP_002354009 |
Protein GI | 217968775 |
COG category | [R] General function prediction only |
COG ID | [COG0661] Predicted unusual protein kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCTGGC AGGCACTGAC CGCGATGCGT GATCTACGGC GTTTGCACGA AATCGCCTCG ATCCTGATCC GCTACGGGTT TGGCGACATG GTGCGCCGGA TGGGGCTGTC CAACGCCTTG GAGCGGGCGG GGACGGCACT GCACTGGAAC GAAGCGACAG ACTTCGCCCA CATGACGCCC CCTGAACGAG TGCGGCGGGC GCTGGAGGAG ATGGGGCCGA CTTTCGTCAA ACTCGGCCAG GTGCTCGCTA CCCGGGTCGA TCTACTCGAA CCCGAGTGGA CCGCCGAATT CGGGAAATTA CAGGACAGCG CGCCGCCCGC TCCCTGGGCA GCCGTACATC AGCAGCTCAC CGAAGACCTC GGCGCGCCGC CCGAGGAAAT CTTCGCGGCC TTCGATCCCG AGCCCTTGGC GGCCGCTTCC ATTGCGCAGG TGCATCGGGC GCGTCTCGAC GATGGCAGCG AAGTCGTGGT CAAGGTGCGC AGGCCCGGCA TTCGGCCCAT TCTGGAAGCC GATCTGCGCT GGCTTGCGCG CTTGGCCGAA CTCGCTGAGA CGGAGAGCGC GGAATGGCGC GTCCTGCATC CGCGCGAGAT GGTGCGCCAG TTCGGCCAGT CGCTGCGCAA TGAACTCGAT TTCGCCGGCG AATGCCGCAA TGCCGAACGC ATCGCCGAGA ACTTCACCGG CTATACCGAT CAGGACTCTC CTCCTGTCGT CCCGGGCGAG GAGAAGCCGG ACGCGGATGG CGCACATCCC ATCATCGTCA TTCCCCGCGT GTATTGGCAG TGGACCGGCG AGCGGGTCTG CGTACAGGAA TTCATTGGTG GTATTTCCGG GCGCAGGCTT CAGGCCGTAG ATCAGGCTGG CCTGGACCGC AAGATGCTCG CACGGCGTGG CGCCAATGCC GTACTGAAGA TGATCGTCGA GGACGGGGTG TTTCACGCTG ACCCCCATCC GGGCAACGTG TTCTATTTGC CGGGCAATCG AATCGCCTTC ATCGACTTTG GCATGGTTGG CAGGCTGGAG GAGGAACGCC GCGACCAGTT GATCCGCCTG CTGCTGGGGC TGGTGAGGAA CGACGCCCGC AGCGTCGTCG ACGTCATGCT CGAATGGACG AGCGACGGTG TGGCAAATGA GCAGGATCTG TTGTCGGAAG TCCAGTCTTT CATAGATCGC TACCACGGCG TGCCACTGAA GCAACTGCGG TTGAGCGTAA TGTTGTCCGA CCTCGTGGCG ATCCCGCGCC AGCATCACCT CGCCCTGCCC AACGATCTGT CCCTGCTGGT CAAAGCCTTC GTCACACTGG AAGGCTTGGG CCGTGAACTC GATCCCGGTT TCGACATGGC CAACCAGGCA CTGCCGCTGC TGGAGCGTGC CATGCACGCG CGCTACTCCC CGGCAGCGCT GCTCAGACGG GGGCGGCGGG CGGCCGGTGA CATCTTCTCG TTGTTGGCTG GCTTGCCCCA GGACATCTCC CGGTTGCTGC GAGCCGCCCG CCGCGGTCGC CTGGAAGTAC ACATCGACGT CACCCATCTG AGACGGGTCG GCGATCAGCT CGACAATGCC GCCAACCGTC TGGTGATCGG GATCGTCGTC GCCGCGCTCA TCGTCGGTTC GTCCATCGTC ATGACCGTGT CGGGAGGGCC GACGCTCCTG GGGTTGCCGG TCTTTGGCTT TCTCGGATTC CTTGGCGCGA CAGTCGGCAG CCTATGGCTT CTGCTTTCCA TTCTGCGCAG CGGGAGAGGA CGATGA
|
Protein sequence | MLWQALTAMR DLRRLHEIAS ILIRYGFGDM VRRMGLSNAL ERAGTALHWN EATDFAHMTP PERVRRALEE MGPTFVKLGQ VLATRVDLLE PEWTAEFGKL QDSAPPAPWA AVHQQLTEDL GAPPEEIFAA FDPEPLAAAS IAQVHRARLD DGSEVVVKVR RPGIRPILEA DLRWLARLAE LAETESAEWR VLHPREMVRQ FGQSLRNELD FAGECRNAER IAENFTGYTD QDSPPVVPGE EKPDADGAHP IIVIPRVYWQ WTGERVCVQE FIGGISGRRL QAVDQAGLDR KMLARRGANA VLKMIVEDGV FHADPHPGNV FYLPGNRIAF IDFGMVGRLE EERRDQLIRL LLGLVRNDAR SVVDVMLEWT SDGVANEQDL LSEVQSFIDR YHGVPLKQLR LSVMLSDLVA IPRQHHLALP NDLSLLVKAF VTLEGLGREL DPGFDMANQA LPLLERAMHA RYSPAALLRR GRRAAGDIFS LLAGLPQDIS RLLRAARRGR LEVHIDVTHL RRVGDQLDNA ANRLVIGIVV AALIVGSSIV MTVSGGPTLL GLPVFGFLGF LGATVGSLWL LLSILRSGRG R
|
| |