Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_2071 |
Symbol | |
ID | 7085341 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 2342499 |
End bp | 2344244 |
Gene Length | 1746 bp |
Protein Length | 581 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643699093 |
Product | ABC-1 domain protein |
Protein accession | YP_002355710 |
Protein GI | 217970476 |
COG category | [R] General function prediction only |
COG ID | [COG0661] Predicted unusual protein kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0987216 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCTGGC AGGCACTGAC CGCGATGCGT GATCTACGGC GCTTGCACGA AATCGCCTCG ATCCTAGTCC GCTACGGGTT TGGCGACATG GTGCGCCGGA TGGGGCTGTC CAACGTCTTG GAGCGGGCGG GGACGGCACT GCACTGGAAC GAAGCGACAG ACTTCGCCCA CATGACGCCC CCTGAACGAG TGCGGCGGGC GCTGGAGGAA ATGGGGCCGA CTTTCGTCAA ACTCGGCCAG GTGCTCGCTA CCCGGGTCGA TCTACTCGAA CCCGAGTGGA CCGCCGAATT CGGGAAGTTG CAGGACAGCG CACCGCCCGC TCCCTGGGCA GCCGTACATC AGCAGCTCAC CGAAGACCTC GGCGCGCCAC CCGAGGAAAT CTTCGCGGCC TTCGATCCCG AGCCCTTGGC GGCCGCTTCC ATTGCGCAGG TGCATCGGGC GCGTCTCGAA GATGGCGGCG AAGTCGTGGT CAAGGTGCGC AGGCCCGGCA TTCGGCCCAT TCTGGAAGCC GATCTGCGCT GGCTTGCGCG CTTGGCCGAA CTCGCCGAGA CGGAGCGCGC GGAATGGCGC GTCCTGCATC CGCGCGAGAT GGTGCGCCAG TTCGGCCAGT CGCTGCGCAA TGAACTCGAT TTCGCCGGCG AATGCCGCAA TGCCGAACGC ATCGCCGAGA ACTTCACCGG TTATACCGAT CAGGACTCGC CTCCTGTCGT CCCGGGCGAG GAGAAGCCGG ACGCGGATGG CGCACATCCC ATCATCGTCA TTCCCCGCGT GTATTGGCAG TGGACCGGCG AGCGGGTCTG CGTACAGGAA TTCATTGGTG GTATTTCCGG GCGCAGGCTT CAGGCCGTAG ATCAGGCTGG CCTGGACCGC AAGATGCTCG CACGGCGTGG CGCCAATGCC GTACTGAAGA TGATCGTCGA GGACGGGGTG TTTCACGCTG ACCCCCATCC GGGCAACGTG TTCTATTTGC CGGGCAATCG AATCGCCTTC ATCGACTTTG GCATGGTTGG CAGGCTGGAG GAGGAACGCC GCGACCAGTT GATCCGCCTG CTGCTGGGGC TGGTGAGGAA CGACGCCCGC AGCGTCGTCG ACGTCATGCT CGAATGGACG AGCGACGGTG TGGCAAATGA GCAGGATCTG TTGTTGGAAG TCCAGTCTTT CATCGATCGC TACCACGGCG TGCCGCTGAA GCAACTGCGG TTGAGCGCAA TGTTGTCCGA CCTCGTAGCG ATCCCGCGCC AACACCACCT TGTCCTGCCT AACGATCTGT CCTTGCTGAT AAAAGCCTTC GTCACACTGG AAGGCTTGGG CCGCGAACTC GATCCCGGCT TCGACATGGC CAACCAGGTA CTGCCGCTGC TGGAGCGTGC CATGCACGCG CGCTACTCCC CGGCAACGCT GCTCAGACGG GGGCGGCGGG CGGCCGGTGA CATCTTCTCG TTGCTGGCTG GCTTGCCCCA GGACATCTCC CGGCTGTTGC GAGCCGTCCG CCGAGGCCGT CTCGAAGTAC ACATCGACGT CACCCATCTG AGACGGGTCG GCGATCAACT CGACAATGCC GCCAACCGCC TGGTGATCGG GATCGTCGTC GCCGCGCTCA TCGTCGGCTC GTCCATCGTC ATGACCGTGT CGGGAGGACC GACGCTTCTG GGGTTACCGG TTTTCGGCCT TCTCGGATTC CTGGGCGCGG TGGCCGGCAG CCTATGGCTT CTGCTTTCCA TTCTGCGCAG CGGGAGAGGA CGATGA
|
Protein sequence | MFWQALTAMR DLRRLHEIAS ILVRYGFGDM VRRMGLSNVL ERAGTALHWN EATDFAHMTP PERVRRALEE MGPTFVKLGQ VLATRVDLLE PEWTAEFGKL QDSAPPAPWA AVHQQLTEDL GAPPEEIFAA FDPEPLAAAS IAQVHRARLE DGGEVVVKVR RPGIRPILEA DLRWLARLAE LAETERAEWR VLHPREMVRQ FGQSLRNELD FAGECRNAER IAENFTGYTD QDSPPVVPGE EKPDADGAHP IIVIPRVYWQ WTGERVCVQE FIGGISGRRL QAVDQAGLDR KMLARRGANA VLKMIVEDGV FHADPHPGNV FYLPGNRIAF IDFGMVGRLE EERRDQLIRL LLGLVRNDAR SVVDVMLEWT SDGVANEQDL LLEVQSFIDR YHGVPLKQLR LSAMLSDLVA IPRQHHLVLP NDLSLLIKAF VTLEGLGREL DPGFDMANQV LPLLERAMHA RYSPATLLRR GRRAAGDIFS LLAGLPQDIS RLLRAVRRGR LEVHIDVTHL RRVGDQLDNA ANRLVIGIVV AALIVGSSIV MTVSGGPTLL GLPVFGLLGF LGAVAGSLWL LLSILRSGRG R
|
| |