Gene Information |
Locus tag | Tmz1t_2728 |
Symbol | |
ID | 7873469 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 2953180 |
End bp | 2954499 |
Gene Length | 1320 bp |
Protein Length | 439 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643699651 |
Product | hypothetical protein |
Protein accession | YP_002889706 |
Protein GI | 237653392 |
COG category | |
COG ID | |
TIGRFAM ID | |
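The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2953180
end_bp = 2954499

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and ends with one stop codon,
# which is not counted in the protein length.
codon_count, leftover_bases = divmod(gene_length_bp, 3)
protein_length_aa = codon_count - 1

print(gene_length_bp, leftover_bases, protein_length_aa)  # prints: 1320 0 439
```

Under these assumptions the computed values agree with the listed Gene Length (1320 bp) and Protein Length (439 aa).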
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0283161 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGACGA GCGACTTCAG CTTCAGCCTG CACGCCGGCG CGCGCGACGT GGCCGAGGCG ATCAACCGGG GCTGCGACTG CCGCAGCCTC GATCCCGAGC GCCTGCGCCG GCAGCTCGAG GCGGAGCCCA GCCTGCTCGG GCTCGCCGCC GAGATCGGCC GCAGCCGGCC GCAGCTCTTC TCCGCGACCG CCGTGTTCAT CGCGCCCGCA ACCCTGGCGG CCATGGCGGA CCTCATCGGC GCGGTCGAAT CGGTCCTGAC GCTGCCGGCC TGTCAGGCGA TCGCGCTCGA TCGCGCACCG CCGATCGCCC GCATCGCGCT CGGACCGAAA TCGGTGTTCA TGGGTTTCGA CTTTCACCTG GGCGAAGCCG GCCCGCAGCT CATCGAGATC AACACCAACG CCGGCGGAGC CCTGCTCAAC GGGGCACTGG CACGCGCGCA AAAAGCCTGC TGCGAGCAGA TCCAGCCCTT CCTGCGCCCG ACCGCAGCGC TCGGGACGCT CGACCAGCGC TGGCTGGACG ACTTCCTCGC CGACTGGTCG CTGCAGGGGC GAAGCGGCAA GCCCGCCCGC GTCGCCATCG TCGACGAACA TCCCACGGGC CAATACCTGT ATCCCGAGTT CCTGCTCTTT CGCCAGCTCT TCCGCGGCGC CGGCATCGAT GCCGTGATCT GCGACCCGGC CGAGCTGTCG TTCGAGTCGG GCCGCCTGCT GCATGGCGGC GCGCCGATCG ACCTGGTTTA CAACCGCCTC ACCGATTTCT ACCTGCAGGC TCCAGCGCTG GCGCCCCTGC GCGCAGCCTA CGAGACCGGC GCGGTCGTGC TGACCCCCCA CCCGCGTGCG CACGCGCTCT ACGCGGACAA GCGCAACCTC ACCGTCCTGA GCGATGCCGC ACGGCTTGCC GAACTCGGCG TGCCACAGCC CCTTGTCGAC CGCCTGCTCG CAGGGGTCCC GCGCACGGTC GAAGTCACCC CGGAACGCGC AGGAGCGTTG TGGGCAGCAC GGCGCGGCCT GTTCTTCAAA CCCTTCGCCG GTTTCGGCAG TCGTGCGACC TATCGGGGAG ACAAGCTCAC GCAGCGCGTC TGGCAGGAGA TTCTCGCCGG CGGCTACGTC GCGCAAGACC TCGCCCTTCC CTCCGCACGG CGCGTCGCCG TTGATGGCCA ACGCAGCGAT CTCAAGCTCG ATGTCCGCGC GTATGCGTAT GCCGGCGCGA TCCGCCTGGT CGCTGCGCGG CTGTACAAGG GACAGACGAC CAACTTCCGC ACGCCGGGCG GCGGCTTTGC GCCGGTATTC GTGGCCACCG CCGGGCGGCT TTCCGGGTGA
Protein sequence | METSDFSFSL HAGARDVAEA INRGCDCRSL DPERLRRQLE AEPSLLGLAA EIGRSRPQLF SATAVFIAPA TLAAMADLIG AVESVLTLPA CQAIALDRAP PIARIALGPK SVFMGFDFHL GEAGPQLIEI NTNAGGALLN GALARAQKAC CEQIQPFLRP TAALGTLDQR WLDDFLADWS LQGRSGKPAR VAIVDEHPTG QYLYPEFLLF RQLFRGAGID AVICDPAELS FESGRLLHGG APIDLVYNRL TDFYLQAPAL APLRAAYETG AVVLTPHPRA HALYADKRNL TVLSDAARLA ELGVPQPLVD RLLAGVPRTV EVTPERAGAL WAARRGLFFK PFAGFGSRAT YRGDKLTQRV WQEILAGGYV AQDLALPSAR RVAVDGQRSD LKLDVRAYAY AGAIRLVAAR LYKGQTTNFR TPGGGFAPVF VATAGRLSG
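The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch using only the Python standard library. The helper names (`clean_sequence`, `gc_fraction`, `looks_like_complete_cds`) are hypothetical and not part of any database tooling. The start-codon check is a simplification: translation table 11 also permits alternative start codons, which this sketch does not handle.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Rough check: length divisible by 3, ATG start, standard stop codon.

    Simplification: only ATG is accepted as a start codon here.
    """
    stop_codons = {"TAA", "TAG", "TGA"}
    return len(seq) % 3 == 0 and seq[:3] == "ATG" and seq[-3:] in stop_codons


# Example using only the first two blocks of the gene sequence above.
first_blocks = clean_sequence("ATGGAGACGA GCGACTTCAG")
print(len(first_blocks), gc_fraction(first_blocks))  # prints: 20 0.55
```

Passing the full Gene sequence field through `clean_sequence` and `gc_fraction` gives a value to compare with the listed GC content, and `looks_like_complete_cds` offers a quick sanity check on the start and stop codons.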