Gene Information |
Locus tag | Tmz1t_1627 |
Symbol | |
ID | 7084837 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 1822066 |
End bp | 1823301 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643698647 |
Product | hypothetical protein |
Protein accession | YP_002355278 |
Protein GI | 217970044 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00139041 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGGCCG CCAAGCCCCC CTACCGCAGC CCCTTGGTCA AGGCGTGGAA CGCCATCGCC AGCGCCGCTC GACGCCCGGT GACGCTCGAC GCGAGGGCGC TGCTCGTCGC GGCACAGCGC AAGCAGCCCG ATGCGCCACC GCCACCCGCA GCGACGATGG ATGCGCTAGG CGTGCTGGTC GAAGCCATCC ACTCGGGCAG GAGTGCCCTG CACCCGTTCG GGCGTTACTA TCTCGAGCAG ATGTTTGCGG GGCTGCTGTC GAATCGGATG CGACTTGCCA AGTTCTGGGC CGAGCACCCG GAAACACTCG CGGACGAAGT TCGCCAGCCG GTGATCGTGG TCGGCCTGCC CCGCTGCGGG ACGTCCTACC TGTTCAACCT GCTCGCCCAC GACCCGGAAC ACCGTTACCT CACCAACTGG GAAACCACCG TCTCACAGAT CCCGCCCACG CCCCCGCCCG TCACGCTGCG ACAGGACCCC AGACGACGCA TCGGCAAGCT CTTGATGGCG TTCCAGCGCC ATCTCGCACC CGGCCTGGAG TCGATCCACG AGTTCCACCT AGACGGCCCC GAGGAATGCA CCCCCCTGCT GATGCAGGGC TTCGACACCC AGGCGCTGGC GGGGATGTTC GACGCCCCGG ACTACTCGCA CTGGCTCGAT CACGCCGATC ATCGCGCGAC CTACCAGCAT CACAGGCGCA TCCTGCTGAC CCTGCAGCGT TGTTATCCGG CCGGGCGCTG GCTGCTCAAA TCGCCTGATC ACCTCGCTGC GCTCGAGGCC CTCCTCGAGA CCTATCCCGA CGCATGCCTG ATCCAGCTGC ATCGCGACCC GGTGCAGGCG GTGTCCTCGT GGGCGAGCCT GAATGCGGCC TTCCGTGGCA TCTGGAGCGA ACGTATCGAT GCGGCCGAGC TCGGACCGCA GATCCTGGAA CGGCTTGCCA CCGACATGGA CGCCAGCATC GACGCGCGCC AACGCCTCCC CGCGGATCGC TTCCTCGACC TCCAGTATCG CGACCTCATC GCCGATCCGC TCGGCCAGGT CGAGCGCATC CATGCGCATT TCGGACTCGA CTTCGCTCCA TCGACGCGGG CACGCGTCGA GAGTTTCCTG CACGGCGACC GCGACAAGAA GCGCAGCCAT GCCTATGCTC CCGAGCACTT CGGCCTGAGT GCGGAACGCA TTCGCGAACG CTTCGCGCGC TACATCGGCC ACTACGGAGT TGCTCCTGCA CGCTGA
|
Protein sequence | MQAAKPPYRS PLVKAWNAIA SAARRPVTLD ARALLVAAQR KQPDAPPPPA ATMDALGVLV EAIHSGRSAL HPFGRYYLEQ MFAGLLSNRM RLAKFWAEHP ETLADEVRQP VIVVGLPRCG TSYLFNLLAH DPEHRYLTNW ETTVSQIPPT PPPVTLRQDP RRRIGKLLMA FQRHLAPGLE SIHEFHLDGP EECTPLLMQG FDTQALAGMF DAPDYSHWLD HADHRATYQH HRRILLTLQR CYPAGRWLLK SPDHLAALEA LLETYPDACL IQLHRDPVQA VSSWASLNAA FRGIWSERID AAELGPQILE RLATDMDASI DARQRLPADR FLDLQYRDLI ADPLGQVERI HAHFGLDFAP STRARVESFL HGDRDKKRSH AYAPEHFGLS AERIRERFAR YIGHYGVAPA R
|
| |
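The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon (the listed gene sequence ends in TGA, a stop codon in translation table 11).

```python
# Sketch: sanity-check the record's coordinates and lengths.
# Field values are copied from the record; function names are hypothetical.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(1822066, 1823301)  # Start bp, End bp from the record
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1236 411
```

Both values agree with the Gene Length (1236 bp) and Protein Length (411 aa) fields, which is consistent with the inclusive-coordinate assumption.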