Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_4020 |
Symbol | |
ID | 7873666 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 4418138 |
End bp | 4419466 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643700957 |
Product | hypothetical protein |
Protein accession | YP_002890980 |
Protein GI | 237654666 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.811457 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTACAT CGACATGGAG TGCTTCCGGC GCCGCAGCAT TCGCCGCTTG TGTGGCTGCT GCAGCGACAC CGGCCTTCGC CGCGGCTCCC GCTGACTGGG CAGCCATCCC GGTGCAGACG GTGACGCTGT TCTATCCCGG GCAGGTCGAT TACGGCTGGC TGCGCAGCGC GGAGCACAAG CGTGCCAATG CCAAGGTCAG GGAGGGCGAA GCCTGTCTGT CCTGCCACGA GGGCGAGGAG GCCGAACTCG GCGCCACCCT CGTCAAGGGC GGGCGGCACG AACCGGTCCC GATCGCCGGC AAGCTCGGCG CGGTCGCGCT CCAGGTCCAG GCCGCCCATG ACGACGCCAA CCTGTACCTG CGCTTCCAGT GGAAGACGCA GATGGCGCGG GCCGGGCAGA TGCACGACTA CATGATGTAC GACGGCGAGA AGTGGGCCTT CATCGGCGGG CCGCGCTCGA AGGAAGCCGT GCGCAGCGGC GCCCAGCCGC CGCTCTACGA GGATCGCCTG TCGGTGATGA TCGACGACGG CAAGGTGCCG ATGTTCGCCA ACCAGGGCTG CTGGCTGACC TGCCACACCG GCATGCGCGA CATGCCCGGA GAGCCGACCA AGGAACAGGT CCAGGCCCAT CCGCTGATCG GTCAGACGCA CAAGGAAAGC GACGTGCGCA AGTATCTGCC GGCCACGCGC ACGGACGAGG CGGCGAGCTG GGACAAGACC CGCGCGCCGG AGGAGATCGC CCGCCTCAAG GAGGCGGGCG CCTTCGTCGA GCTGATGCAG TGGCGCGGTC ATCGCAGCAA TCCGGTGGGC ATGGCCGACG ATGGCTACGT GCTCGACTAT CGCCTCGTCG ACGCCGGCAA GGGCCCGTTC GGCTGGAACG TCGACCGCAA GACCATGACG CCGAAGTTCA TGTTCGACCC TGCGAAAGTC GGTGTGAAGG CGCTTGCGCT CGCGGATGTC GGCAACGCGT CGAAGCCGCA CGCGCTGATC CGGGAAGACA ACGCCGTGGC CTACGATCCG GCCGCGGGCT GGAAGAAGGG CGACGTCCTT CCCGGGCGCC TGCTCTCACG CGCCGACGCA AGCGGTTCGG CGGCGGACAA TGCCGACGTC CGCGGCGAGT GGGCGGATGG CCAGTGGACG GTGCTGTGGA CGCGCAAGCT CGACACCGGG CATGCCGACG ACGACAAGGC CCTGAAGCCG GGCGGCGTCG TCAACGTGGG CTTTGCCGTT CATGACGACA ACGTCACAAC GCGTTTCCAT CATGTGTCCT TCCCGCTGAC CCTGGGGATC GGCACGAAAG CCACCATCTC CTCGGTCGCG CTCGAGTGA
|
Protein sequence | MTTSTWSASG AAAFAACVAA AATPAFAAAP ADWAAIPVQT VTLFYPGQVD YGWLRSAEHK RANAKVREGE ACLSCHEGEE AELGATLVKG GRHEPVPIAG KLGAVALQVQ AAHDDANLYL RFQWKTQMAR AGQMHDYMMY DGEKWAFIGG PRSKEAVRSG AQPPLYEDRL SVMIDDGKVP MFANQGCWLT CHTGMRDMPG EPTKEQVQAH PLIGQTHKES DVRKYLPATR TDEAASWDKT RAPEEIARLK EAGAFVELMQ WRGHRSNPVG MADDGYVLDY RLVDAGKGPF GWNVDRKTMT PKFMFDPAKV GVKALALADV GNASKPHALI REDNAVAYDP AAGWKKGDVL PGRLLSRADA SGSAADNADV RGEWADGQWT VLWTRKLDTG HADDDKALKP GGVVNVGFAV HDDNVTTRFH HVSFPLTLGI GTKATISSVA LE
|
| |