Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3543 |
Symbol | |
ID | 7873049 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3883975 |
End bp | 3885072 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643700484 |
Product | hypothetical protein |
Protein accession | YP_002890514 |
Protein GI | 237654200 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.571666 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGGACG AGTCCTACCC GACGACCGAG GCCCTGGTCG CCTCGCTTCA GGAAACCGAG TCGGCGCCGG GGACGAGTGG GAGCCGCCTC GCCCGCCTGA AGCGGCGCGG CCTGTGCGTG GTGGTGTGGC TGCGCCGCCA TCCCGGGCTG ATCGCCACCT TCGGCTTCGT CTCCGGGGTG TCGAGCTTCT TACTGGTCGA GCGCCACGAG GGTGTGGCGC GCGCGATGGC GGTGGCGATG CTGGCGGGCT GGCTGCTGCT GGTGCTCGAG CGCGTGCTCG ACCGCGCGCT CGAGCGGCGC TTCGGTTTCG GCCTGCCGCC GGCGCTGGTG CGCTACCTGA CCCAGTTCAC CCACCAGGAG AGCCTGTTCT TCGCTCTGCC CTTCTTCTTC GCGAGTACCT CGTGGAACAG CGGGCAGGCG GTGTTTACCG GCGCCCTCGG GCTGATGGCG CTGGTGGCGA TCCTCGACCC GGTCTACTTC GGCCGCCTGG CGACACGGCG CTGGCTCTTC CTCGGCTACC ACACGCTGAC CCTGTTCGCG CTGCTGCTGG TGGTGTTCCC GCTGGTGCTG CAGGTGCCGG CGCTGGCGAG CTACCAGCTC GCGCTCGCGC TCGCGGTGGT GCTGTCCTTC CCGACGCTGA CCGGGGCGAT CAGCGTGCCG CGCTGGTGGC GCGGGCTGCT GGTGCTGGCG CTGCTGGCCG CGCTCGGCGC TGCGGGCTGG CTGGCGCGGC TGTGGGTGCC GCCGGCCACG CTGCGCCTGA CCCAGGTCGC GGTGACCAGC GTGGTGGACG AGGCGCAGCG CGCGCCCGCC GAGAGCCTGC GCCAGATCGA GGCCGGGCAG CTGCTCGCCG AGGGCCTGTA CGCCTACACC GCGATCCATG CGCCGCTGGG CCTGTCGGAG AAGGTCGTGC ACGTGTGGCG CCACGAGGGA CGCGTGGTGG ACCGTATCGA GCTGGAGGTG AACGGCGGCC GTGCCGAGGG CTACCGGGCG TGGACGCGCA AGCGCAACTT CCCGGACGAC CCGCGCGGGC GCTGGCAGGT GCAGGTGCTC GCCGCCGACG ACCGCATGAT CGGCACGCTG CGCTTCAGGG TGGAGTGA
|
Protein sequence | MKDESYPTTE ALVASLQETE SAPGTSGSRL ARLKRRGLCV VVWLRRHPGL IATFGFVSGV SSFLLVERHE GVARAMAVAM LAGWLLLVLE RVLDRALERR FGFGLPPALV RYLTQFTHQE SLFFALPFFF ASTSWNSGQA VFTGALGLMA LVAILDPVYF GRLATRRWLF LGYHTLTLFA LLLVVFPLVL QVPALASYQL ALALAVVLSF PTLTGAISVP RWWRGLLVLA LLAALGAAGW LARLWVPPAT LRLTQVAVTS VVDEAQRAPA ESLRQIEAGQ LLAEGLYAYT AIHAPLGLSE KVVHVWRHEG RVVDRIELEV NGGRAEGYRA WTRKRNFPDD PRGRWQVQVL AADDRMIGTL RFRVE
|
| |