Gene Information |
Locus tag | Tmz1t_0052 |
Symbol | |
ID | 7083435 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 57563 |
End bp | 58888 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643697100 |
Product | conserved hypothetical cytosolic protein |
Protein accession | YP_002353749 |
Protein GI | 217968515 |
COG category | [S] Function unknown |
COG ID | [COG4924] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
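The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return length_bp // 3 - 1


length = gene_length_bp(57563, 58888)
print(length)                     # 1326
print(protein_length_aa(length))  # 441
```

Both results match the Gene Length (1326 bp) and Protein Length (441 aa) rows of the record.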
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCTGGA GCACCGCGCA GGACCTGAGG GCGCAGGTGA TGCGCCTGTG GGAGCGCGGC GAGCTGCTGC GCGAGGGGCT GCCGCAGGAC AGCGGCGCCC CCCCGTGCGC GTGCTTGCCG GAGGAGGAGA GCGAGGGCTC TGGCGCTCAG CCTTGCGAGC TACCGGCCGA TGCCGCGGCG TACACGGCCG TGGGACCACC TGCAGGCCAC AGCCGCTTCC CGCTGCGACT CACCCTGAAG ACCCCAACCT CGGACGACAT CACCCGCCAC TTCGATGCCG TGCGCGCCTG GGTTGCGGCG ATCTCGGCCA CGCCCCATGT GCGCCTCGAA TGGCAGGAGA CCCGCCACCG CGTGCAGGGC AGCCAGCGCC TGCCCGCGAG CGCCTGGGTC GACCGCCTCG ACGACGCCCT GGCCTGGATC GGCAAGCGCG CCGAGGACGC GCGCTTTCGT GCGCTGCACG CCGAGACTGC CGCACGCCAG CCCCTGCTGC TGCCCTGGCT GCACAAGCGC CCGCTGCGCG CGCTCGAACT CGCCGCCGAG TGGTCGCGCC TGCTCGACGT GGTGGCCTGG CTGCAGGCCC ACCCGCGCCC GGGCATGTAT CTGCGCCAGG TCGACCTGCC CGGCATCCAC ACCAAGTTCA TCGAATCCCA GCGCGGCGTG CTCGCCGAGC TGCTCGACCT CGCCCTGCCC GCGGCGGCGA TCGACCCGAG CCGCACCGGC GCGCAGCAGT TCGCCGCCCG CTACGGCTTC CTGGACAAGC CCGTGCTGCT GCGCTTGCGC ATCCTCGACC CGGCGCTCGG CCTGCTGCCT GGCGCGCCCT GCCCCGACCT CGCCCTCGAC GCCGACAGCT TCGCCCGCCT GCGACTCGAC GTGGCGCGCG TCTTCATCAC CGAGAACGAG ACCAACTTCC TCGCCTTCCC CCGCGTCGAC AAGGCCATCG TGATCTTCGG CGCCGGCTAC GGCTGGGAGG CCCTCGCGCG CGCCGAGTGG CTGCAGCGCT GCCCGATCCA CTACTGGGGC GACATCGACA CCAACGGCTT CGCCATCCTC GCCCAGCTGC GCGCCCGCTT CGCCCATGTC GAGTCCCTGC TGATGGACCG CGCCACCCTG CTCGCACATG AGGCGCTGTG GGGCCGGGAA GACAGCCCGC GCCCGGCCGA CGTCTCGCGC CTCAGCGCCG AGGAACGCGG CCTGTACGAA GACCTGCGCA ACCACCACAT CCGCCCGTCC CTGCGCCTGG AGCAGGAACA CATCGGCTTC GGCTGGCTGG AGAAGGCGCT GAGAATCGTC CACGCGGTGG ATGATTTTCA GCCATCTGAT GGTTGA
|
Protein sequence | MSWSTAQDLR AQVMRLWERG ELLREGLPQD SGAPPCACLP EEESEGSGAQ PCELPADAAA YTAVGPPAGH SRFPLRLTLK TPTSDDITRH FDAVRAWVAA ISATPHVRLE WQETRHRVQG SQRLPASAWV DRLDDALAWI GKRAEDARFR ALHAETAARQ PLLLPWLHKR PLRALELAAE WSRLLDVVAW LQAHPRPGMY LRQVDLPGIH TKFIESQRGV LAELLDLALP AAAIDPSRTG AQQFAARYGF LDKPVLLRLR ILDPALGLLP GAPCPDLALD ADSFARLRLD VARVFITENE TNFLAFPRVD KAIVIFGAGY GWEALARAEW LQRCPIHYWG DIDTNGFAIL AQLRARFAHV ESLLMDRATL LAHEALWGRE DSPRPADVSR LSAEERGLYE DLRNHHIRPS LRLEQEHIGF GWLEKALRIV HAVDDFQPSD G
|
| |
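The sequences above are displayed in space-separated blocks of ten characters, so the spaces need to be removed before any analysis. The following is a minimal sketch of that clean-up together with a GC fraction calculation. The helper names are hypothetical, and the example runs only on the first two blocks of the gene sequence rather than on the full 1326 bp.

```python
def clean_sequence(raw: str) -> str:
    """Remove the display whitespace between blocks and uppercase the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two ten-base blocks of the gene sequence in this record.
prefix = clean_sequence("ATGAGCTGGA GCACCGCGCA")
print(len(prefix))          # 20
print(gc_fraction(prefix))  # 0.65
```

Applying the same two functions to the complete gene sequence would give a value to compare with the GC content row of the record.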