Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0664 |
Symbol | |
ID | 7084602 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 750167 |
End bp | 752125 |
Gene Length | 1959 bp |
Protein Length | 652 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643697690 |
Product | nitrous-oxide reductase |
Protein accession | YP_002354332 |
Protein GI | 217969098 |
COG category | [C] Energy production and conversion |
COG ID | [COG4263] Nitrous oxide reductase |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGACG CAAAGATGAA CCCGGTCGAA GCCATGCCCG ATCCGAGCCG CCGCAAGTTC CTCAACACCG CAGCGCTCGC CGGCCTCGCC GGTGCCGGCA TGTCGGTGGG CCTGTCCGCG TGCAACAAGG AAACCGCTCC CGCTCCGGCC GCTGCACCCG CCGCGGCACC GGCCGCCGCG CCCGCCGCGG CGCATGCTGC CTCCGGCCTC AAGCTGCACA TGGCACCGGG TGAGCTCGAC ACCTACTACG GCATCTGGTC CGGCGGCCAC TCCGGCGAAT GCCGCGTCCT CGGCCTGCCT TCGGGGCGTG AAATCAAGCG CGTGCCGACC TTCAACATCG ACTGCATGAG CGGCTGGGGC ATCACCAACG AGTCCAAGAA GGTGATCGGC ACGCGTGCGG ACGGCCAGTT GAAGTACAAG ACCGGCGACA CCCACCACAT CCACGGCTCC TACACCGACG GCACCTACGA CGGCAAGTAC TTCTGGGTCA ACGACAAGAT CCACGGCCGC CTGGGCCGTA TCCGTGGCGA CGTCTTCGAA TGCGACGCGA TCACCGAGAT CCCGAACATC CAGGGCTTCC ACGGCATCTT CTCGGACAAG CGCGACCCGG TCGACAAGGC CATCAACCAC ACCACCCGCG TGTTCTGCGG CAACGAGTTC CATATCCCGC AGCCGAACGA CGGCCGCGAC CTCGACGATC CGACCAAGTA TTTCTGCCTG TTCACCTGCG TCGACGCCGA GACGATGGAA GTGCGCTGGC AGTGCAAGGT CGACGGCAAC ATGGATCTGG TCGCGACCTC CTATGACGGC CGCTTCGCCG CCGCCAACCA GTACAACACC GAGGGCGGCG CGCGCTACGA AGACATGATG TCGGCCGAGA TGGACGCCTG CGTGTTCTTC GACGTCGCCC GCATCGAAAA GGCGATCAAG GACGGCAAGT CCTTCACCGT CGGCACCTCC AAGGTCCCGG TCGTCGACGG CACCAAGGCC GCCAACCCCG ATCCGAAGAC CGCGCTGACC GCCTATGTAC CGGTCCCCAA GAACCCGCAC GGCGTCAACG CCAGCCCGGA CGGCAAGTAC ATGATCTGCT CGGGCAAGCT GTCGCCCACC TGCACGGTCA TCGAAGTCGC CAAGGTCGCC GACTTCCTCG ACGGCAAGCT CGACGACATC CGCAAGTCGG TGGTCGCTGA AGTCGAAGTC GGCCTCGGCC CCCTGCACAC CACCTACGAC GGCCGCGGCA ACGCCTTCAC GACCCTCTTC CTGGACAGCC AGATCGTCAA GTGGAACATC GACGCCGCGA TCAAGTTCCA TAACGGCGAC AAGGCCGCGC AGTACGTGGT CGACCGTATC GATGTGCATT ACCAGCCCGG CCACATCAGC GCCGACATGG GCGAAACCAA GGAAGCCAGC GGCCAGTTCC TGGCGGTGGG CTGCAAGTTC TCCAAGGACC GCTTCCTGCC GGTGGGTCCG ATGCACCCGG AAAACGAGCA GCTGATCGAC ATCCGTGGCG AGAAGATGGT GCTGCTGGCC GACCACCCGG TGCATCCGGA GCCGCACGAC TTCATCATCG TCAAGCGCGA ACTCATCAAG ACCCGTCAGG TCGGCGTGCT GGACGCACAC CCGCTGGCGA TCAAGGATGC AAAGGAATCG GGCGTGTTCC GCGACGGCAA CAAGGTCACG GTGAGGATCG CCTCGCAGGC ACCGGCCTAT AGCCTGCGCG AGTTCGAACT CAAGGTGGGC GACGAGGTCA CCCTCATCCT CACCAACCTC GACAAGGTGG AAGACCTGTC GCACGGCTTC GCGATTCCGA AGTACGACAT CAACTTCGTG GTGAACCCGC TGGAGACGAA GTCCGTCACC TTCAAGGCCG ACAAGCCGGG CGTGTTCTGG TGCTATTGCA CGCACTTCTG CCACGCGCTG CACCTGGAGA TGCGTACGCG CATGCTCGTG CGTCCGTAA
|
Protein sequence | MNDAKMNPVE AMPDPSRRKF LNTAALAGLA GAGMSVGLSA CNKETAPAPA AAPAAAPAAA PAAAHAASGL KLHMAPGELD TYYGIWSGGH SGECRVLGLP SGREIKRVPT FNIDCMSGWG ITNESKKVIG TRADGQLKYK TGDTHHIHGS YTDGTYDGKY FWVNDKIHGR LGRIRGDVFE CDAITEIPNI QGFHGIFSDK RDPVDKAINH TTRVFCGNEF HIPQPNDGRD LDDPTKYFCL FTCVDAETME VRWQCKVDGN MDLVATSYDG RFAAANQYNT EGGARYEDMM SAEMDACVFF DVARIEKAIK DGKSFTVGTS KVPVVDGTKA ANPDPKTALT AYVPVPKNPH GVNASPDGKY MICSGKLSPT CTVIEVAKVA DFLDGKLDDI RKSVVAEVEV GLGPLHTTYD GRGNAFTTLF LDSQIVKWNI DAAIKFHNGD KAAQYVVDRI DVHYQPGHIS ADMGETKEAS GQFLAVGCKF SKDRFLPVGP MHPENEQLID IRGEKMVLLA DHPVHPEPHD FIIVKRELIK TRQVGVLDAH PLAIKDAKES GVFRDGNKVT VRIASQAPAY SLREFELKVG DEVTLILTNL DKVEDLSHGF AIPKYDINFV VNPLETKSVT FKADKPGVFW CYCTHFCHAL HLEMRTRMLV RP
|
| |