Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_2618 |
Symbol | |
ID | 7873359 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 2822452 |
End bp | 2824134 |
Gene Length | 1683 bp |
Protein Length | 560 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643699541 |
Product | Nitrite reductase (NO-forming) |
Protein accession | YP_002889597 |
Protein GI | 237653283 |
COG category | [C] Energy production and conversion |
COG ID | [COG2010] Cytochrome c, mono- and diheme variants |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0412143 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAAAGA AAGGTTTGAT CGGATCGGCG TTTGCCATGG GGGTGCTGTC CCTCGCGATG AGCAGCGCCT GGGCCCAGGA AGCGCCCAAG CTGACCGCCG AGGAGATGGA AAAAGGCAAG CAGATCTACT TCGAGCGCTG CGCAGGTTGC CACGGCGTGC TGCGCAAGGG CGCCACCGGC AAGAACCTCG AGCCGCACTG GACCAAGACC GCTCCGGACG GCACCAAGCT GGAGGGCGGC ACCCTCAAGC TCGGCACCGA GCGCCTCGAG AAGATCATCT CGTACGGCAC CGAAGGCGGC ATGGTCAACT ACGATGACAT CCTGACCAAG GAAGAGATCA ACCTGATGGC GCGTTACATC CAGAACACGC CTCCGATCCC GCCCGAGTTC TCGCTCAAGG ACATGAAGGA CAGCTGGAAG CTGCTGGTCC CCGTCAAGGA CCGTCCGACC AAGCAGATGA ACAAGGTCAA CCTGAAGAAC GTGTTCGCGA TCACGCTGCG CGACGCGGGC AAGCTCGCGC TGGTCGACGG CGACACCCAC AAGATCTGGA AGATCCTCGA TACCGGCTAC GCGGTGCACA TCTCCCGCCT GTCGGCTTCG GGCCGCTACG TCTACACCGT CGGTCGTGAC GGCCTGACCA CCATCATCGA CATGTTCTAC GAAGAGCCCA CCACGGTCGC CACCGTGCGC CTGGGTTCGG ATGCCCGTTC GGTCGATACC TCCAAGTTCA AGGGCTACGA GGACAAGTAC CTGATCGGTG GCACCTACTG GCCGCCCCAG TACTCGATCA TGGACGGCGA GACGCTCGAG CCGATGAAGA TCGTGTCGAC CCGTGGCAAC ACCGTCGATG GCGACTATCA CCCCGAGCCG CGCGTGGCCT CCATCGTGGC CTCGATGACG AAGCCGGAGT GGGTGGTCAA CGTCAAGGAA ACCGGCCAGA TCCTGCTGGT CGATTACTCC GACATCAAGA ACCTGAAGAG CATCGCGATC GAGTCCGCCA AGTTCCTGCA CGATGGCGGC TGGGACATGT CGAAGCGCTA TTTCATGGTC GCCGCCAACG CGTCCAACAA GGTCGCCGCG GTCGACACCA AGGAAGGCAA GCTGGCTGCG CTGGTCGATA CCGCCAAGAT CCCGCACCCG GGGCGTGGCG CCAACTTCGT CCATCCGAAG TTCGGTCCGG TGTGGTCGAC GGGTCACCTC GGCGATGCCG TGGTCTCGCT GATCTCGACC GCTTCCGACG ATCCCAAGTT CGCCAAGTAC AAGGAGCACA ACTGGAAGGT CGTCGAACAG CTGAAGATGC CGGGCGCCGG CAACCTGTTC GTCAAGACGC ACCCGAAGTC GAAGCACTTC TGGGCCGATG CGCCGATGAA CCCGGAGCGT GAAGTGGCCG AGTCGGTGTA CGTGTTCGAC ATGAACGATC TGTCGAAGGC GCCGGTCGCG CTCAACGTCG CCAAGGACTC GGGCCTGCCC GAGAGCAAGG CCATCCGCCG CGCCGTGCAG CCGGAGTACA ACGAGGCTGG CGACGAAGTG TGGATCTCGC TGTGGGGCGG CAAGACCGAC CAGTCCGCGA TCGTGATCTA CGACGACAAG ACCCTGAAGG TCAAGAAGGT GATCACGGAC CCGGCGATCG TCACGCCGAC CGGCAAGTTC AACGTGTACA ACACGATGCA CGACGTTTAT TGA
|
Protein sequence | MQKKGLIGSA FAMGVLSLAM SSAWAQEAPK LTAEEMEKGK QIYFERCAGC HGVLRKGATG KNLEPHWTKT APDGTKLEGG TLKLGTERLE KIISYGTEGG MVNYDDILTK EEINLMARYI QNTPPIPPEF SLKDMKDSWK LLVPVKDRPT KQMNKVNLKN VFAITLRDAG KLALVDGDTH KIWKILDTGY AVHISRLSAS GRYVYTVGRD GLTTIIDMFY EEPTTVATVR LGSDARSVDT SKFKGYEDKY LIGGTYWPPQ YSIMDGETLE PMKIVSTRGN TVDGDYHPEP RVASIVASMT KPEWVVNVKE TGQILLVDYS DIKNLKSIAI ESAKFLHDGG WDMSKRYFMV AANASNKVAA VDTKEGKLAA LVDTAKIPHP GRGANFVHPK FGPVWSTGHL GDAVVSLIST ASDDPKFAKY KEHNWKVVEQ LKMPGAGNLF VKTHPKSKHF WADAPMNPER EVAESVYVFD MNDLSKAPVA LNVAKDSGLP ESKAIRRAVQ PEYNEAGDEV WISLWGGKTD QSAIVIYDDK TLKVKKVITD PAIVTPTGKF NVYNTMHDVY
|
| |