Gene Information |
Locus tag | Tmz1t_2822 |
Symbol | |
ID | 7873230 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 3050404 |
End bp | 3052161 |
Gene Length | 1758 bp |
Protein Length | 585 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 643699743 |
Product | Respiratory-chain NADH dehydrogenase domain 51 kDa subunit |
Protein accession | YP_002889798 |
Protein GI | 237653484 |
COG category | [C] Energy production and conversion |
COG ID | [COG1894] NADH:ubiquinone oxidoreductase, NADH-binding (51 kD) subunit |
TIGRFAM ID | |
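The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch in Python. It assumes 1-based inclusive coordinates (the record does not state the convention) and that the final codon of the CDS is a stop codon; the function and variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
START_BP = 3050404
END_BP = 3052161


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1758 585
```

Under these assumptions the computed values agree with the listed Gene Length (1758 bp) and Protein Length (585 aa).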
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.112551 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTTCTC CCTCCCCCTC CCCCTCCCCC TCCGCCTCCC AAGCCGCAGG CCGGCACACC CGCCCCGGCC TGCGCGGACG CCAGACCGAT CCCGCCGCGC TCGCCGAGAT CGAGGCCCTG CTCGGCGCTG CCCACCGCGA GCGCGACGAG CTGATCGAGC ACCTGCATGC GCTGCAGGAC CGCTTCGGCC ACCTCTCGCT GCGCCACCTG CGCGCGCTCG CCGACTGGAT GCGGATGCCG ATGGCCGAGG TGTACGAGAC CGCCACCTTC TACGCCCACT TCGACGTCGT GCGCGAGGAC GAGCCGGTGC CACCCGCGCT TACCGTGCGC GTGTGCGACT CCCTGCCCTG CCAGCTCGCC GGCGCGCAGG CGCTGCGCGC GGCGCTCGAC GCCGCGCTCG ATCCGGCGCG CATCCGCGTG CTGCGCGCGC CCTGCATGGG ACGCTGCGAC CAGGCCCCGG TCGCCCAGCT CGGCCGCCGT CACCTGAGCC GCGCGACCCC GGCCGCGGTC CTGGCCGCAC TCGCCCGCGG AGCGCTCGAC CCCGAACCGA TCGCCTGGCA GCGCCTCGCC GACTATCGCG CCGCAGGCGG CTACACCCTG CTCCAGCGCC TGCGCAGCGG CGAGACCGGC GTTGCCGCGC TCGAGGCCCG GCTCGCCGAA GCCGGCCTGC GCGGCCTCGG CGGTGCCGGC TTCCCGACCG CGCGCAAGTG GCAGGCGGTG CGCGCCGGCG CCGCACCGCG CTACCTGGTC GTCAATGCCG ACGAAGGCGA GCCGGGCACC TTCAAGGACC GCCACTACCT GGAAACGGCG CCGCACCGGG TGCTCGAGGG CGCGCTGGCG AGCGCGCTCG CGGTGGGCGC GGCGGCGATC TACGTCTACC TGCGCGACGA ATACCCCGGC CTGCACGCGG TGCTGCGCGA GGCGATCGCC GAACTGGAGG CCGCCGGCCT CGTCGCACCC GGCTTCATCG TGTTGCGCCG CGGCGCCGGC GCCTACATCT GCGGCGAGGA GTCGGCGCTG ATCGAGTCGC TCGAAGGCAA GCCGGGCAAG CCGCGCCATC GCCCGCCCTT CGTCGCCGAG GCCGGGCTCT TCGGCCGGCC GACGCTGGTG AACAACGTCG AGACCCTGTA CTGGATCCCG CTGCTCGCCG CCGGCGCCGC CTTCGCCGGC GAGGGGCGGC GCGGGCGCAG CGGCCTGCGC AGCTTCTCGG TGTCTGGCCG GGTGAACAAG CCGGGCGTGC ACCTGGCCCC CGCCGGCATC ACCCTGCGCG AGCTGGTGGA CGAGCACTGC GGCGGCCTGC AGCCTGGCCA CCGCCTGCTC GCCTACCTGC CCGGCGGCGC CTCGGGCGGC ATCCTGCCCG CCGCGCTCGC CGATCTGCCG CTCGACTTCG ACACCTTGCA GCCCCACGGC AGCTTCATCG GCTCGGCGGC GATCATCGTG CTCTCGGACC AGGACGATCT GCGCGCCGTC GCCGACAACC TGCTCGCCTT CTTCGCCGAC GAATCCTGCG GCCAATGCAC GCCCTGCCGC CTCGGCACCG AAAAGCTGCT CACGCTGCTG CGCACGGACG ACTGGGACGT GGCGCGGCTG CAGGCCCTCG CGCAGACCCT GCGCGACGCC TCGATCTGCG GCCTCGGCCA GGCCGCGCCC AATCCGGTGA GCAGCCTGCT GCGCTTCTTC CCGGCCGAGC TCGCGCGCGC CGGGGTGAGG CTGCATGCCG GGCCTCCTGC CCGCGATGCC GGGGAGCTGC AGCCATGA
Protein sequence | MPSPSPSPSP SASQAAGRHT RPGLRGRQTD PAALAEIEAL LGAAHRERDE LIEHLHALQD RFGHLSLRHL RALADWMRMP MAEVYETATF YAHFDVVRED EPVPPALTVR VCDSLPCQLA GAQALRAALD AALDPARIRV LRAPCMGRCD QAPVAQLGRR HLSRATPAAV LAALARGALD PEPIAWQRLA DYRAAGGYTL LQRLRSGETG VAALEARLAE AGLRGLGGAG FPTARKWQAV RAGAAPRYLV VNADEGEPGT FKDRHYLETA PHRVLEGALA SALAVGAAAI YVYLRDEYPG LHAVLREAIA ELEAAGLVAP GFIVLRRGAG AYICGEESAL IESLEGKPGK PRHRPPFVAE AGLFGRPTLV NNVETLYWIP LLAAGAAFAG EGRRGRSGLR SFSVSGRVNK PGVHLAPAGI TLRELVDEHC GGLQPGHRLL AYLPGGASGG ILPAALADLP LDFDTLQPHG SFIGSAAIIV LSDQDDLRAV ADNLLAFFAD ESCGQCTPCR LGTEKLLTLL RTDDWDVARL QALAQTLRDA SICGLGQAAP NPVSSLLRFF PAELARAGVR LHAGPPARDA GELQP
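The sequences above are displayed in space-separated blocks of ten characters. Below is a minimal sketch, in Python, of how such a display string might be normalized and its GC fraction computed. The function names are hypothetical, and the short input is only the first two blocks of the gene sequence, used for illustration rather than the full gene.

```python
def normalize_sequence(display: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(display.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First two 10-base blocks of the gene sequence, for illustration only.
sample = normalize_sequence("ATGCCTTCTC CCTCCCCCTC")
print(len(sample), gc_fraction(sample))
```

Applying the same two functions to the full gene sequence would allow a comparison with the listed GC content of 75%.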