Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A4318 |
Symbol | nrfD |
ID | 5595072 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 4322242 |
End bp | 4323198 |
Gene Length | 957 bp |
Protein Length | 318 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640923416 |
Product | nrfD protein |
Protein accession | YP_001460861 |
Protein GI | 157163543 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3301] Formate-dependent nitrite reductase, membrane component |
TIGRFAM ID | [TIGR03148] cytochrome c nitrite reductase, NrfD subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 73 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGCAGA CTTCCGCATT TCATTTTGAA TCGCTGGTGT GGGACTGGCC GATTGCCATC TACCTGTTTT TGATTGGTAT TTCTGCCGGT CTGGTGACGC TGGCCGTGCT GTTACGTCGC TTCTACCCGC AGGCGGGCGG TGCAGACAGT ACGTTGCTGC GCACCACGCT GATTGTCGGG CCGGGCGCGG TGATCCTCGG TCTGTTGATC CTCGTCTTCC ACCTGACAAG ACCGTGGACC TTCTGGAAGC TGATGTTCCA CTACAGTTTT ACCTCGGTGA TGTCGATGGG GGTGATGCTG TTTCAGCTCT ACATGGTGGT GCTGGTGCTG TGGCTGGCGA AAATCTTTGA ACATGATTTG CTTGCCCTGC AACAACGCTG GTTGCCGAAG CTGGGGATCG TGCAAAAGGT TCTGAGCCTG CTGACGCCCG TTCATCGCGG ACTGGAAACA TTGATGCTGG TGTTGGCGGT GTTGTTGGGG GCTTATACCG GCTTTCTGCT GTCGGCGCTG AAATCGTATC CGTTCCTCAA TAACCCGATC CTGCCGGTGC TGTTCCTCTT CTCCGGCATC TCGTCCGGTG CGGCGGTGGC GCTGATCGCC ATGGCGATAC GCCAACGCAG TAACCCGCAT TCCACGGAAG CGCAGTTTGT ACACCGTATG GAAATCCCCG TGGTATGGGG TGAAATCTTC CTGCTGGTGG CGTTTTTTGT CGGTCTGGCG CTGGGCGATG ACGGTAAAGT GCGTGCGCTG GTGGCGGCAT TAGGTGGCGG TTTCTGGACG TGGTGGTTCT GGCTTGGTGT CGCCGGGCTG GGGCTGATTG TGCCAATGTT GCTCAAACCG TGGGTCAATC GCAGTTCCGG CATTCCTGCC GTGCTGGCGG CGTGTGGGGC CAGTCTGGTC GGCGTGTTGA TGCTGCGCTT TTTCATTCTC TACGCCGGGC AGTTAACGGT GGCGTAA
|
Protein sequence | MTQTSAFHFE SLVWDWPIAI YLFLIGISAG LVTLAVLLRR FYPQAGGADS TLLRTTLIVG PGAVILGLLI LVFHLTRPWT FWKLMFHYSF TSVMSMGVML FQLYMVVLVL WLAKIFEHDL LALQQRWLPK LGIVQKVLSL LTPVHRGLET LMLVLAVLLG AYTGFLLSAL KSYPFLNNPI LPVLFLFSGI SSGAAVALIA MAIRQRSNPH STEAQFVHRM EIPVVWGEIF LLVAFFVGLA LGDDGKVRAL VAALGGGFWT WWFWLGVAGL GLIVPMLLKP WVNRSSGIPA VLAACGASLV GVLMLRFFIL YAGQLTVA
|
| |