Gene Information |
Locus tag | EcSMS35_4535 |
Symbol | nrfD |
ID | 6144499 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4636534 |
End bp | 4637490 |
Gene Length | 957 bp |
Protein Length | 318 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641619351 |
Product | nrfD protein |
Protein accession | YP_001746463 |
Protein GI | 170682297 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3301] Formate-dependent nitrite reductase, membrane component |
TIGRFAM ID | [TIGR03148] cytochrome c nitrite reductase, NrfD subunit |
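
The coordinate and length fields above agree with one another if Start bp and End bp are read as 1-based, inclusive positions and if the protein length excludes the stop codon. Both readings are assumptions, since the record does not state them. A minimal sketch of the arithmetic in Python (variable names are illustrative only):

```python
# Values copied from the record above.
start_bp = 4636534
end_bp = 4637490

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 957
print(protein_length)  # 318
```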
| |
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 59 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCAGA CTTCCGCATT TCATTTTGAA TCACTGGTGT GGGACTGGCC GATTGCCATC TACCTGTTTT TGATTGGTAT TTCTGCCGGT CTGGTGACGC TGGCCGTGCT GTTACGTCGC TTCTACCCGC AGGCGGGCGG TGCAGACAGT ACGTTGCTGC GCACCACGCT GATTGTCGGA CCTGGCGCGG TGATCCTCGG TCTGTTGATC CTCGTCTTCC ACCTGACAAG ACCGTGGACC TTCTGGAAGC TGATGTTCCA CTACAGTTTT ACCTCGGTGA TGTCGATGGG GGTGATGCTG TTTCAGCTTT ACATGGTGGT GCTGGTGCTG TGGCTGGCGA AAATCTTTGA ACATGATTTG CTTGCCCTGC AACAACGCTG GTTGCCGAAG CTGGGGATCG TGCAAAAGGT TCTGAGCCTG CTGACGCCCG TTCATCGCGG ACTGGAAACA TTGATGCTGG TGCTGGCGGT GCTGCTGGGG GCTTATACCG GCTTTCTGCT ATCGGCGCTG AAATCGTATC CGTTCCTCAA TAACCCGATC CTGCCGGTGC TGTTCCTCTT CTCCGGCATC TCGTCCGGTG CGGCGGTGGC GCTGATCGCC ATGGCGATAC GCCAACGCAG TAACCCGCAT TCCACGGAAG CGCACTTTGT ACACCGTATG GAGATCCCTG TGGTATGGGG CGAAATCTTC CTGCTGGTGG CGTTTTTTGT CGGTCTGGCG CTGGGCGATG ACGGTAAAGT GCGTGCGCTG GTGGCGGCAT TGGGCGGCGG TTTCTGGACG TGGTGGTTCT GGCTTGGTGT CGCCGGGCTG GGGTTGATTG TGCCAATGTT GCTCAAACCG TGGGTGAATC GTAGTTCCGG CATTCCTGCC GTGCTGGCGG CGTGTGGGGC CAGCCTGGTC GGCGTGTTGA TGCTGCGCTT TTTCATTCTC TACGCCGGGC AGTTAACGGT GGCGTAA
|
Protein sequence | MTQTSAFHFE SLVWDWPIAI YLFLIGISAG LVTLAVLLRR FYPQAGGADS TLLRTTLIVG PGAVILGLLI LVFHLTRPWT FWKLMFHYSF TSVMSMGVML FQLYMVVLVL WLAKIFEHDL LALQQRWLPK LGIVQKVLSL LTPVHRGLET LMLVLAVLLG AYTGFLLSAL KSYPFLNNPI LPVLFLFSGI SSGAAVALIA MAIRQRSNPH STEAHFVHRM EIPVVWGEIF LLVAFFVGLA LGDDGKVRAL VAALGGGFWT WWFWLGVAGL GLIVPMLLKP WVNRSSGIPA VLAACGASLV GVLMLRFFIL YAGQLTVA
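
The gene sequence above is printed in blocks of ten bases, so the spaces need to be removed before it can be analysed. The sketch below shows one way to do that, compute a GC fraction, and translate codons. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ in which start codons are permitted). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first three blocks of the gene sequence are used to keep the example short.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: these are sufficient for table 11, whose amino acid
# assignments match the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence, copied from the record.
first_blocks = "ATGACGCAGA CTTCCGCATT TCATTTTGAA"
print(translate(first_blocks))  # MTQTSAFHFE
print(gc_fraction(first_blocks))
```

The translated fragment matches the first block of the protein sequence listed above. Passing the full gene sequence to the same functions should, under the same assumptions, give values comparable to the Protein sequence and GC content fields.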