Gene Information |
Locus tag | EcHS_A4315 |
Symbol | nrfA |
ID | 5591369 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 4319530 |
End bp | 4320966 |
Gene Length | 1437 bp |
Protein Length | 478 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640923413 |
Product | cytochrome c552 |
Protein accession | YP_001460858 |
Protein GI | 157163540 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3303] Formate-dependent nitrite reductase, periplasmic cytochrome c552 subunit |
TIGRFAM ID | [TIGR03152] formate-dependent cytochrome c nitrite reductase, c552 subunit |
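
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
# Assumption: coordinates are 1-based and inclusive.
start_bp = 4319530
end_bp = 4320966

# Inclusive coordinates: both endpoints count toward the length.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # 1437 478
```

Under these assumptions the result matches the reported gene length of 1437 bp and protein length of 478 aa.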
Plasmid Coverage information |
Num covering plasmid clones | 67 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAAGGA TAAAAATAAA CGCACGCCGT ATCTTCAGCT TATTGATTCC CTTTTTCTTT TTCACTTCTG TTCACGCTGA ACAAACGGCA GCTCCCGCAA AAACTGTAAC TGTGGAAGCG AAGAATGAAA CCTTTGCCCC GCAGCATCCC GATCAATATC TCTCCTGGAA AGCCACCTCG GAACAGTCAG AGCGTGTTGA CGCCCTGGCG GAAGATCCAC GGCTGGTGAT CCTGTGGGCG GGGTATCCCT TCTCGCGCGA TTACAACAAG CCGCGTGGAC ATGCTTTTGC TGTGACCGAT GTGCGTGAAA CCCTGCGTAC CGGTGCGCCG AAAAACGCTG AAGATGGTCC GCTACCGATG GCGTGCTGGA GTTGTAAAAG CCCGGATGTG GCGCGTCTGA TCCAGAAAGA CGGCGAAGAT GGCTACTTCC ACGGTAAATG GGCGCGCGGC GGCCCGGAAA TCGTCAACAA CTTAGGTTGT GCCGACTGCC ATAACACCGC CTCACCAGAG TTCGCCAAAG GCAAACCGGA GTTAACCCTT TCCCGTCCGT ATGCGGCTCG CGCGATGGAA GCCATTGGTA AACCTTTTGA GAAAGCCGGA CGTTTCGACC AGCAATCGAT GGTTTGCGGT CAGTGCCATG TGGAGTATTA CTTCGACGGC AAAAACAAAG CGGTTAAATT CCCGTGGGAT GACGGCATGA AAGTCGAAAA TATGGAGCAG TATTACGACA AAATTGCCTT CTCTGACTGG ACTAACTCCC TGTCGAAAAC GCCAATGCTG AAAGCGCAGC ACCCGGAATA TGAAACCTGG ACAGCGGGCA TTCACGGTAA AAACAACGTG ACCTGTATCG ACTGCCATAT GCCAAAAGTG CAGAACGCCG AAGGCAAACT CTACACCGAC CATAAAATTG GTAATCCGTT TGATAACTTC GCCCAGACTT GTGCGAACTG CCATACCCAG GACAAAGCTG CCTTGCAAAA AGTGGTCGCG GAACGTAAGC AGTCGATTAA CGACCTGAAA ATCAAGGTTG AAGATCAACT GGTTCACGCT CACTTCGAAG CGAAAGCAGC GCTGGATGCA GGCGCGACGG AAGCCGAAAT GAAGCCAATT CAGGACGATA TCCGTCATGC CCAGTGGCGC TGGGATCTGG CGATCGCTTC CCACGGCATT CATATGCACG CACCGGAAGA AGGTTTACGG ATGCTCGGTA CGGCGATGGA TAAAGCGGCG GATGCACGCA CCAAACTGGC GCGCCTGCTG GCGACCAAAG GCATCACCCA TGAAATCGAG ATCCCGGATA TCTCGACCAA AGAGAAAGCC CAGCAGGCCA TTGGCCTGAA CATGGAACAA ATCAAGGCCG AGAAGCAGGA CTTCATCAAA ACGGTGATCC CGCAGTGGGA AGAACAGGCA CGTAAAAACG GTCTGTTAAG CCAATAA
Protein sequence | MTRIKINARR IFSLLIPFFF FTSVHAEQTA APAKTVTVEA KNETFAPQHP DQYLSWKATS EQSERVDALA EDPRLVILWA GYPFSRDYNK PRGHAFAVTD VRETLRTGAP KNAEDGPLPM ACWSCKSPDV ARLIQKDGED GYFHGKWARG GPEIVNNLGC ADCHNTASPE FAKGKPELTL SRPYAARAME AIGKPFEKAG RFDQQSMVCG QCHVEYYFDG KNKAVKFPWD DGMKVENMEQ YYDKIAFSDW TNSLSKTPML KAQHPEYETW TAGIHGKNNV TCIDCHMPKV QNAEGKLYTD HKIGNPFDNF AQTCANCHTQ DKAALQKVVA ERKQSINDLK IKVEDQLVHA HFEAKAALDA GATEAEMKPI QDDIRHAQWR WDLAIASHGI HMHAPEEGLR MLGTAMDKAA DARTKLARLL ATKGITHEIE IPDISTKEKA QQAIGLNMEQ IKAEKQDFIK TVIPQWEEQA RKNGLLSQ
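
The relationship between the two sequences can be illustrated with a minimal sketch that translates a nucleotide string and computes its GC fraction. It is not the database's own pipeline. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is written out by hand on the assumption that translation table 11 uses the same codon-to-amino-acid assignments as the standard code for a gene that starts with ATG, as this one does. Only the first three 10-base blocks of the gene sequence are used as input.

```python
# Codon table built from the conventional TCAG ordering.
# Assumption: table 11 assigns the same amino acids as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence above.
first_blocks = "ATGACAAGGA TAAAAATAAA CGCACGCCGT"
print(translate(first_blocks))  # MTRIKINARR
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the reported GC content of 53%.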