Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4532 |
Symbol | nrfA |
ID | 6143161 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4633821 |
End bp | 4635257 |
Gene Length | 1437 bp |
Protein Length | 478 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641619348 |
Product | cytochrome c552 |
Protein accession | YP_001746460 |
Protein GI | 170679805 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3303] Formate-dependent nitrite reductase, periplasmic cytochrome c552 subunit |
TIGRFAM ID | [TIGR03152] formate-dependent cytochrome c nitrite reductase, c552 subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 57 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAAGGA TAAAAATAAA CGCACGCCGT ATCTTCAGCT TATTGATTCC TTTTTTCTTT TTCACTTCTG TTCACGCTGA ACAAACGGCA GCTCCCGCAA AACCTGTAAC TGTGGAAGCG AAGAATGAAA CCTTTGCCCC GCAACATCCC GATCAATATC TCTCCTGGAA AGCCACCTCG GAACAGTCAG AGCGTGTTGA CGCCCTGGCG GAAGATCCAC GGCTGGTGAT CCTGTGGGCG GGGTATCCCT TCTCGCGCGA TTACAACAAG CCGCGTGGAC ATGCCTTTGC TGTGACCGAT GTGCGTGAAA CCCTGCGTAC CGGTGCGCCG AAAAACGCTG AAGATGGTCC GCTACCGATG GCATGCTGGA GTTGTAAAAG CCCGGATGTG GCGCGTCTGA TCCAGAAAGA CGGCGAAGAT GGCTACTTCC ACGGAAAATG GGCGCGCGGC GGCCCGGAAA TCGTCAACAA CTTAGGTTGT GCCGATTGCC ATAACACCGC CTCACCAGAG TTCGCCAAAG GCAAACCGGA GTTAACCCTT TCCCGTCCGT ATGCGGCTCG CGCGATGGAA GCCATTGGTA AACCTTTTGA GAAAGCCGGA CGTTTCGACC AGCAATCGAT GGTTTGCGGT CAGTGCCATG TGGAGTATTA CTTCGACGGC AAAAACAAAG CGGTTAAATT CCCGTGGGAT GACGGCATGA AAGTCGAAAA TATGGAGCAG TATTACGACA AAATTGCCTT CTCTGACTGG ACTAACTCCC TGTCGAAAAC GCCAATGCTG AAAGCGCAGC ACCCGGAATA TGAAACCTGG ACAGCGGGCA TTCACGGTAA AAACAACGTG ACCTGTATCG ACTGCCATAT GCCAAAAGTG CAGAACGCCG AAGGCAAACT CTACACCGAC CATAAAATTG GTAATCCGTT TGATAACTTC GCCCAGACTT GTGCGAACTG CCATACCCAG GACAAAGCTG CCTTGCAAAA AGTGGTCGCG GAACGTAAGC AGTCGATTAA CGACCTGAAA ATCAAGGTTG AAGATCAACT GGTTCACGCT CACTTCGAAG CGAAAGCGGC GCTGGATGCA GGCGCGACGG AAGCCGAAAT GAAGCCTATT CAGGACGATA TCCGTCATGC CCAGTGGCGT TGGGATCTGG CGATCGCTTC CCACGGCATT CATATGCACG CACCGGAAGA AGGTCTGAGG ATGCTCGGTA CGGCGATGGA TAAAGCGGCG GATGCGCGCA CTAAACTGGC GCGACTGCTG GCGACCAAAG GCATCACCCA TGAAATCCAG ATCCCGGATA TCTCGACCAA AGAGAAAGCC CAGCAGGCCA TTGGCCTGAA CATGGAACAA ATCAAGGCCG AGAAGCAGGA CTTTATCAAA ACGGTGATCC CGCAGTGGGA AGAGCAGGCA CGTAAAAACG GTCTGTTAAG CCAATAA
|
Protein sequence | MTRIKINARR IFSLLIPFFF FTSVHAEQTA APAKPVTVEA KNETFAPQHP DQYLSWKATS EQSERVDALA EDPRLVILWA GYPFSRDYNK PRGHAFAVTD VRETLRTGAP KNAEDGPLPM ACWSCKSPDV ARLIQKDGED GYFHGKWARG GPEIVNNLGC ADCHNTASPE FAKGKPELTL SRPYAARAME AIGKPFEKAG RFDQQSMVCG QCHVEYYFDG KNKAVKFPWD DGMKVENMEQ YYDKIAFSDW TNSLSKTPML KAQHPEYETW TAGIHGKNNV TCIDCHMPKV QNAEGKLYTD HKIGNPFDNF AQTCANCHTQ DKAALQKVVA ERKQSINDLK IKVEDQLVHA HFEAKAALDA GATEAEMKPI QDDIRHAQWR WDLAIASHGI HMHAPEEGLR MLGTAMDKAA DARTKLARLL ATKGITHEIQ IPDISTKEKA QQAIGLNMEQ IKAEKQDFIK TVIPQWEEQA RKNGLLSQ
|
| |