Gene EcE24377A_4626 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_4626 
SymbolnrfA 
ID5586873 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp4626846 
End bp4628282 
Gene Length1437 bp 
Protein Length478 aa 
Translation table11 
GC content53% 
IMG OID640928242 
Productcytochrome c552 
Protein accessionYP_001465574 
Protein GI157157304 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3303] Formate-dependent nitrite reductase, periplasmic cytochrome c552 subunit 
TIGRFAM ID[TIGR03152] formate-dependent cytochrome c nitrite reductase, c552 subunit 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAAGGA TAAAAATAAA CGCACGCCGT ATCTTCAGCT TATTGATTCC TTTTTTCTTT 
TTCACTTCTG TTCACGCTGA ACAAACGGCA GCTCCCGCAA AACCTGTAAC TGTGGAAGCG
AAGAATGAAA CCTTTGCCCC GCAGCATCCC GATCAATATC TCTCCTGGAA AGCCACCTCG
GAACAGTCAG AGCGTGTTGA CGCCCTGGCG GAAGATCCAC GGCTGGTGAT CCTGTGGGCG
GGGTATCCCT TCTCGCGCGA TTACAACAAG CCGCGTGGAC ATGCCTTTGC TGTGACCGAT
GTGCGTGAAA CCCTGCGTAC CGGTGCGCCG AAAAACGCTG AAGATGGTCC GCTACCGATG
GCGTGCTGGA GTTGTAAAAG CCCGGATGTG GCGCGTCTGA TCCAGAAAGA CGGCGAAGAT
GGCTACTTCC ACGGTAAGTG GGCGCGCGGC GGCCCGGAAA TCGTCAACAA CTTAGGTTGT
GCCGACTGCC ATAACACCGC CTCACCAGAG TTCGCCAAAG GCAAACCGGA GTTAACCCTT
TCCCGTCCGT ATGCGGCTCG CGCGATGGAA GCCATTGGTA AACCTTTTGA GAAAGCCGGA
CGTTTCGACC AGCAATCGAT GGTTTGCGGT CAGTGCCATG TGGAGTATTA CTTCGACGGC
AAAAACAAAG CGGTTAAATT CCCGTGGGAT GACGGCATGA AAGTCGAAAA TATGGAGCAG
TATTACGACA AAATTGCCTT CTCTGACTGG ACTAACTCCC TGTCGAAAAC GCCAATGCTG
AAAGCGCAGC ACCCGGAATA TGAAACCTGG ACAGCGGGCA TTCACGGTAA AAACAACGTG
ACCTGTATCG ACTGCCATAT GCCAAAAGTG CAGAACGCCG AAGGCAAACT CTACACCGAC
CATAAAATTG GTAATCCGTT TGATAACTTC GCCCAGACTT GTGCGAACTG CCATACCCAG
GACAAAGCTG CCTTGCAAAA AGTGGTCGCG GAACGTAAGC AGTCGATTAA CGACCTGAAA
ATCAAGGTTG AAGATCAACT GGTTCACGCT CACTTCGAAG CGAAAGCAGC GCTGGATGCA
GGCGCGACGG AAGCCGAAAT GAAGCCAATT CAGGACGATA TCCGTCATGC CCAGTGGCGC
TGGGATCTGG CGATCGCTTC CCACGGCATT CATATGCACG CACCGGAAGA AGGTTTACGG
ATGCTCGGTA CGGCGATGGA TAAAGCGGCG GATGCACGCA CCAAACTGGC GCGCCTGCTG
GCGACCAAAG GCATCACCCA TGAAATCCAG ATCCCGGATA TCTCAACCAA AGAGAAAGCC
CAGCAGGCTA TTGGCCTGAA CATGGAACAA ATCAAGGCCG AGAAGCAGGA CTTCATCAAA
ACGGTGATCC CGCAGTGGGA AGAACAGGCA CGTAAAAACG GTCTGTTAAG CCAATAA
 
Protein sequence
MTRIKINARR IFSLLIPFFF FTSVHAEQTA APAKPVTVEA KNETFAPQHP DQYLSWKATS 
EQSERVDALA EDPRLVILWA GYPFSRDYNK PRGHAFAVTD VRETLRTGAP KNAEDGPLPM
ACWSCKSPDV ARLIQKDGED GYFHGKWARG GPEIVNNLGC ADCHNTASPE FAKGKPELTL
SRPYAARAME AIGKPFEKAG RFDQQSMVCG QCHVEYYFDG KNKAVKFPWD DGMKVENMEQ
YYDKIAFSDW TNSLSKTPML KAQHPEYETW TAGIHGKNNV TCIDCHMPKV QNAEGKLYTD
HKIGNPFDNF AQTCANCHTQ DKAALQKVVA ERKQSINDLK IKVEDQLVHA HFEAKAALDA
GATEAEMKPI QDDIRHAQWR WDLAIASHGI HMHAPEEGLR MLGTAMDKAA DARTKLARLL
ATKGITHEIQ IPDISTKEKA QQAIGLNMEQ IKAEKQDFIK TVIPQWEEQA RKNGLLSQ