Gene CPF_2050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_2050 
SymbolnoxA 
ID4203540 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp2289139 
End bp2290362 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content32% 
IMG OID638082915 
Productnitrate reductase, NADH oxidase subunit 
Protein accessionYP_696479 
Protein GI110800143 
COG category[C] Energy production and conversion 
COG ID[COG1251] NAD(P)H-nitrite reductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGATATA TTGTTGTGGG GGCATCAGCT GCTGGAATAA GTGGAGCAAA AACACTTAGA 
GAACTTGACA AAGATGCAGA AATAATATTA GTTTCAAAGG ACGAAAATGT TTATTCAAGA
TGTATATTAC ATCATTACAT AAGTGGCCAT AGAGATATTG AGGCTTTAGA TTTTACAGAT
AGAGATTTCT TTGAAAAATA CAACATAGAG TGGAAAAAAG GCTTAGAAGT TAAAGCTATA
GATGATAGAG AACATGTAAT AGTTCTTTCA AATGGAGAAA GCTTAAAATA TGATAAGATA
TTATTAGCAA CAGGAGCATC AGCATTCATT CCTCCAGTAG AGAATTTAAG AGAAGCTAAA
AATGTTGTTG GATTAAGAAA CTTAGAGGAT GCTATTAAAA TTAAAGAAGA GGCAGAAAAG
GTTAAAAACG TGGTTGTTTT AGGAGCAGGA CTTGTTGGCA TAGACGCCAT AGCAGGACTT
GCATTTAAAG ATTTAAATGT TACTTTAGTT GAAATGGGGG ACAGAGTCCT TCCAATTCAA
CTTGATAAAT ATGCTTCTTC TAAATACGAG AAGAGATTTG AAGATGCTGG AGTTAAATTA
AAACTTGGAG TTAGAGCGGA AAAAGTTTTA ATTGATGAAA ATAAAAATCC AAAGGCATTA
CTTATAAATA CAGGAGAAGA AATTCCTTGC GAGCTTATAA TAGTTGCAAC TGGAGTTAGA
TCAAATGTAG CATTCTTAAA AGATAGCTCT ATAGAAACAG ATAGATTTGG ATTAATAATA
AATGAAAAAG GCGAAACTAA TGCAAGAGAT GTCTATGGAG CTGGAGATAT CACTGGAAGA
AATCCTATAT GGCCAACTGC TGTAAAAGAA GGTATAATAG CAGCAAACAA TATGGTTGGT
AATGAAATAT TCATGGAAGA TTTCTTTGGA AGTAAGAATA CAATGAATTT CTTAGGACTT
ACAACAATGT CTTTAGGAGT TGTTAATGCT CCAGATGATT CTTACACAGA AGAAATTGAT
ATTTCAGGAG AGAATTATAA AAAGATAATC CATAAGGATG GAAAAATTTA TGGAGCTATA
ATTCAAGGCG ATTTATCTTA TGCAGGAGTT TTAACTCAAC TTATAAAAGA GAAGATACAC
GTATCAAAGG TTAAAAAGCC ATTATTTGAA ATTGACTATG CAGATTTCTT CAATATAAAA
GAAAATTTAG AGTACACATA TTAA
 
Protein sequence
MRYIVVGASA AGISGAKTLR ELDKDAEIIL VSKDENVYSR CILHHYISGH RDIEALDFTD 
RDFFEKYNIE WKKGLEVKAI DDREHVIVLS NGESLKYDKI LLATGASAFI PPVENLREAK
NVVGLRNLED AIKIKEEAEK VKNVVVLGAG LVGIDAIAGL AFKDLNVTLV EMGDRVLPIQ
LDKYASSKYE KRFEDAGVKL KLGVRAEKVL IDENKNPKAL LINTGEEIPC ELIIVATGVR
SNVAFLKDSS IETDRFGLII NEKGETNARD VYGAGDITGR NPIWPTAVKE GIIAANNMVG
NEIFMEDFFG SKNTMNFLGL TTMSLGVVNA PDDSYTEEID ISGENYKKII HKDGKIYGAI
IQGDLSYAGV LTQLIKEKIH VSKVKKPLFE IDYADFFNIK ENLEYTY