Gene CPF_2052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_2052 
SymbolnarA 
ID4201663 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp2290790 
End bp2292868 
Gene Length2079 bp 
Protein Length692 aa 
Translation table11 
GC content32% 
IMG OID638082917 
Productnitrate reductase, catalytic subunit 
Protein accessionYP_696481 
Protein GI110800988 
COG category[R] General function prediction only 
COG ID[COG3383] Uncharacterized anaerobic dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAGAA TACAATCTAC TTGTAACTAT TGTGCTCTTG CGTGTAATTT AGATTTTTAC 
ACAGAGGATG GAAAAATAAA AAGAGTAGTT CCAACTCCAC ATTATCCAGT TAATAAGGGA
TTTTCTTGTA TAAAAGGACT TAATTTAGAT AAACAATGTA CTAAGTTTAA TGGTTCAAAG
AAGCCTTTAC TTAAAATGAA AGATGGAGAA AGAAAAGCTA TAGAATGGAA AGAAGCTTTT
GATTTATTTG CTAGTAAGAT GACAGCTATT CAAGAAAAGT ATGGTAAAGA AAGTGTAGCT
TACATAAGTA CAGGTCAATT ACCAACAGAG GAAATGGCTC TTTTAGGTCA TGTAGGAAGA
AGCTACATGG GTATAAATGG AGATGGTAAT ACAAGACTTT GTATGGCATC AGCAGTTGTA
GCTTATAAAC AAAGCTTTGG ATTTGACGCC CCTCCATATA CTTTAAAAGA TTTAGAACTT
TCAGATACTA TATTTTTTAT TGGAGCAAAT CCAGTTATAG CTCATCCAAT AGCTTGGGGA
AGAGTTAGAA AAAATAAGGA TGCTAAAATA ATTACTATAG ATCCAAGAAA GTCTGAGACA
GCTATGAATT CAGATATGTG GATTGATATA AAAACTAAGG GAGATTTAGC TCTTTTCTAT
ACTTTAGCAA ATGTTCTTAT AGAAAAAGGA TGGATAAACC AAGATTATAT AAATAATTAC
ACAGAGGGCT TTGAAGATTT CAAAGCACAT GTTAAGAAAT ATACATTAGA AGATGTTGAA
GAAAGAACAG GAATCTCTAA GATGAGAGTT CTAGAACTTG CAAAAATAAT CCATGAAGGA
AAGAGAGTTT CATTCTGGTG GACAATGGGA GTTAACCAAA GCTATGAAGC TGTTAGAACT
GCTCAAGCCA TTATAAATCT TGCTTTAATC ACAGGAAATA TGGGAAGAGA AGGAACAGGA
GCTAACTCCT TAACAGGACA ATGTAATGCT ATGGGATCAA GAATGTTTAG TAACACAACT
GCTCTTTATG GTGGTGGAGA ATACAATAAC AAAGAGAGAA GAAAAGTGGT TGCTGATATA
TTAGGCATGG ATGAGAATAT GCTTCCAACT AAGCCAACTT TAGATTATGA GCAAATAATA
AAAGGAATAA ATAAGGGAGA AATCAAAGGA CTATGGGTAG TTTGTACTAA CCCTAGACAT
TCATTTAGTA ACAACGAAGA GTTTAAAAAA GCTATGAAAA ACCTAGATTT CTTTGTAGTT
CAAGATATTT ATGAAGATAC AGATAGTTCT AAAGAATGTG ATTTATATTT ACCTTCAGTT
CCAGCTATTA AAAAAGAAGG TTTCTTAATA AATACTGAGC GTAGACTTTC AGCTTTAGTT
CCTGTTTTAG AGAAGGAAGA AGATGAATTA AGTGATTATG AAATATTATT AGGAATTGGA
GAAGCCTTAG GAATGGGAAG TCTTTTAGAC AAATGGAGAA CTCCAGAGGA TGCCTTTAAG
CTTCTAAGAG AATGTAGTAA AGGAATGCCT TGTGATATTA CAGGAGTATC TTATGAAAGA
TTAAGAGATT CTAAGGGAAT TCAATGGCCT TGTAGAAAAG GAGAAGAATT AGAGTCTGAT
GAAAGAAGAT TATTTGAAGA CGGAAAATAT TATACTCCAA GTGGAAAAGC TAAGTTTATT
TTTGAAGATG TAACTGAAAA TCCAAATGCT ACAAATGAAG AGTTCCCATT TAACTTAAAC
ACTGGTAGAG GAACTGTGGG ACAATGGCAT ACTCATACTA GAACTAGAGA AATACAAGCT
GTAACTAATA TAGTTTCACA AAAGGCATAT GTAGATATAA ATAGAAAAGA TGCAGAAAAG
CTTGATATAA AAGAAAATGA TGAGGTTTTA ATTCATTCAT CAAATGGGCA TACATCTAAG
TTTATAGCAA GATTAACTGA TAATCTTAAA GAAAAAACTT TATATGCACC AATACATTAT
ATAGAAACAA ATTTATTAAC ACCATCTGTA TTTGATCCTT ATTCTAAGGA GCCTTCATAT
AAAACAGTTC AAGTTAATAT AGAAAAGGTT AGAAAATAG
 
Protein sequence
MKRIQSTCNY CALACNLDFY TEDGKIKRVV PTPHYPVNKG FSCIKGLNLD KQCTKFNGSK 
KPLLKMKDGE RKAIEWKEAF DLFASKMTAI QEKYGKESVA YISTGQLPTE EMALLGHVGR
SYMGINGDGN TRLCMASAVV AYKQSFGFDA PPYTLKDLEL SDTIFFIGAN PVIAHPIAWG
RVRKNKDAKI ITIDPRKSET AMNSDMWIDI KTKGDLALFY TLANVLIEKG WINQDYINNY
TEGFEDFKAH VKKYTLEDVE ERTGISKMRV LELAKIIHEG KRVSFWWTMG VNQSYEAVRT
AQAIINLALI TGNMGREGTG ANSLTGQCNA MGSRMFSNTT ALYGGGEYNN KERRKVVADI
LGMDENMLPT KPTLDYEQII KGINKGEIKG LWVVCTNPRH SFSNNEEFKK AMKNLDFFVV
QDIYEDTDSS KECDLYLPSV PAIKKEGFLI NTERRLSALV PVLEKEEDEL SDYEILLGIG
EALGMGSLLD KWRTPEDAFK LLRECSKGMP CDITGVSYER LRDSKGIQWP CRKGEELESD
ERRLFEDGKY YTPSGKAKFI FEDVTENPNA TNEEFPFNLN TGRGTVGQWH THTRTREIQA
VTNIVSQKAY VDINRKDAEK LDIKENDEVL IHSSNGHTSK FIARLTDNLK EKTLYAPIHY
IETNLLTPSV FDPYSKEPSY KTVQVNIEKV RK