Gene CPF_0526 details


Gene Information

Locus tag: CPF_0526
Symbol:
ID: 4203948
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 622525
End bp: 624075
Gene Length: 1551 bp
Protein Length: 516 aa
Translation table: 11
GC content: 31%
IMG OID: 638081408
Product: nitrite/sulfite reductase-like protein
Protein accession: YP_694980
Protein GI: 110800027
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0155] Sulfite reductase, beta subunit (hemoprotein)
TIGRFAM ID:
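
The coordinates and lengths in this record agree with each other, which a little arithmetic can confirm. The sketch below assumes the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 622525
end_bp = 624075

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be a stop codon.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # reported: 1551 bp
print(protein_length, "aa")   # reported: 516 aa
```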


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.574671
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATACAT TAAATGATAT TTTATCAAAA GAAATTGAAG AATTTAGAGA AAAAGGACAT 
AAGTTTTTAA ATAAAGAAAT GACTGTTGGT GAATTTAAAA AGGCATCAGG AGGCATGGGA
GTTTATGCGC ACCGTGGAGG AAAAGAATTC ATGGTGAGAT TAAGAGTTCC ATCAGGAGTT
GCTGGATTAA GTTTATTAGA AGAAGCATAT AACTGGGCTA AGAAATATAA CTTAGAAAAA
ATACATTTAA CAACTCGTGA AGCAATACAA TTACATGGAA TATCAATAGA TGCTGTATGC
GACATAATGC AAGAAGGGTT AACTAAAGGA ATATACACTA GAGGCGGTGG TGGAAACTTC
CCAAGAAACG TAGCCTTAAA TCCTTTATCA GGAGTTCAAA TAGGAGAAAC TTTTGATGTA
ACTCCTTATG CACTAGCATC TAATGCACAT TATATGAGTA AAATATATAC TTATAATTTA
CCAAGAAAAA TTAAAACTGC ATTTTCAAAT ACAGAGGAGG ATTCAGCTCA TTGCACAGTT
ACAGACTTAG GATTTTTAGC TGTGGACAAT AACGGTAAGA GAGCCTTTAA ACTTTACATA
GGTGGAGGAT TAGGAAGAAA TCCAAGAACA TCTGTTGAAT TTGATGAGTT AATAGATGAA
AAGGACGTAT TATATTGTCT TGAAGCTATG ACTAATTTAT TTGTAGCTGA AGGAGATTAT
AAAAATCACG GAAAAGCTCG TATAAGATAT ATTGTTGAGA GAATGGGCGA AGAAGAGTTT
AAAAATTGCT TTAGAAAACA CTTAGAAGAA GTGAAAGCTA AAGGTGGATT AGACCTTAAG
GTTGAAGATA TAAAATATAG TAAAAAAGGT GAAAAAACTA ATATAAAACA TGAAAGATTA
ATACCTCAAA AACAAGAAGG ATTATATACT GTTTTATTCC ACCCAGTTGG AGGACAATTA
AGCTTAGATG TATTAAAAGA ATTAATAGAA AAAATAGAGA AAATAGAAGA AGTAGAAGTA
AGATTAGGTA TGGATGAAAA TCTATATGTT AGAAATTTAA ATGGAAAAGA AGCAGAGGCT
ATCTTAGACT TTACTCAAGG TAAAGGAGGA GAGACTTTAG CTGAAAGAAG TGTTTCTTGT
ATAGGTACAC CAATATGTCA AATGGGAGTA TTAGAAAGCC AAAAAACTTT AAGAGAGTTT
GTAGAATTAT TAAAAGAAAA TTCAATTAAG GAAGGAACAT TACCAAGAGT TCGTTTCTCA
GGATGTCCAA ATTCTTGTGG TATGCATGAA ATAGGAGATA TAGGATTCGC TGGTAAAAAG
AAAAAAGTTT GTGATGCTTT ATCAAATGTA TTTGAATTAC ATTTTGGTGG AAAATTAGGA
ATAGGCGAAA CTCGTTTAGG AGATTACTAT GGAGATATAC TTCAAGACAA GGTTCCAGAA
TTCTTAATTA AGGTTGCAAA GGCTGTAGAA GCTAAAAATT CAACTTTTGA AGAGTGGATT
AAGGATAATA GTGAAGAATT TAAAGCTATA GTTGAAGAAT ATAATGTATA G
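
The gene sequence above is printed in blocks of ten bases separated by spaces. As a rough sketch of how a GC content figure can be derived from text in that layout, the hypothetical helper below strips the whitespace and counts G and C. It is run only on the first row of the sequence, so its result is the GC fraction of those 60 bases, not the GC content reported for the whole gene.

```python
def gc_fraction(blocked_sequence: str) -> float:
    """Return the fraction of G/C bases in a whitespace-blocked DNA string."""
    # Remove spaces and line breaks used for display formatting.
    seq = "".join(blocked_sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = seq.count("G") + seq.count("C")
    return gc_count / len(seq)


# First row of the gene sequence, copied verbatim from the record.
first_row = (
    "ATGAATACAT TAAATGATAT TTTATCAAAA "
    "GAAATTGAAG AATTTAGAGA AAAAGGACAT"
)

print(f"{gc_fraction(first_row):.0%}")
```

Passing the full 1551 bp sequence to the same function would give the gene-wide value for comparison with the reported GC content.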
 
Protein sequence
MNTLNDILSK EIEEFREKGH KFLNKEMTVG EFKKASGGMG VYAHRGGKEF MVRLRVPSGV 
AGLSLLEEAY NWAKKYNLEK IHLTTREAIQ LHGISIDAVC DIMQEGLTKG IYTRGGGGNF
PRNVALNPLS GVQIGETFDV TPYALASNAH YMSKIYTYNL PRKIKTAFSN TEEDSAHCTV
TDLGFLAVDN NGKRAFKLYI GGGLGRNPRT SVEFDELIDE KDVLYCLEAM TNLFVAEGDY
KNHGKARIRY IVERMGEEEF KNCFRKHLEE VKAKGGLDLK VEDIKYSKKG EKTNIKHERL
IPQKQEGLYT VLFHPVGGQL SLDVLKELIE KIEKIEEVEV RLGMDENLYV RNLNGKEAEA
ILDFTQGKGG ETLAERSVSC IGTPICQMGV LESQKTLREF VELLKENSIK EGTLPRVRFS
GCPNSCGMHE IGDIGFAGKK KKVCDALSNV FELHFGGKLG IGETRLGDYY GDILQDKVPE
FLIKVAKAVE AKNSTFEEWI KDNSEEFKAI VEEYNV
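
The relationship between the two sequences can be illustrated with a minimal translation sketch. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ in which codons may act as start codons), and the `translate` helper is a hypothetical name, not part of any database tool. Only the first row of the gene sequence is translated here.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
# Assumption: table 11 shares these amino-acid assignments with table 1.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna: str) -> str:
    """Translate a whitespace-blocked DNA string, stopping at a stop codon."""
    seq = "".join(blocked_dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence, copied verbatim from the record.
first_row = (
    "ATGAATACAT TAAATGATAT TTTATCAAAA "
    "GAAATTGAAG AATTTAGAGA AAAAGGACAT"
)

print(translate(first_row))  # MNTLNDILSKEIEEFREKGH
```

The printed residues match the first two blocks of the protein sequence above (MNTLNDILSK EIEEFREKGH).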