Gene CPF_1787 details

Gene Information

Locus tag: CPF_1787
Symbol:
ID: 4203364
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 2014894
End bp: 2015940
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 32%
IMG OID: 638082659
Product: anaerobic sulfite reductase, C subunit
Protein accession: YP_696223
Protein GI: 110800509
COG category: [C] Energy production and conversion
COG ID: [COG2221] Dissimilatory sulfite reductase (desulfoviridin), alpha and beta subunits
TIGRFAM ID: [TIGR02912] sulfite reductase, subunit C
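
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch in Python, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length; the variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 2014894
end_bp = 2015940
gene_length_bp = 1047
protein_length_aa = 348

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not
# counted in the reported protein length.
codons = span_bp // 3
coding_codons = codons - 1

print(span_bp == gene_length_bp)            # True
print(span_bp % 3 == 0)                     # True
print(coding_codons == protein_length_aa)   # True
```

Under these assumptions, 1047 bp corresponds to 349 codons: 348 amino acids plus one stop codon.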


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATCATG ATATAGATAT TAAAAAAGTT AGATTAAATT GTTTTCGTCA ATCAAAGGTT 
CCAGGAGAGT TTATGCTTCA AATGCGTATT CCAGGAGGAA TAGTAGATGC TAAGTATTTA
TCACAAATAC AAGAAATAGC AGAAACTTGG GGAAATGGAA CATTTCATAT GGGGATGAGA
CAAACCTTTA ATATTCCAGG AATTAAATAT GAAAATATTC CAGCAGTAAA TAAATTTATA
GAGAATTATT TACAAGAAGT TGAAGTTGAT AGATGTAATT GTGATATGAA AGTTGATGAA
AATGGATATC CAACAATAGG TGCTAGAAAT GTTATGGCAT GTATCGGAAA TTCACATTGT
ATAAAAGCAA ATGTTGATAC TAAGGATATG GCTAATAAAA TAGAAAAATT AGTATTCCCA
TCACACTATC ATATAAAAGT ATCTGTAGCT GGATGTCCAA ATGACTGTGC TAAAGGGCAT
TTCCAAGACT TTGGTGTTAT AGGACAAGCT AGAATGGAAT ATCACGAAGA AAGATGTATA
GGTTGTGGAG CTTGTGTAAG AGCTTGTGAA CATCATGCTA CAAGAGTTTT AAGTTTAAAT
GATAAAGGAT TAGTTGATAA GGATCCATGT TGTTGCGTTG GATGTGGAGA ATGTGTATTA
GCATGTCCAG CAAGTGCTTG GACTAGAAAG CCAGAAAAAT ACTATAGAAT AGTTATAGGA
GGAAGAACAG GAAAACAAAC TCCTAGAATG GGTAAAACAT TTATAAACTT TGCAACAGAA
GAAGTTGTTC TTGGTATCTT TGCTAACTGG CAAAAATTCT CTGCTTGGGC TTTAGATTAT
AAACCAGAAT ATCTACATGG TGGTCACTTA ATTGATAGAG CGGGATATCA TAAATTTAAA
GAAATAATTT TAGATGGAGT AGAGTTAAAT CCAGAAGCTT TAGTTGCAGA TAATATATTC
TGGGCTGAAA CAGAGTATAG ATCAAACTTT AATGTTAAGC CAATAAAGAT GCATAAAACT
ATAGAATCTA ATAGACCTTT AAGATAA
 
Protein sequence
MNHDIDIKKV RLNCFRQSKV PGEFMLQMRI PGGIVDAKYL SQIQEIAETW GNGTFHMGMR 
QTFNIPGIKY ENIPAVNKFI ENYLQEVEVD RCNCDMKVDE NGYPTIGARN VMACIGNSHC
IKANVDTKDM ANKIEKLVFP SHYHIKVSVA GCPNDCAKGH FQDFGVIGQA RMEYHEERCI
GCGACVRACE HHATRVLSLN DKGLVDKDPC CCVGCGECVL ACPASAWTRK PEKYYRIVIG
GRTGKQTPRM GKTFINFATE EVVLGIFANW QKFSAWALDY KPEYLHGGHL IDRAGYHKFK
EIILDGVELN PEALVADNIF WAETEYRSNF NVKPIKMHKT IESNRPLR
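
The gene and protein sequences above can be compared by translating the DNA and by computing its GC percentage. The following is a minimal sketch in Python, not part of the original record. The helper names `clean`, `translate`, and `gc_percent` are hypothetical. The codon table is built from the standard amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in the permitted start codons, which does not matter here because the gene starts with ATG).

```python
# Standard codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna):
    """Return the percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks and the last line of the gene sequence above.
print(translate("ATGAATCATG ATATAGATAT TAAAAAAGTT"))  # MNHDIDIKKV
print(translate("ATAGAATCTA ATAGACCTTT AAGATAA"))     # IESNRPLR
```

Passing the full gene sequence to `translate` and `gc_percent` allows the result to be compared with the protein sequence and the GC content reported in the record.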