Gene CPR_1909 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1909 
Symbol 
ID4204852 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp2108296 
End bp2109648 
Gene Length1353 bp 
Protein Length450 aa 
Translation table11 
GC content31% 
IMG OID642566459 
Productradical SAM domain-containing protein 
Protein accessionYP_699219 
Protein GI110802909 
COG category[R] General function prediction only 
COG ID[COG0641] Arylsulfatase regulator (Fe-S oxidoreductase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00558063 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAGGTG GAGAGTATTA CGTAGTAGAC GTAAACTCAG GAAGTGTTCA TATTGTCGAT 
GACCTTATTT ACAATATGAT AGATAATGAT AACAAATTAA GTAGTAAGGA ATCTTTAATT
GAAAAGTTAA AAGATAAATA TCCAGTTGAG GAAATTGAAG AGGCTTATGA AGATTTACTT
CAATTAGTAG AAGAAGACGC TTTATACTCA GGAGATTTAT ATGAAGAGGT TGCTAAAGAA
AGTGACAAGG CACCTTCATA TATAAAAGCT CTATGCTTAA ATGTTGTACA TGACTGTAAT
TTAAGATGTA AATATTGTTT CGCTGACGAG GGAGAATACA AAGGATGTAG AAAACCAATG
AGTGCTGAAG TTGGTAAAAA AGCTATTGAT TTTGTATTAG CAAATTCAGG TGGAATAAAG
AATATAGAAG TTGATTTATT TGGTGGGGAA CCACTAATGG TATTTGATAC TATAAAAGAA
ATAATAGATT ATGGTAAAAA AAGAGGACAA GAAGTAGGCA AAAATGTAAG ATTTACTATG
ACTACTAATG CAACTCTTTT AAATGATGAA AGAATCGATT ATATAGATAA AAACATCGGA
AATATAATTC TTTCAATAGA TGGAAGAAAA GAAGTTAACG ATGCTGTTAG AATAAGAGTT
GATGGTTCAG GAAGTTATGA TAGAATACTT CCAAACATAA AGAAAATGGT TGAAAAGAGA
GATCCAAGTA AACAATATTA TGCTAGAGGA ACATTTACAA GAAATAATAC TGACTTCTTC
CAAGATGTAA TGGCTCTTGC TAATGAAGGA TTTACTGAGA TTTCAATAGA GCCAGTTGTT
CTTCCAGATG AACATGAATT ATCTTTAAGA AGAGAAGATT TACCAACTAT ATTTGGAGAG
TATGATAAAT TATACAATGA AATGGTGAGA AGATTAGAAG AGGGAGATAA CATATTCAAA
TTCTATCACT TTAACATTGA TTTAAATGGT GGTCCTTGTG TTTATAAGAG AATAGCAGGA
TGTGGAGCAG GTCATGAGTA TATTGCTGTA ACTCCAGATG GAGAAATATA TCCATGTCAC
CAATTCGTTG GAAATACTGA TTTCTTACTA GGAAATATCT ATGATGGAGA AATCAGAAAA
GATCTTTCAA CAGAATTTAA ACAAGCTCAC ATATATAATA AGCCAAAATG TAAAGAATGC
TGGGCTAGAT TCTACTGTAG TGGTGGATGT CAAGCTAATA ACTTTAACTT TAATGGCGAT
ATTCATGTTC CTTATGAAGT TGGATGTGAA ATGCAAAAGA AAAGAATTGA GTGTGCTATT
GCATTAAAGT CAAAGACTAT GGACGCTGAA TAA
 
Protein sequence
MQGGEYYVVD VNSGSVHIVD DLIYNMIDND NKLSSKESLI EKLKDKYPVE EIEEAYEDLL 
QLVEEDALYS GDLYEEVAKE SDKAPSYIKA LCLNVVHDCN LRCKYCFADE GEYKGCRKPM
SAEVGKKAID FVLANSGGIK NIEVDLFGGE PLMVFDTIKE IIDYGKKRGQ EVGKNVRFTM
TTNATLLNDE RIDYIDKNIG NIILSIDGRK EVNDAVRIRV DGSGSYDRIL PNIKKMVEKR
DPSKQYYARG TFTRNNTDFF QDVMALANEG FTEISIEPVV LPDEHELSLR REDLPTIFGE
YDKLYNEMVR RLEEGDNIFK FYHFNIDLNG GPCVYKRIAG CGAGHEYIAV TPDGEIYPCH
QFVGNTDFLL GNIYDGEIRK DLSTEFKQAH IYNKPKCKEC WARFYCSGGC QANNFNFNGD
IHVPYEVGCE MQKKRIECAI ALKSKTMDAE