Gene CPR_1618 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1618 
Symbol 
ID4204381 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1808961 
End bp1810247 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content26% 
IMG OID642566169 
Producthypothetical protein 
Protein accessionYP_698934 
Protein GI110801765 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2133] Glucose/sorbosone dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0613735 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAATT TTATAAAGTT TATAAGTTTA ATTTTAATTA GTGCATTACT AGGAAGTATA 
ATATGTTTTT TTACTAATAA AGATGGTAAA TATTTAGAAA GTGAAAGTTT TAGTGAAGAG
TTAATACTAA AGGGATTAAA GGGAGCAAGA GCCTTATGTA ATGATGAAGA GAATATTTAT
ATAGCATTAG AAAATAAGAT TTTGAAAATA GATAGAAATA ATAATGTGTT TTTGGAATTA
AAAGAAGAGG GAAATATATA TGATTTAGAA TATTATAATA AATTTTTATA TTACACTTTA
GATGAAAAGT TAGTATCTTA TAACATAAAA AGTTCAGAAA GAGAAGTTTT AATTGAGGAT
ATTCCTAATA AGGGGATAAA TAAAGAGGAT ACTAGGATTT TAATAAATGA TGGAAAGCTT
TATTTAACCA TAGGAACTTC TACAAATTCT GGAATAGTAG ATAAAGAGGG AGAAAATCCT
GATATTCCTC CAGTAGATAT TGTTTTAAGT GGAAGAAATT ATGATGAAAA TAAAAAGGGA
GCTTTTGTGC CATATAATAC TAAGACAGTA AAGGGAGAAA AGGTAAAAGG AAATATATTA
GGTAATGGAG CTATTATAGA ATTTGATATA GAGAGCAAGA AAAAGCAATT ATATTCCTAT
GGAATAAGAA ATGTTAAAGG TTTTGATTTA AATAGTTCAG GAGAGATTTT TGCAGTTGTT
GGAGGAATGG AAGATGAGGG GGTTAGACCT TTAAGTGGAG ATTCAGATTA TATATATAAA
ATAGAAGGAA AGGGAACTTG GTATGGTTGG CCAGATTATA GTGGAGGAGA TCCTGTTAAT
TCTCCAAGAT TTAGAGAAGA AGGAAAACCT ATAATTAACT TTGTAACAGA TGCACATAAA
AGCTATGTAA TGCCTAAACC ACTATACCAA AGTGAAGACA CTAGAAATAT AAATACATTA
TTAATAGATA AAAAAGGAAT AATTCTAGAA GATGAGAATT CATTTTTATT CTTTAACAAC
AAAAACAATA CTCTTTTAAA ATTATTAAAG GAGGGAGAAG TTAAAGAGTT AATATATTTA
GATAAAAATT CATATATAAA TGATATGAAA ATAATAGGGA AGAATCTTTA TATACTAGAT
GGAAATAAGG GGGTACTTTT TAGACTAGAA AAAAGTAACA CTATAAATAA CATACCTATT
TATAATTACT TTGTTATATT AGGAATAAAC TTTATATTAA TTGGAGTTTT AGGTATTAAG
TTCCTACTAT CCTTAAAGAA AAAATAA
 
Protein sequence
MKNFIKFISL ILISALLGSI ICFFTNKDGK YLESESFSEE LILKGLKGAR ALCNDEENIY 
IALENKILKI DRNNNVFLEL KEEGNIYDLE YYNKFLYYTL DEKLVSYNIK SSEREVLIED
IPNKGINKED TRILINDGKL YLTIGTSTNS GIVDKEGENP DIPPVDIVLS GRNYDENKKG
AFVPYNTKTV KGEKVKGNIL GNGAIIEFDI ESKKKQLYSY GIRNVKGFDL NSSGEIFAVV
GGMEDEGVRP LSGDSDYIYK IEGKGTWYGW PDYSGGDPVN SPRFREEGKP IINFVTDAHK
SYVMPKPLYQ SEDTRNINTL LIDKKGIILE DENSFLFFNN KNNTLLKLLK EGEVKELIYL
DKNSYINDMK IIGKNLYILD GNKGVLFRLE KSNTINNIPI YNYFVILGIN FILIGVLGIK
FLLSLKKK