Gene CPR_1104 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1104 
Symbol 
ID4204881 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1248372 
End bp1249775 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content33% 
IMG OID642565660 
Producthypothetical protein 
Protein accessionYP_698426 
Protein GI110801930 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.229225 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTACAT CTGTAATGCT TACAGGATGT GGATCAAGTG ACAAAAAAGC AGAGGGAGGA 
GCAGCAAGTG AGGAACCAGT TAACTTAGTA TGGTATGTTA TAGGTAAACC TCAGACTGAT
GGAGAATTAG TAGAAGAAGA AGTAAATAAA TATATAAAAG ATAAAATAAA TGCTACTGTA
GACATAAAAC ATATTGACTT TGGTGATTAC AGCCAAAAAA TGAATGTAAT AGCTAACTCA
GGAGAAGAAT ATGATTTAGC ATTTACATGT TCATGGGCTT TCCCATACTT AGAAAATGCT
AGAAAAGGTG CTTTCCTTGA ATTAAATGAT TTATTAGATA AAGAGGGAGC AGACCTTAAA
GGGGTTATAG ATGAAAGACT TTGGAAAGGT GCTGAGGTTG ATGGAAAAAT ATATGGAGTT
CCAAACCAAA AAGAAATAGC AGGAGCACCT ATGTGGGTAT TTGATAAGGA GCTTGTTGAA
AAATATGATA TTCCATATCA AGATTTACAC TCAGTAGAAG ATTTAGAACC ATGGTTACAA
ATCATAAAAG AAAAGGAACC AGATTTTGTA CCATTCTATA CTCAAGGGGA TTCAATTCCA
TTAGAATTTG ATGAAATAAT GAGACCTTTA GGAGTATTCT TTAATGATGA TACTTTAACA
GTACAAAATA TGTATGAGAC AGAAGAAATG AAGGCTATGA TGACTAAATT AAGAGAATAC
TATGAAAAAG GATATATAAA TCAAGATGCA GCAGTTAATA ATATGAAAAA TGAAGTTAAG
AGATTTATGT GGAAAGCTGA TGGACAACCA TATGCAGAAA ATGGATGGGG ACAAGCTTTA
GGTAGAGAAG TTGTAACATC ATCAATAATC CCTCCATATG TTACAAATAA TTCAACAACT
GGAGCTATGA CTGCTATATC AGCAACATCT AAGCATCCTG AAAAAGCTAT GGAGCTTATA
AACTTAGTAA ATACTGACTC TACATTAAGA AACCTATTAA TGTTTGGAAT AGAGGGAACT
CACTATGAAA AGGTTAGTGA CAATCAAATA AAGAGAGATC CAAATGGACC ATATAGTGTT
ACAAGTTGGG CTTACGGAAA CTTATTTGAT ACTTACGTTT TAGATAGTGA TCCAGCAGAT
AAATGGGATG CTTTTGAAGA ATTTAACCAA GGTGCTAAGA CTTCACCAAT CTTAGGATTT
AAGTTCAATA CAGAGCCAGT TACAACTCAA ATATCAGCAA TTAATAACGT ATTACAAGAG
TTTGAAAGAA CTTTATACTC AGGTTCAGTA GATCCAGTAA AAGGATTAGA TGACTTAAAT
AAAAAGTTAG CTGCATCTGG ATTAGATGAC ATAAAAGCTG AAATGCAAAA ACAATTAGAT
GAATGGAAAG CTTCTAATAA ATAA
 
Protein sequence
MTTSVMLTGC GSSDKKAEGG AASEEPVNLV WYVIGKPQTD GELVEEEVNK YIKDKINATV 
DIKHIDFGDY SQKMNVIANS GEEYDLAFTC SWAFPYLENA RKGAFLELND LLDKEGADLK
GVIDERLWKG AEVDGKIYGV PNQKEIAGAP MWVFDKELVE KYDIPYQDLH SVEDLEPWLQ
IIKEKEPDFV PFYTQGDSIP LEFDEIMRPL GVFFNDDTLT VQNMYETEEM KAMMTKLREY
YEKGYINQDA AVNNMKNEVK RFMWKADGQP YAENGWGQAL GREVVTSSII PPYVTNNSTT
GAMTAISATS KHPEKAMELI NLVNTDSTLR NLLMFGIEGT HYEKVSDNQI KRDPNGPYSV
TSWAYGNLFD TYVLDSDPAD KWDAFEEFNQ GAKTSPILGF KFNTEPVTTQ ISAINNVLQE
FERTLYSGSV DPVKGLDDLN KKLAASGLDD IKAEMQKQLD EWKASNK