Gene CPR_2338 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_2338 
Symbol 
ID4204429 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp2564446 
End bp2565672 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content33% 
IMG OID642566888 
Productmaltose-binding periplasmic protein precursor 
Protein accessionYP_699603 
Protein GI110802374 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2182] Maltose-binding periplasmic proteins/domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAAAAC GTACAAAAAT TCTAGCAACT GTAATGGCTG CAACAATGTT ATTTGCAGGA 
TCATTAGTAG GATGTGGAGA GAAAGCTGAT TCAGGTAGTA CTGATGGTTC AGGAAAAGAA
CTTACAGTAT GGTCACATTT AACTACTCCA GAAGTAGAAG AATTAAATAA AATAGCTTCA
AAATGGGGAG AAGAAAATGG AGTTAAAGTT AAGGTTGTAG AAGACAAATC AGAAATGCAA
GCTTACATAC AAGCTGCAAA TAGTTCAAAG GGACCAGATA TAATGTTTGG ACTAGCTCAT
GATAACTTAG GAACATTCCA AAAGGCTGGA CTTTTAGCAG AGGTTCCAGA AGGAATTATA
AATGATTCTG ATTATGCATC ACATCAAGTT TTAGATGCTG TAACAATAGG TGGAAAAAGA
TATGCAGTTC CAATAGCACA AGAAACATCA GCTCTTTTCT ATAATAAGGA TAAAGTTAAA
GAAGTTCCAA AAACCATGGA AGATTTAGTT AAGGTTGCAA AAGATGGAGT TGGATTTGAA
TATGATATTA ATAATTTCTA TCCTACTTAT GGATTTATAG CAGCAGATGG AGGATATGTT
TATAAAGATA ATAATGGAAC TCTTGACCCA ACTGATATCG GATTAGGAAC ACCAGGAGCT
ATAAAAGGAT ATCAATTTGT TCAAGATTTA GTTCAAAAAG ATAAATTAAT GCCAGCTGAT
ATAACTGGAG ATATAGCAAA GGGAGATTTC TTATCTAAGA AATCAGGATT TTATATTTCA
GGACCTTGGG ATATATCAGC ATTTAAAGAT GGAGGAGTAA ATTTTGGAGT AGCTCCAATG
CCAACATTAT TTGAAAAACA AGTACCAACT TTCTTAGGTG TTCAAACTGC TTTCGTAAGT
GAAAAATCTC AGAATAAAGA TTTAGCATGG AAATTAGTTA AATACTTATC AGAAAATTCA
GGTGATGTAT TGTTAAGTAA AGGTAACAGA ATTCCAGTAT TAAATAAATA CTTAGATAGT
GCAGATTTCA AAAATAATGA GTATATGAGT GCTTTCGCAG AACAAGCTAA ATTCGCTACA
CCAATGCCTA ATATACCAGA AATTCAAGCT ATGTGGGGAC CTGCTGGAGC TAACTTACAA
TTATTAACTT CAGGACAAGT TACACCAGAA AAATGTGCTG AAATGACAGT AGAACAAATT
AAACAAGGTA TATCTCAACA AAAATAA
 
Protein sequence
MGKRTKILAT VMAATMLFAG SLVGCGEKAD SGSTDGSGKE LTVWSHLTTP EVEELNKIAS 
KWGEENGVKV KVVEDKSEMQ AYIQAANSSK GPDIMFGLAH DNLGTFQKAG LLAEVPEGII
NDSDYASHQV LDAVTIGGKR YAVPIAQETS ALFYNKDKVK EVPKTMEDLV KVAKDGVGFE
YDINNFYPTY GFIAADGGYV YKDNNGTLDP TDIGLGTPGA IKGYQFVQDL VQKDKLMPAD
ITGDIAKGDF LSKKSGFYIS GPWDISAFKD GGVNFGVAPM PTLFEKQVPT FLGVQTAFVS
EKSQNKDLAW KLVKYLSENS GDVLLSKGNR IPVLNKYLDS ADFKNNEYMS AFAEQAKFAT
PMPNIPEIQA MWGPAGANLQ LLTSGQVTPE KCAEMTVEQI KQGISQQK