Gene CPR_0540 details

Gene Information

Locus tag: CPR_0540
Symbol:
ID: 4204494
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008262
Strand:
Start bp: 640231
End bp: 641616
Gene Length: 1386 bp
Protein Length: 461 aa
Translation table: 11
GC content: 34%
IMG OID: 642565097
Product: ABC transporter, substrate-binding protein
Protein accession: YP_697868
Protein GI: 110803174
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
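
The coordinate and length fields in this record are mutually consistent, which a few lines of arithmetic can confirm. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 640231
END_BP = 641616


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1386 461
```

Both results match the Gene Length and Protein Length fields listed in the record.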


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAGAA AATTAGTTAA AATGCTAACT GTTGCATGTG TTACAGCTAT AGCTGCCTCT 
GCATTTGTTG GATGTGGAAA TAAAGAAGAG ACACCAAAAG ATAATGAAAA ACCATCAACT
GAACAAGCTG AAGGTTCAGG AGAGAAAAAG GTTTTAGAAA TAGCTGTTTT TGAAGGTGGA
TTTGGTAAGG ATTACTGGGA AGCTTGCATA GAGGCTTTTG AAGCTGAACA TCCTGATGTA
GAAGTTAAGA TGGAAGCTAA TCCTAAAATA GGTGATATTA TAAGGCCTAA ATTATCATCA
GAAAAAACAC CTGATTTTAT ATATTTAAGT ACAAATGATC CATCAGGTAT AGCTAATGCT
TTAATAAAGG ATAAAGCTTT AGTAGATTTA AGTGATGTAT TCGATAGAGA AGATCCAGAT
AATCCAGGAC AAAAATTAAA AGATAAAATA TTACCAGGAT TTTTAGATAC ACCACTTACA
ACTCCATATG GAGATGGAAA AGTATTTTTA GCACCACTTT ACTATAATGT TACAGGTATG
TGGTATAACA AAGCATTATT CAAAGAAAAA GGATGGGAAG TTCCAAAAAC TTGGGATGAG
TTCTTTGAAT TAGGTAAGAA GGCAAAAGAT GAGGGAATAG CTCTTTATAC TTATCAAGGA
CAAGCGCCAG GATACAATGA GGCTGTAATA TTCCCAATGT TAGCTAGTGC AGCTGGAGAA
GAAACTGTAG AAAAGATATT CAACTATGAA GAAGGTGCAT GGAAAGATCC AAATGTTAAA
AAAGCATTAG ATATATTCCA AAGGATGGCT GACGAGGACA TGGTTCTTAA TGGAACTGTT
GGTATGACTC ATACTCAAGC ACAAGTTGAA TTCTTAAATG GAAAAGCATT ATTCTTACCA
TGTGGTAGCT GGTTAGAAGG AGAAATGAAA GATGCTATAC CAGAAGGATT TGAGTTTGGA
TTTATGGCTC CACCAGCATT TAAAGAAGAG GATACTCCAT ATGTAACTAC TACAATAGAG
CAAATGTATA TCCCAGCTAA ATCAGATCAA GTTGAATTAG CAAAAGAATT CTTAGCATTC
CAATATACAG ATGCTATGGT TAAAAAGAAT GCTGAAATAG CTAAGGCTGT AGTTCCAGTT
AAGGGAGCAG TTGAAAAAGC TAAATCTTCA TTAGATGCAT CAGGATATGA GTCTTATAAG
GTTGTTGAAG AAGGTGCTAA ACCAATTCCA CTTTCATTTA AACCAACAAA CTCTAAATTA
GATTTTAGAA ATGATAGTTT ATTTGGACCA GTAGGAAGTA TCATAAATAA AGAATTAACA
GTTGATGAAT GGATTAATAA CTTAGAATCT GATTCACAAA CTCTTGCTAA AGAAGTTGTT
GAATAA
 
Protein sequence
MKRKLVKMLT VACVTAIAAS AFVGCGNKEE TPKDNEKPST EQAEGSGEKK VLEIAVFEGG 
FGKDYWEACI EAFEAEHPDV EVKMEANPKI GDIIRPKLSS EKTPDFIYLS TNDPSGIANA
LIKDKALVDL SDVFDREDPD NPGQKLKDKI LPGFLDTPLT TPYGDGKVFL APLYYNVTGM
WYNKALFKEK GWEVPKTWDE FFELGKKAKD EGIALYTYQG QAPGYNEAVI FPMLASAAGE
ETVEKIFNYE EGAWKDPNVK KALDIFQRMA DEDMVLNGTV GMTHTQAQVE FLNGKALFLP
CGSWLEGEMK DAIPEGFEFG FMAPPAFKEE DTPYVTTTIE QMYIPAKSDQ VELAKEFLAF
QYTDAMVKKN AEIAKAVVPV KGAVEKAKSS LDASGYESYK VVEEGAKPIP LSFKPTNSKL
DFRNDSLFGP VGSIINKELT VDEWINNLES DSQTLAKEVV E
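
The relationship between the gene sequence and the protein sequence can be checked with a small translation routine. The sketch below is a minimal, dependency-free illustration rather than the method used to produce this record. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code for elongation (the tables differ in permitted start codons, which does not matter here because the gene begins with ATG). The helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
# Codon table in NCBI order: bases T, C, A, G at each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]  # KeyError on non-ACGT bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence shown above.
first_line = "ATGAAAAGAA AATTAGTTAA AATGCTAACT GTTGCATGTG TTACAGCTAT AGCTGCCTCT"
print(translate(first_line))  # MKRKLVKMLTVACVTAIAAS
```

The printed peptide matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein and the GC content field.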