Gene CPR_0476 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_0476 
Symbol 
ID4205493 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp566372 
End bp567655 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content33% 
IMG OID642565033 
ProductABC transporter, substrate-binding protein 
Protein accessionYP_697804 
Protein GI110802208 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAAAGGG GATTGAGAAA TATTATTGCT ATGACATTAG CAACATTCAC AGTAGGAAGT 
TTTTTAACTG CATGTGGAAA TGATAATTTA AAGAGTGGGA ATAGTAGTGA TGGAAATGTT
ATTTTAACAT ATAGTATTTG GGATAAAGGG CAAGAGCCAG CTATGAGAGC TATTGCTGAT
GCTTTTGAAA AAGAGAATCC TAATATAAAA ATAAATGTTG AAATAACTCC ATGGGATCAA
TATTGGATGA AAATGGATGC AGCTGCTCAG GGGGGAGCTT TGCCAGATAT TTTTTGGATG
CATTCTAGCC AAATAAGTAG ATATGGCAGG AATGGGAAGC TTTTAGATTT AACAGATAGA
ATAAAGAATA GTAAAAAAAT TAATTTAAAT AATTATCCAG AAGGATTAGT AGAATTATAT
TCACCAGACA ATAAAAACTA TGCTATTCCT AAGGATTATG ATACAACAGC ACTTTGGTAT
AATAAGGCTT TGTTTGATGA GGCAGGAATT CCTTATCCAG ATGAAACTTG GACTTGGGAT
ACTTTATTAG AAACGGCAAA AAAACTTACT AATAAGGAAA ATAAGATATA TGGATTTGGA
GCACCATTAA ATAGTCATGA GGGTTATTAC AACTTCATAT TTCAAAATGG TGGATATATA
TTATCTGATG ATAAGAAAAA ATGTGGATAT GATGATAAAA AAACTTTAGA AGCACTTAGA
TGGTATATTG ATTTAGGACT TAAGGAAGGA GTATCACCAA CTCAAAAGGA ATTTGATGAG
GTTTCTCCTT TAACTTTATT TGAATCAGGA AAATTAGCAA TGGGAATTTT TGGTTCATGG
AATGTAGCAA ATTTTAAGCA AAATGAATAT GTAAGAGAAA ATTGTGATAT TGCAGTTCTT
CCAATGGGAG AAAAGAGAGC TAGTGTTTAT AATGGATTAG GAAATGCTAT TGGCTATAAT
ACTAAGCATC CTGAAGAAGC ATGGAAATTT GTTGAATTTC TAGGTGGAAA AGAAGCTAAT
ATTTTACAAG CTGAGTATGG AGCAGCAATA CCAGCTTATA AAGGAATTGA TGAGAAATGG
GCTGAAGTAA CTAAGGAATT TAATGCTAGG GCACATGTTG AGATGTTAGA TTATGCAGAG
ATATTACCTT ATTCCAATAG AACTGCTAGA TGGAAAGTAA TTGAGGATGA TCTTTGGCCA
AGAGTTTGGG CTGGACAAGC AAATTTAGAT GAGGTTTCTA AGAGAATGGA CAAAGATATA
GAAGAAGTTT TGGCTGAGGA TTAA
 
Protein sequence
MKRGLRNIIA MTLATFTVGS FLTACGNDNL KSGNSSDGNV ILTYSIWDKG QEPAMRAIAD 
AFEKENPNIK INVEITPWDQ YWMKMDAAAQ GGALPDIFWM HSSQISRYGR NGKLLDLTDR
IKNSKKINLN NYPEGLVELY SPDNKNYAIP KDYDTTALWY NKALFDEAGI PYPDETWTWD
TLLETAKKLT NKENKIYGFG APLNSHEGYY NFIFQNGGYI LSDDKKKCGY DDKKTLEALR
WYIDLGLKEG VSPTQKEFDE VSPLTLFESG KLAMGIFGSW NVANFKQNEY VRENCDIAVL
PMGEKRASVY NGLGNAIGYN TKHPEEAWKF VEFLGGKEAN ILQAEYGAAI PAYKGIDEKW
AEVTKEFNAR AHVEMLDYAE ILPYSNRTAR WKVIEDDLWP RVWAGQANLD EVSKRMDKDI
EEVLAED