Gene CPR_0904 details


Gene Information

Locus tag: CPR_0904
Symbol:
ID: 4204386
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008262
Strand:
Start bp: 1036071
End bp: 1037390
Gene length: 1320 bp
Protein length: 439 aa
Translation table: 11
GC content: 30%
IMG OID: 642565462
Product: ABC transporter, periplasmic substrate-binding protein, putative
Protein accession: YP_698228
Protein GI: 110801770
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
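
The coordinates and lengths listed above agree with each other if Start bp and End bp are read as 1-based inclusive positions and the last codon of the CDS is a stop codon. The Python sketch below works through that arithmetic. The function names are hypothetical, and the coordinate convention is an assumption that the record itself does not state.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # the stop codon encodes no residue


length = gene_length_bp(1036071, 1037390)
print(length, protein_length_aa(length))  # prints "1320 439"
```

Both values match the Gene Length and Protein Length fields of this record.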


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00221898
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAAGAAAT TTTATGGTTT TGTTTTACTA ATATTTTGTT TTAATCTTTT TGGATGTTCT 
TTTAGAAATA CAGAAGTATC AAATAAAAAT ATTCAAGGAT ATGACAAAGG CGAGGAGCTT
ATAACAATGT GGGTTCATGT TATAGAAGAA ACTTCAGAAG GACAAGCTTA TAAAAATTCA
GTAGAAAGGT TTAATAAAGA ATACAATGGT AAGTATTGTT TAAGTGTTGA ATTTGTACCT
CGTAATGAAA GTGGAGGAGG ATATACTGAT AAAATAAATT CATCAGTAAT TTCTGGAGGA
CTTCCAGACA TAATAACTGT TGATGGGCCC AATGTATCAG CTTATGTTGC AAACAACATA
ATTCAGCCTT TAGTAGGTAT AACTGATGAT GAAAAGGCTA AATATTTACC GTCAGTAATA
GAGCAAGGAA CAATAAACAA TAAATTATAT GCATTAGGTC TAATGGAATC TAGTACGTTA
TTTTATTATA ACAAAGATAT ATTAAACGAA GTAGGAATAC AAGTACCATC ATTTGATAAT
CCATGGACTT GGGACGAATT AAATAAGGTC TGTGAAAAAG TTAAGAACTA TTTAGATAAA
AAAAATGGAT ATCCAATAGA TATGTCATTC CCAGCAGGGG AAACAACTAT TTATTTTTAT
GCACCATTTA TATGGTCAAA TGGTGGAGAT TTTGTAAGCT CTGATGGTTT AAAGGTTAAT
GGAGTATTTA ATTCTGAAAA GAATGTAGAA ACTATTAGTT ATTTTAAAGA AATTACAGAC
AAAGGATATA TACCTAAATA TACAATAAGT GATTTATTTG AAAAGGGAAG GGCTGCATTT
AAATTTGATG GAGCATGGGC TATTACAAAT ATAAGAAATA ACTATCCAGC TTTTAATTTA
GGAATAGCAC CATATCCAGT GGGAAATGAT TGGAATGGAG AAAAGTATAC ACCAACAGGA
GGATGGGCTT TTGCAACAAC TACAACTTGT AAAAACCCTG AGGCTGCAAA AGAAGCAATC
AAGTTTTTAA CTAATGCAGA AAGTGGCATA GATATGTATA ACTTAACAGG TAATTTACCA
TCTACATTTG AAGCTTATGA AAATATTGAT GCATTTAAAA CTGATGAATT ATTTAAAACA
GCATATTATC AGCTTGTTAA CTATGGTCAT CCAAGACCAA AATCACCAGC TTATCCTCAG
ATAAGTACAT CATATCAGCA GGCTATTGAA GGTGTACTAT TAAATGATGA AACACCAGAA
GAATCGTTAT ATAAAACAAT GAGAAGAATA GAAGATAAGT TAATACGTTA TCAAGATTAA
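
A GC content figure like the one in the Gene Information section can be recomputed from the gene sequence. The Python sketch below assumes GC content means the percentage of G and C bases over the whole sequence, which the record does not define. The function name `gc_percent` is hypothetical. Only the first line of the sequence is used as input here, so the printed value describes that 60-base fragment and not the whole gene.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, kept in its 10-base block layout.
first_line = "GTGAAGAAAT TTTATGGTTT TGTTTTACTA ATATTTTGTT TTAATCTTTT TGGATGTTCT"
print(round(gc_percent(first_line), 1))
```

Passing the full 1320 bp sequence to the same function gives a value that can be compared with the reported GC content.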
 
Protein sequence
MKKFYGFVLL IFCFNLFGCS FRNTEVSNKN IQGYDKGEEL ITMWVHVIEE TSEGQAYKNS 
VERFNKEYNG KYCLSVEFVP RNESGGGYTD KINSSVISGG LPDIITVDGP NVSAYVANNI
IQPLVGITDD EKAKYLPSVI EQGTINNKLY ALGLMESSTL FYYNKDILNE VGIQVPSFDN
PWTWDELNKV CEKVKNYLDK KNGYPIDMSF PAGETTIYFY APFIWSNGGD FVSSDGLKVN
GVFNSEKNVE TISYFKEITD KGYIPKYTIS DLFEKGRAAF KFDGAWAITN IRNNYPAFNL
GIAPYPVGND WNGEKYTPTG GWAFATTTTC KNPEAAKEAI KFLTNAESGI DMYNLTGNLP
STFEAYENID AFKTDELFKT AYYQLVNYGH PRPKSPAYPQ ISTSYQQAIE GVLLNDETPE
ESLYKTMRRI EDKLIRYQD
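
The gene sequence begins with GTG, while the protein sequence begins with M. This fits translation table 11, which lists GTG among its initiator codons, and an initiator codon is read as methionine. The Python sketch below illustrates this on the first 60 bases of the gene. The function name `translate_cds` is hypothetical. The codon assignments are those of the standard genetic code, which table 11 shares, and the start codon set is the one NCBI lists for table 11.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiator codons listed for NCBI translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a CDS from its first base and stop at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = "GTGAAGAAAT TTTATGGTTT TGTTTTACTA ATATTTTGTT TTAATCTTTT TGGATGTTCT"
print(translate_cds(first_line))  # prints "MKKFYGFVLLIFCFNLFGCS"
```

The output matches the first 20 residues of the protein sequence listed above.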