Gene CPR_0907 details

Gene Information

Locus tag: CPR_0907
Symbol: msmK
ID: 4205441
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008262
Strand:
Start bp: 1039170
End bp: 1040297
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 31%
IMG OID: 642565465
Product: sugar ABC transporter, ATP-binding protein
Protein accession: YP_698231
Protein GI: 110802176
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3839] ABC-type sugar transport systems, ATPase components
TIGRFAM ID:
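
The start position, end position, gene length, and protein length listed in this record are mutually consistent. The sketch below reproduces the arithmetic in Python. It assumes the coordinates are 1-based and inclusive and that the coding sequence ends with a single untranslated stop codon; the record does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the gene record.
start_bp = 1039170
end_bp = 1040297

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1128 375
```

Both values agree with the Gene Length and Protein Length fields in the record.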


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.0858146
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGTTTTA TAGAATTTAA GAATGTAGAA AAACAATATA AGAATGCAAC AAAGAAGTCA 
GTTACTGATT TTAATTTATC CATAGATGAA AAAGAATTTA TAGTATTTGT AGGACCATCA
GGATGTGGTA AATCAACAAC TTTAAGAATG CTTGCAGGTT TTGAAGAAAT AACTGGAGGA
ACTATTTCAA TAGATGGAAA TATAGTCAAT AATACGCCGC CAAGAGAACG TGGAATATCT
ATGGTGTTCC AAAACTATGC ATTATATCCT CATATGACAG TAGAAGATAA TATAGCTTTT
GGATTAAAGA ATATTAAAAC TCCAAAAGAT GAAATAAAGA AAAAAGTAAA CTGGGCAATT
GAGATTTTGG GTTTAGAAGA ATACAGAAAG CGTAAGCCTA AGAATTTATC TGGAGGACAA
CGTCAAAGGG TTGCACTTGG AAGAGCAATA GTACGTAATC AAAAAGTATT CTTAATGGAC
GAGCCTTTAA GTAATTTAGA TGCTAAATTA CGTGTCAGTA TGCGTAATGA GATAAGTAAA
TTGCATAGAG AACTTGGAAG TACTACAATT TATGTTACCC ATGATCAGGT TGAAGCTATG
ACTATGGCAG ATAGAATTGT TGTTATGAAA GATGGAATAA TACAACAAAT AGGAACACCT
ATGGACTTAT ATGACAATCC TAGAAACAAA TTTGTTGGAA GCTTCATAGG CTCACCACAA
ATGAACTTTC TTAATGTTGA AGTTAAAGGA AATAAAGCTA TATTAGAAAA TGGAAGCAAA
ATAACGCTTC CAGAAGGAAT ATTAAAAAGA ATGAACAACA GACAAGGCAA ATTATGTATG
GGATTTAGAG CTGAAGATAT AAAGCTTGAT AATCTAAATA TTGGATTATT TGAAGACAGT
ATTATTACTT CAGCTATAGA AAATACAGAA ATCATGGGAA ATGAAAATAA CTTGTATTTT
AAAATAGGAA ACACTACAGC AGTAGCAAGA GTAGGAAAAG AAGACGTAAA GGAAATTGGA
GAGCAATTCA AATTTGTAAT CAATGTAAAT AAAGTTCATT TCTTTGACTT GGATACTGAA
GAAAATATAC TAAACTTAGG AAATACCCTA ACTTTAGATT ATAATTAA
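
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks have to be stripped before the sequence can be measured or analyzed. The following Python sketch shows one way to do that and to compute a GC percentage. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used here as a short demonstration, so the printed GC value need not match the whole-gene figure in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 if seq is empty)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied as printed (six blocks of ten bases).
first_line = "ATGGGTTTTA TAGAATTTAA GAATGTAGAA AAACAATATA AGAATGCAAC AAAGAAGTCA"

seq = clean_sequence(first_line)
print(len(seq))                         # number of bases after cleaning
print(round(gc_fraction(seq) * 100))    # GC content of this fragment, in percent
```

Applying the same two functions to the complete sequence gives values that can be compared with the Gene Length and GC content fields.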
 
Protein sequence
MGFIEFKNVE KQYKNATKKS VTDFNLSIDE KEFIVFVGPS GCGKSTTLRM LAGFEEITGG 
TISIDGNIVN NTPPRERGIS MVFQNYALYP HMTVEDNIAF GLKNIKTPKD EIKKKVNWAI
EILGLEEYRK RKPKNLSGGQ RQRVALGRAI VRNQKVFLMD EPLSNLDAKL RVSMRNEISK
LHRELGSTTI YVTHDQVEAM TMADRIVVMK DGIIQQIGTP MDLYDNPRNK FVGSFIGSPQ
MNFLNVEVKG NKAILENGSK ITLPEGILKR MNNRQGKLCM GFRAEDIKLD NLNIGLFEDS
IITSAIENTE IMGNENNLYF KIGNTTAVAR VGKEDVKEIG EQFKFVINVN KVHFFDLDTE
ENILNLGNTL TLDYN
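
The protein sequence can be cross-checked against the gene sequence by translating codon by codon. The Python sketch below is a minimal illustration, not the tool used to generate this record. It assumes that the amino-acid assignments of translation table 11 are the same as those of the standard genetic code (table 11 differs in which start codons it permits), and the names `CODON_TABLE` and `translate` are hypothetical.

```python
# Codon table built in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        # Codons containing unexpected characters are reported as "X".
        amino_acid = CODON_TABLE.get(seq[i:i + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence, with the block spacing removed.
dna = "".join(
    "ATGGGTTTTA TAGAATTTAA GAATGTAGAA AAACAATATA AGAATGCAAC AAAGAAGTCA".split()
)
print(translate(dna))  # prints: MGFIEFKNVEKQYKNATKKS
```

The output matches the first twenty residues of the protein sequence (MGFIEFKNVE KQYKNATKKS).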