Gene CPF_1113 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1113 
Symbol 
ID4201694 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1270941 
End bp1272200 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content32% 
IMG OID638081994 
Productsugar ABC transporter, sugar-binding protein 
Protein accessionYP_695559 
Protein GI110798641 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0203061 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAAAAA AAATTCTAAG TGCAATTCTA TGTATGATGA TTGGAGCAAC TGCAATGGTA 
GGATGTGGTG ACAAAACTGA TTCTACTCAA GCAAAAGATG GTGGTAAAGT AAAGCTTCGT
ATGACTAACT GGAATAATGA AGATACAATG AAAGATATGT TAAAGTATTT ATCTGAAAAA
CTACCAAATG TGGAAATAGA GTATCAATTT ATTGATAACT CAAACTATAA CACAATAGTA
GATACTCAAT TATCAGCTGA AGAGGGTCCA GATATAATTT GTGAATCTCC AGCATCAGCC
TTAAAACATG CTAAATTAGG ATATCTAGAA AATGTAAACG ATTTGGCTAA GAAGTATTCA
GATTCAGGTA CTAATGTTTA TAAGTACGAT AATAATGTAT ATGCTTTACC AGGAATAAGT
TGGTTTGAAG GAATATATTA TAACGAAAAA TTATTTGAAG AAAATAATAT TCAAATACCT
AAGACTTTTG ATGAATATAT TGAGGTTTGT AAGAAATTCC AAAGCTTAGG AATAAAACCT
TTAGCAGCAG GATTAAAATC ATGGGAACCT TTATTAAAGA ATTCTATGGC CTTTGTAACA
GCAGAGTATT TATCAACAGA TGCTGGTAAA AACTTTGGAC AAGAGTATAG AGAAGGAAAA
GTTAAGTTAG ATGGTACATG GAATATGTAT TTAGATAAAT GGTCAGAAAT GATAAAGGAT
GGAATATACA CTAAAGATAT GACTGGAATA GACCATGATC AAGCTTTAGA GGAATTTGCT
ACTGGAAAAG CTGCAATGTA TTGTTCAGGA CCATGGGATT TAGAAGCAAT TATGTCAAAA
AATCCTGATT TAAAACTTAA TATGATGCCA TTTTACGGAA CAAAACCTAG CGATGGATGG
TTAATAGGTG GACCTGGATG TGGATTTGCA GTAAATTCAA AATCTAAAAA TAAAGATGCA
GCAATGGAAG TATTAAAAGC TATATCTACA GAAGAGGGAC AAAAAGCTTT ATGGGAAAAC
AATCAAGGAG GATCTTCTTA CTTAACTGGA ACCTCATTCA CTCTTCCAGA AGCCTTTAAG
GGAGCTGAAA AAGCTATAAA TGCAGGTCAT ATTTATTGTC CTTGGAATGA ATGGGGAGAT
GCAGGATCAG CACATGTAGA TTACGGTAAA CAAATGCAAA ATTATTTACT TGGAAACCAA
GATCTAAAAA CAACTTTATC AAATGTAGAT TCTGCTGCAA GTGAACTTAT AAATAAATAA
 
Protein sequence
MRKKILSAIL CMMIGATAMV GCGDKTDSTQ AKDGGKVKLR MTNWNNEDTM KDMLKYLSEK 
LPNVEIEYQF IDNSNYNTIV DTQLSAEEGP DIICESPASA LKHAKLGYLE NVNDLAKKYS
DSGTNVYKYD NNVYALPGIS WFEGIYYNEK LFEENNIQIP KTFDEYIEVC KKFQSLGIKP
LAAGLKSWEP LLKNSMAFVT AEYLSTDAGK NFGQEYREGK VKLDGTWNMY LDKWSEMIKD
GIYTKDMTGI DHDQALEEFA TGKAAMYCSG PWDLEAIMSK NPDLKLNMMP FYGTKPSDGW
LIGGPGCGFA VNSKSKNKDA AMEVLKAIST EEGQKALWEN NQGGSSYLTG TSFTLPEAFK
GAEKAINAGH IYCPWNEWGD AGSAHVDYGK QMQNYLLGNQ DLKTTLSNVD SAASELINK