Gene CPF_1548 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1548 
Symbol 
ID4202288 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1766395 
End bp1767459 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content35% 
IMG OID638082426 
Productputative galactoside ABC transporter, galactoside-binding protein 
Protein accessionYP_695991 
Protein GI110800969 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00850993 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGA AAGGTTTAGC ACTAATTTTA ATTTCAGCTT TAACTATGGG TACTTTAGTT 
GGTTGTGGTG GAGGTTCAGG CTCTACTGGA TCATCTGGTG ATACACCAAA AGAAAACGAT
TCACCTAAGA TAGGAGCTAC AATTTATAGT TTTGAAGATA ACTTCATGTC ATACCAACGT
AGAAATATAG AAAAACTTTG TAATGGAAAA GCCGAGTTAT TAATGAATGA CTCTCAAAAT
AATCAATCTA AGCAAATAGA ACAAGTAGAC ACTATGATAG CTAAGGGTGT TGATATTCTT
GCAATAAACT TAGTTGACCC TAAATCTGCT CCTACTGTAG TAGATAAAGC TAAGGCAGAC
AACTTACCAG TAGTATTCTT TAATAAGGAA CCAGACGAAG CTGTTATGCA AAGCTACGAC
AAGGTTTGGT ATGTTGGTAC AACTTCTGAA GAATCAGGAA TAATCCAAGG GGAAGTAATG
GCAGAAGGTT GGAAAGCTAA TCCTGCTTGG GACAAAAATG GTGATGGAAA AATACAATAT
GTAATGCTTA AAGGAGAACC TGGTCACCCT GATGCAGAAG CTCGTACAAA ATATTCTATT
GAAACAATAA ACAAGGCTGG TATAGAAACT GAAGAGTTAG CAATGGATAC AGCTATGTGG
GACTCAACTA AAGCTACTGA AAAAATGGAT GCTTGGATTG CTAAAAATGG TGACAATATA
GAAATGGTAA TCTGTAATAA TGATGGAATG GCTTTAGGTG CTATTTCTTC TCTTGAAAAA
GCAGGATATT TAGATGGAAC TCCTGAAAAG TTTGTTCCAA TATATGGTGT TGATGCTATT
CCTGAAGCTT TAGATAAAAT CAAAGCTGGT AAAATGGCTG GTACAGTATT AAACGATGCT
AAGAACCAAG CTCAAGCTCT AGTGGATTCT TGTATGAATT TAGTAAATGG AAAAGAGATA
ACTGAAGGAA CTAATTGGAA ACTTGATGAT AAAAAATCAA TCCGTGTTCC ATATGTAGGA
ATAACTAAGG ACAATATAAA CGTTGCTGAG GATTCATATA AATAA
 
Protein sequence
MKKKGLALIL ISALTMGTLV GCGGGSGSTG SSGDTPKEND SPKIGATIYS FEDNFMSYQR 
RNIEKLCNGK AELLMNDSQN NQSKQIEQVD TMIAKGVDIL AINLVDPKSA PTVVDKAKAD
NLPVVFFNKE PDEAVMQSYD KVWYVGTTSE ESGIIQGEVM AEGWKANPAW DKNGDGKIQY
VMLKGEPGHP DAEARTKYSI ETINKAGIET EELAMDTAMW DSTKATEKMD AWIAKNGDNI
EMVICNNDGM ALGAISSLEK AGYLDGTPEK FVPIYGVDAI PEALDKIKAG KMAGTVLNDA
KNQAQALVDS CMNLVNGKEI TEGTNWKLDD KKSIRVPYVG ITKDNINVAE DSYK