Gene CPF_0556 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_0556 
Symbol 
ID4203679 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp664125 
End bp665510 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content34% 
IMG OID638081438 
ProductABC transporter, solute-binding protein 
Protein accessionYP_695010 
Protein GI110800185 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.526712 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAGAA AATTAGTTAA AATGCTAACT GTTGCATGTG TTACCGCTAT AGCTGCCTCT 
GCATTTGTTG GATGTGGAAA TAAAGAAGAG GCACCAAAAG ATAATGAAAA GCCATCAACT
GAACAAGCTG AAGGTTCAGG AGAGAAAAAG GTTTTAGAAA TAGCTGTTTT TGAAGGTGGA
TTTGGTAAGG ATTACTGGGA TGCTTGTATA GATGCCTTTG AAGCTGAGCA CCCAGATGTA
GAAGTTAAGA TGGAAGCTAA TCCTAAAATA GGTGATATTA TAAGACCTAA ATTATCATCA
GAAAATACAC CTGACTTTAT ATACTTAAGT ACAAATGACC AATCAGGTAT AGCTAATGCT
TTAATAAAGG ATAAAGCTTT AGTAGATTTA AGTGATGTAT TTGATAGAGA AGATCCAGAT
AATCCAGGAC AAAAATTAAA AGATAAAATA TTACCAGGAT TTTTAGATAC ACCACTTACA
ACTCCATATG GAGATGGAAA AGTATTCTTA GCACCACTTT ACTATAATGT TACAGGTATG
TGGTATAACA AAGCATTATT CAAAGAAAAA GGATGGGAAG TTCCAAAAAC TTGGGATGAG
TTCTTTGAAT TAGGTAAGAA GGCAAAAGAT GAGGGAATAG CTCTTTATAC TTATCAAGGA
CAAGCGCCAG GATATAACGA GGCTGTAATA TTCCCAATGT TAGCTAGTGC AGCTGGAGAA
GAAGCTGTAG AAAAGATATT CAACTATGAA GAAGGTGCTT GGAAAGATCC AAATGTTAAA
AAAGCATTAG ATGTATTCCA AAGAATGGCT GATGAGGACA TGGTTCTTAA TGGAACAGTT
GGTATGACTC ATACTCAAGC ACAAGTTGAA TTCTTAAATG GAAAAGCATT ATTCTTACCA
TGTGGTAGCT GGTTAGAAGG AGAAATGAAA GATGCTATAC CAGAAGGATT TGAGTTTGGA
TTTATGGCTC CACCAGCATT TAAAGAAGGG GATACTCCAT ATGTAACTAC TACAATAGAG
CAAATGTATA TCCCAGCTAA ATCAGATCAA GTTGAATTAG CAAAAGAATT CTTAGCATTC
CAATATACAG ATGCTATGGT TCAAAAGAAT GCTGAAATAG CTAAGGCTGT AGTTCCAGTT
AAGGGAGCAG TTGAAAAAGC TAAATCTTCA TTAGATGCAT CAGGATATGA GTCTTATAAG
TTTGTTGAAG AAGGTGCTAA ACCAATTCCA CTTTCATTTA AACCAACAAA CTCTAAACTA
GATTTTAGAA ATGATAGTTT ATTTGGACCA GTAGGAAGTA TTATAAATAA AGAATTAACA
GTTGATGAAT GGATTAATAA CTTAGAATCT GATTCACAAA CTCTTGCTAA AGAAGTTGTT
GAATAA
 
Protein sequence
MKRKLVKMLT VACVTAIAAS AFVGCGNKEE APKDNEKPST EQAEGSGEKK VLEIAVFEGG 
FGKDYWDACI DAFEAEHPDV EVKMEANPKI GDIIRPKLSS ENTPDFIYLS TNDQSGIANA
LIKDKALVDL SDVFDREDPD NPGQKLKDKI LPGFLDTPLT TPYGDGKVFL APLYYNVTGM
WYNKALFKEK GWEVPKTWDE FFELGKKAKD EGIALYTYQG QAPGYNEAVI FPMLASAAGE
EAVEKIFNYE EGAWKDPNVK KALDVFQRMA DEDMVLNGTV GMTHTQAQVE FLNGKALFLP
CGSWLEGEMK DAIPEGFEFG FMAPPAFKEG DTPYVTTTIE QMYIPAKSDQ VELAKEFLAF
QYTDAMVQKN AEIAKAVVPV KGAVEKAKSS LDASGYESYK FVEEGAKPIP LSFKPTNSKL
DFRNDSLFGP VGSIINKELT VDEWINNLES DSQTLAKEVV E