Gene CPF_0439 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_0439 
Symbol 
ID4203745 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp524854 
End bp526191 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content28% 
IMG OID638081323 
ProductCBS/transporter associated domain-containing protein 
Protein accessionYP_694896 
Protein GI110799963 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACCCGA GTCCCAGTAT TTTACCAAAA ATTATTCTAA TATTAGTTCT TATCTTAATT 
AATGCGTTCT TTGCAGCTGC AGAGATGGCA ATGGTATCTG TAAATAAATC TAAGATAAAG
ATGCTTGCAG AGAAAGGGAA CAAAAAAGCC CTTTTATTAA AAAAGGTTTT AAAATCACCT
GGCAACTTTT TATCTACTAT TCAAATAGGA ATAACATTTG CAGGATTTTT TGCCAGTGCA
TCAGCAGCCA CTAGCATTTC AGAAACTCTA GCGCAATTCA TGTACAAGCT AAATATTCCT
TATGGTAATG AGATATCAGT TATACTTATA ACTGTGCTTT TGTCTTATAT AACTTTAGTT
TTTGGAGAAT TACTTCCAAA GAGAATTGCA TTACAAAAGC CAGAAGAAAT TGCTTTAATG
GCTATAAGAC CAATCAATGT TATTTCTAAA ATATCAACAC CATTTGTAAA GATTCTTTCA
GCTTCAACAA ACTTATTTAT AAAAATATTA GGTTTAAATA AGTCTGAAGA TAAAGAAACT
GTATCTAAGG ATGAAATAAA ATCCATGATA AGTATTGGAC AAGAGAGTGG TGTAATCGAT
AAAACTGAAA AGGATATGTT AGATAATATA TTTGAATTTG ATCATAAAGT TGTTAAAGAA
GTTATGACTC CTAGGGGAGA AGTCTTTGCT ATAAAATCAA CAACTCCAAA TGAAACAATT
GCTAAGAAAC TTATAAGTGA GCAATTTTCA AGAGTTCCTG TTTATAATGA AACTAGGGAT
AATATAGTAG GAATACTTTA TTTAAAAGAC TTCTTTGAAG CCGTTGTAAA GGTTGGAGTA
GATAACATTA AATTAGATCA ATTAATACGT CCAGCTTACT TTGTTCTTGA AAATAAAGCT
ATAGATGATT TATTTAAAGA GCTTCAAGAT AGTAAGCAAC ATATGGCTGT AATAATAGAT
GAATATGGTG GTTTTTCTGG AATTGTTACT ATAGAAGACT TAATTGAAGA AGTTATGGGT
GATATATTAG ATGAGTATGA CGATTCAGAA AACTATATAG ATAAAATAGA TAATAATACC
TATGTAGTTG ATGGTTTATT AACATTAGAT AAGTTAAATG ATTATTTAAA CCTAAATCTT
GAAAGTCAAA ATATAGAGAC TATTGGTGGT TTTGTTGTTA ACTTAATAGG AAATATTCCG
CAAAGTGAAA ATCAAATGGT TGAATATGAC AATCTTTCTT TCCAAGTTTG TAAAACAAAT
AAGAAGAGAA TTGAAAAACT AAAAATTTAT TTAAATAATT CAACTAGTTT CAATTCAGAT
GTTATATTAA ACAATTAA
 
Protein sequence
MDPSPSILPK IILILVLILI NAFFAAAEMA MVSVNKSKIK MLAEKGNKKA LLLKKVLKSP 
GNFLSTIQIG ITFAGFFASA SAATSISETL AQFMYKLNIP YGNEISVILI TVLLSYITLV
FGELLPKRIA LQKPEEIALM AIRPINVISK ISTPFVKILS ASTNLFIKIL GLNKSEDKET
VSKDEIKSMI SIGQESGVID KTEKDMLDNI FEFDHKVVKE VMTPRGEVFA IKSTTPNETI
AKKLISEQFS RVPVYNETRD NIVGILYLKD FFEAVVKVGV DNIKLDQLIR PAYFVLENKA
IDDLFKELQD SKQHMAVIID EYGGFSGIVT IEDLIEEVMG DILDEYDDSE NYIDKIDNNT
YVVDGLLTLD KLNDYLNLNL ESQNIETIGG FVVNLIGNIP QSENQMVEYD NLSFQVCKTN
KKRIEKLKIY LNNSTSFNSD VILNN