Gene CPF_1658 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1658 
Symbol 
ID4203421 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1875232 
End bp1876383 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content30% 
IMG OID638082535 
Producthypothetical protein 
Protein accessionYP_696099 
Protein GI110798886 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.2932 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAGAA AACCAAAAAT ATTATTAGTG ACTTCTTTAT CCCTAATAGT ATTATTAACC 
TTATCCATAT ATGTATCCTT AAACAAAAAG AAAACTTCAG CATTTTCAGA AGCCATAAAT
TTATTAAAAG AACCTGACAA AAAAGAAGAT GATGAATTAA AGGGAAAATT TGAAGAAGTA
CTTCAAGACT TATTTAAAAA TAGAAATATA GCCATATTAA ACAATGATTT AGAGGAATTA
AAGAAATTTT ATGATTTAGA AAAAAAGCCT AGTCTTTGGG CCTATGAAAG TGAAAGTAAA
AAAGTTAAGT ATTTAAACAA CTGGTCTCAA AAACAAGGAG TTGTATTTAA TGAAATAAAA
TCAAAAACTG AAATAAGAAA GGCTAGAGAA AGAGAAAAGG ACTTATACGG AATAATATGT
GTTGTTTCAA GTGAATTTAC ATATTATTAT CTTAATGATC CTCTTAAAAC TAATACCTTT
AGATTAGGTA CTTATCACTA TTTAAATTTA AAAGATGAGG GAGATAGGTA TATTATCACT
AAGGAATGGT ACACTGATCC TTTTGCTGAT TCTCTAGATT TAAATAATAT AAAATCTGAT
GAAATTAAAT CCTATATTTT AAATAGTTCT AGTCCATCTT ATTCACCTGA TGAAAGAACA
CAGAAAGCTA TAGATTATGC ACATACCTAT TGTGGAGCAG CTGCAGATGA TGAACTTGGT
TTTAACTATA ATAAAAAATA CACAGACTTT AACCCTCAAG GAGGAGACTG TGCAAACTTC
GCCTCTCAAA TTCTTTTTGA AGGTGGTGGA TTTAAGAAAA ATTCAACATG GAACTATTCT
GATGGTGAAG GTTCTAAGGC TTGGGTAAAT GCTCAAGCAT TTAAAAATTA CATGGTTAAT
AGTGGACGTG CTTCCTATAT TGCTAAGGGT AAATATTCCG AAATATATAA AGCGGCCTAT
AACTTAAGAC CTGGTGATTT TGTAGCTTAT GAAAAAAATG GACGAATAAC TCACATTTCA
ACAGTTACAG GATTAGATAG TAAAGGTTAT CCCCTAGTAA CTTGTCACAA CACAGATAGA
CTTCTTGTTC CTTTTGATTT AGGTTGGAGC AATGACAATA TACGCTTTCA TCTAGTAGAT
GTTTATTATT GA
 
Protein sequence
MKRKPKILLV TSLSLIVLLT LSIYVSLNKK KTSAFSEAIN LLKEPDKKED DELKGKFEEV 
LQDLFKNRNI AILNNDLEEL KKFYDLEKKP SLWAYESESK KVKYLNNWSQ KQGVVFNEIK
SKTEIRKARE REKDLYGIIC VVSSEFTYYY LNDPLKTNTF RLGTYHYLNL KDEGDRYIIT
KEWYTDPFAD SLDLNNIKSD EIKSYILNSS SPSYSPDERT QKAIDYAHTY CGAAADDELG
FNYNKKYTDF NPQGGDCANF ASQILFEGGG FKKNSTWNYS DGEGSKAWVN AQAFKNYMVN
SGRASYIAKG KYSEIYKAAY NLRPGDFVAY EKNGRITHIS TVTGLDSKGY PLVTCHNTDR
LLVPFDLGWS NDNIRFHLVD VYY