Gene CPR_1397 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1397 
Symbol 
ID4205593 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1570345 
End bp1571496 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content29% 
IMG OID642565951 
Producthypothetical protein 
Protein accessionYP_698716 
Protein GI110803353 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.240429 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAGAA AACCCAAAAT ATTATTAGTG ACTTCTTTAT CCCTAATAGT ATTATTAACC 
TTATCCATAT ATGTATCCTT AAACAAAAAG AAAACTTCAG CATTTTCAGA AGTCATAAAT
TTATTAAATG AACCTCATCA AAAAGAATAT GATGAATTAA AAGGAAAATT TGAAAAAGTA
CTTCAAGACT TATTTAAAAA TAGAAATATA GCCATATTGA ACAATGATTT AGAGGAATTA
AAGAAATTTT ATGATTTACA AAAAAAGCCT AGTCTTTGGG CCTATGAAAG TGAAAGTAAA
AAAGTTAAGT ATTTAAACAA CTGGTCTCAA AAACAAGGAG TTGTATTTAA TGAAATAAAA
TCAAAAATTG AAATAAGAAA GGCTAGAGAA AGAGAAAAGG ACTTATACGG AATAATATGT
GTTGTTTCAA GTGAATTTAC ATATTATTAT CTTAATGAAC CACTTAAAAC TAATACCTTT
AGATTAGGTA CTTATCACTA TTTAAATTTA AAAGATGAGG GAGATAGGTG TATTATCACT
AAAGAATGGT ACACCGATCC TTTTGCTGAT TCTCTAGATT TAAATAATAT AAAATCTGAT
GAAATTAAAT CATATATTTT AAATAGTTCT AGTCCAGATT ATTCACCTGA TGAAAGAACA
CAAAAAGCTA TAGATTATGC ACACACCTAT TGTGGAGCAG CTGCAGATAA TGAACTTGGT
TTTAACTATA ATAAAAAATA CACAGACTTT AACCCTCAAG GAGGAGACTG TGCAAACTTC
GCCTCTCAAA TTCTTTTTGA AGGTGGTGGC TTTAAGAAAA ATTCAACATG GAACTATTCT
GATGGTGAAG GTTCTAAGGC TTGGGTAAAT GCTCAAGCAT TTAAAAATTA CATGGTTAAT
AGTGGGCGTA CTTCCTATAT TGCTAAGGGT AAATATTCTG AAATATATAA AGCTGCCTAT
AACTTAAGAC CTGGTGATTT TGTTGCTTAT GAAAAAAATG GACGAATAAC TCACATTTCA
ACAGTTACAG GATTAGATAG TAAAGGTTAT CCCCTAGTAA CTTGCCATAA CACAGATAGA
CTTCTTGTTC CTTTTGATTT AGGTTGGAGC AATGATAATA TACGCTTTCA TCTAGTAGAT
GTTTATTATT GA
 
Protein sequence
MKRKPKILLV TSLSLIVLLT LSIYVSLNKK KTSAFSEVIN LLNEPHQKEY DELKGKFEKV 
LQDLFKNRNI AILNNDLEEL KKFYDLQKKP SLWAYESESK KVKYLNNWSQ KQGVVFNEIK
SKIEIRKARE REKDLYGIIC VVSSEFTYYY LNEPLKTNTF RLGTYHYLNL KDEGDRCIIT
KEWYTDPFAD SLDLNNIKSD EIKSYILNSS SPDYSPDERT QKAIDYAHTY CGAAADNELG
FNYNKKYTDF NPQGGDCANF ASQILFEGGG FKKNSTWNYS DGEGSKAWVN AQAFKNYMVN
SGRTSYIAKG KYSEIYKAAY NLRPGDFVAY EKNGRITHIS TVTGLDSKGY PLVTCHNTDR
LLVPFDLGWS NDNIRFHLVD VYY