Gene CPR_1938 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1938 
Symbol 
ID4206194 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp2142980 
End bp2144062 
Gene Length1083 bp 
Protein Length360 aa 
Translation table11 
GC content30% 
IMG OID642566488 
Productspermidine/putrescine-binding periplasmic protein precursor 
Protein accessionYP_699248 
Protein GI110803710 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0687] Spermidine/putrescine-binding periplasmic protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATTAA AAAAATTTTT AGCATTAACT TGTTCAACAC TTCTTTTATC ATCTCTTTTT 
CTTGGTTGTG GACCAAAAAA GGATGAGGAA GCTACTCAAG ATAAAAACAA TAACGTTCTT
TATGTTTATA ACTGGGGAGA TTACATAGAT CCAGAACTAC TTACTAAATT TAAAGAAGAA
ACTGGAATAG ATGTTAAGTA CGATGTTTAT GATACTAATG AAATAATGTA TCAAAAACTT
AATAGTGGTA ATGTATCTTA TGATATAGTA ATTCCTTCTG ATTACATGAT AGAAAAAATG
AAAGAAGAAG ATATGTTAGC TAAGATAGAT TATTCTAATA TTCCTAATTA TAAATATATA
GGAGAACAAT TTAAAAATTT AGCTTATGAT CCAACTAATG AATACTCAGT ACCATATATG
TGGGGAACAG TTGGAATAAT ATATAATACA AAGAAAGTTA GTGATCCTGT AGATAGTTGG
AATATCCTTT GGAACCCTAA ATATAAGGAT CAGGTTATAA TGCCAGATAG CGTTAGAGAT
GCTATGGCTG TTGCAGAAAA AAAATTAGGA TATTCATTAA ATACTGAAAA CTTAGATCAA
ATAGAAGCTG CTAAAAAGGA ACTTATGACT CAAAAAAAAG ATGGATTAAT CTTAGCTTAT
ATGGTTGACC AAGTTAAAGA TGCCATGGTT GGTGGAGAAG CTTCCCTTGC TGTTGCTTGG
TCTGGAGATG CTGTAACAAT GATAGAGAGA AATCCTGATT TAGCTTATGC TATTCCTAAG
GAAGGATCAA ATAAATGGTT TGATGCTATA GGAATTCCTA AAAATGCAAA GCATAAAGAA
AATGCTGAGA AGTTTATAAA CTTCCTTTGC GATTCTGAAA ATGCTGAGCA AAATGTAGAA
TATATAGGAT ACTCTACTCC AAACACAGCC GCATATGACC TTTTACCTGA GGATATAAGA
GATAATAAAG TTGCATATCC AGATAAAGAG TCATTAAAAA ATTGTGAGGT ATTTATAGAT
TTACCTTCTA AAATACTTAG AAAATATGAT GAAGCTTGGT TAGAAATCAA GTGTATTTAT
TAA
 
Protein sequence
MKLKKFLALT CSTLLLSSLF LGCGPKKDEE ATQDKNNNVL YVYNWGDYID PELLTKFKEE 
TGIDVKYDVY DTNEIMYQKL NSGNVSYDIV IPSDYMIEKM KEEDMLAKID YSNIPNYKYI
GEQFKNLAYD PTNEYSVPYM WGTVGIIYNT KKVSDPVDSW NILWNPKYKD QVIMPDSVRD
AMAVAEKKLG YSLNTENLDQ IEAAKKELMT QKKDGLILAY MVDQVKDAMV GGEASLAVAW
SGDAVTMIER NPDLAYAIPK EGSNKWFDAI GIPKNAKHKE NAEKFINFLC DSENAEQNVE
YIGYSTPNTA AYDLLPEDIR DNKVAYPDKE SLKNCEVFID LPSKILRKYD EAWLEIKCIY