Gene CPR_1650 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1650 
Symbol 
ID4204382 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1843935 
End bp1845131 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content31% 
IMG OID642566200 
Productaspartate kinase I 
Protein accessionYP_698965 
Protein GI110801766 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0527] Aspartokinases 
TIGRFAM ID[TIGR00657] aspartate kinase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAAAATAG TAGTACAAAA ATTTGGTGGT ACTTCTGTAT CAACAAAAGA AAGAAGAACT 
ATGGTTGTAG AAAAGGTAAA AGGGGCAATA AATAAAGGAT ATTTACCAGT TGTTGTTGTA
TCTGCTATGG GAAGAAAAGG AGAACCTTAT GCAACTGATA CACTTTTAGG ATTAGTATCA
GAAGAGTTTA AAAAAGAAAA TAAATTAGCT ACTGATTTAC TTATGGGATG TGGAGAAATT
ATAAGCACTG TTGTCATGAG TGATGAATTA AGAGAGGCAG AAATAGAAGC AGTTCCTTTA
ACAGGAGGAA ATGCAGGAAT ACTTACAGAT GATAATTATT CAAGTGCTGA TTTTATAGAT
ATAAATCCTA AATTAATATT AGAAGTTCTT AAAGAAGGAA AAGTGCCTGT TGTTGCTGGA
TTCCAAGGAG TTGACAGAAA TGGGTTCTTA ACTACTTTAG GAAGAGGCGG AAGTGATACT
ACGGCAGCAG TTTTAGGAGT TGCATTAAAG GCTGAAGAAA TTGAAATATA TACAGATGTT
GATGGAATAA TGACTGCTGA TCCAAGAATA GTTGGAGATG CTGAGCTTAT AAATAAAATA
AGTTATAATG AAGTATTCCA ATTAGCAGAT CAAGGGGCTA AGGTAATTCA TCCAAGAGCT
GTAGAAATAG CAATGAAGGG AAATGTAACC TTAGTAATAA AGAATACTAT GAGTACTTGC
ATTGGTACAA TGATAGATAG TCTAGGAGAT GTAGATAATA ATAAGTTTAT AACTGGAATA
ACTCATCAAG GAAATAGAAT TCAAGTTTCT ATAAAATCTG ATGACAATAA AGATAATATA
AATTATAAGA CTATTTTAGA AAGTCTTGCA AACAATAAAA TAAGTTTAGA CTTAATCAAT
ATCTTCCCTA AAGAAAAAGT GTTTACAATA GATGCAGGTG TTAAAGAATT ATTTGAGGAT
ATTATGAGAA AAAATAATTT AAAATATAGT TTAGTAGAAG ATTTAAGTAC TATAGCTATA
GTTGGTTCAA GAATGAGAGG AATTCCAGGA GTTATGGCTA AAATAGTAGG TGCTTTAGAC
AATGAAAACA TAGAAGTTTT ACAAACAGCA GATTCACATA TGACAATTTG GTGTCTTGTT
GAAAGTAAAA ATGTAAGAGA AGCTATAAAA GCACTTCATA GAGTATTTAT GAAATAA
 
Protein sequence
MKIVVQKFGG TSVSTKERRT MVVEKVKGAI NKGYLPVVVV SAMGRKGEPY ATDTLLGLVS 
EEFKKENKLA TDLLMGCGEI ISTVVMSDEL REAEIEAVPL TGGNAGILTD DNYSSADFID
INPKLILEVL KEGKVPVVAG FQGVDRNGFL TTLGRGGSDT TAAVLGVALK AEEIEIYTDV
DGIMTADPRI VGDAELINKI SYNEVFQLAD QGAKVIHPRA VEIAMKGNVT LVIKNTMSTC
IGTMIDSLGD VDNNKFITGI THQGNRIQVS IKSDDNKDNI NYKTILESLA NNKISLDLIN
IFPKEKVFTI DAGVKELFED IMRKNNLKYS LVEDLSTIAI VGSRMRGIPG VMAKIVGALD
NENIEVLQTA DSHMTIWCLV ESKNVREAIK ALHRVFMK