Gene CPR_1640 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1640 
SymbolpurB 
ID4205473 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1831762 
End bp1833192 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content30% 
IMG OID642566190 
Productadenylosuccinate lyase 
Protein accessionYP_698955 
Protein GI110802188 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0015] Adenylosuccinate lyase 
TIGRFAM ID[TIGR00928] adenylosuccinate lyase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.591721 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAAAATT TATATAGCAC ACCATTGAAT TCAAGATATG CGTCAAAAGA GATGTCATAT 
ATTTTCTCAG ATGATATGAA ATTCTCAACA TGGAGAAAAT TATGGGTTGC TCTTGCAGAG
GGTGAAAAAG AATTAGGATT AAATATAACT GACGAGCAAA TAGAAGAACT TAAAAGTCAT
ATTTCAGATA TAAATTACGA AGAAGCAATA AAAAAAGAAA AAGAAGTTAG ACACGATGTT
ATGAGTCACG TTTATGCATA TGGACTTCAA TGTCCTTCAG CAAAAGGTAT CATACATTTA
GGAGCAACAA GCTGCTATGT TGGAGATAAT ACAGATGTAA TAATAATGAG AGATGCATTA
TTATTAATAA AGAAAAAAAT AGTTGCAGTT TTAAATAATT TAAAAAGATT TGCTTTAGAA
TATAAGGATA TGCCTACTCT AGGATTTACT CATTTCCAAC CAGCACAGCT TACTACTGTA
GGTAAGAGAG CTACATTATG GATGCAAGAT TTAGTAATGG ATATGGAGAA CATAGATTTT
CTATTATCAA CTTTAAAATT AAGAGGGGTA AAAGGAACTA CTGGTACTCA AGCAAGCTTT
ATGAATCTTT TTGAAGGTGA TGAAGAAAAG GTTAAGGCTT TAGATAAAAT CGTTGCAGAA
AAAATGGGAT TTAAAAAGAG TTTTGGAGTT ACAGGTCAAA CTTATCCAAG AAAATTAGAT
TCTATAATTT TAAATACATT ATCAGAAATT GCACAAAGTG CATATAAATT CTCAAATGAC
TTAAGATTAC TTCAAAGTAT GAAAGAAATA GAAGAACCAT TTGAAAAAAA TCAAATAGGG
TCATCAGCTA TGGCATATAA GAGAAATCCT ATGAGAAGTG AAAGAATGGG AGCATTAGCT
AGATATGTTA TAGTAGATGC CTTAAATCCA GCTATTACGG CTTCAACTCA ATGGTTTGAG
AGAACACTAG ATGATTCAGC TAATAAGAGA ATTGCAGTAG CAGAAGCTTT CTTAGCTTTA
GATGGAGTTT TAAATCTTTA TATAAATATT GCTGAGAATA TGGTAGTTTA TGATAAAGTT
ATTGAAGCTC ATGTAAATCA AGAATTACCT TTCATGGCAA CTGAAAATAT AATGATGGAA
TCAGTTAAAA AAGGTGGAGA TAGACAAGAA CTTCATGAAA GAATAAGAGT TCATTCTATG
GATGCTGCTC AAAGAGTTAA AGGAGAAGGG CTTAATAATG ATTTAATAAA AAGAATAATA
AATGATCCTT CATTTAATCT TTCTAAAGAA GAAATTATAG CTATAATAGA TCCAGTTAAA
TTTGTTGGTA GAGCTCCAAG CCAAGTTGTA GAGTTTATTG ATGAGTATGT AAACCCTATA
ATAGAAGCTA ATAAGGATGC AGCTAGCTTA AGTAGTGATA TAACAGTTTA A
 
Protein sequence
MKNLYSTPLN SRYASKEMSY IFSDDMKFST WRKLWVALAE GEKELGLNIT DEQIEELKSH 
ISDINYEEAI KKEKEVRHDV MSHVYAYGLQ CPSAKGIIHL GATSCYVGDN TDVIIMRDAL
LLIKKKIVAV LNNLKRFALE YKDMPTLGFT HFQPAQLTTV GKRATLWMQD LVMDMENIDF
LLSTLKLRGV KGTTGTQASF MNLFEGDEEK VKALDKIVAE KMGFKKSFGV TGQTYPRKLD
SIILNTLSEI AQSAYKFSND LRLLQSMKEI EEPFEKNQIG SSAMAYKRNP MRSERMGALA
RYVIVDALNP AITASTQWFE RTLDDSANKR IAVAEAFLAL DGVLNLYINI AENMVVYDKV
IEAHVNQELP FMATENIMME SVKKGGDRQE LHERIRVHSM DAAQRVKGEG LNNDLIKRII
NDPSFNLSKE EIIAIIDPVK FVGRAPSQVV EFIDEYVNPI IEANKDAASL SSDITV