Gene CPF_1922 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1922 
SymbolpurB 
ID4201339 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp2157713 
End bp2159143 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content31% 
IMG OID638082791 
Productadenylosuccinate lyase 
Protein accessionYP_696355 
Protein GI110799469 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0015] Adenylosuccinate lyase 
TIGRFAM ID[TIGR00928] adenylosuccinate lyase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAAAATT TATATAGCAC ACCATTGAAT TCAAGATATG CGTCAAAAGA GATGTCATAT 
ATTTTCTCAG ATGATATGAA ATTCTCAACA TGGAGAAAAT TATGGGTTGC TCTTGCAGAG
GGTGAAAAAG AATTAGGATT AAATATAACT GACGAGCAAA TAGAAGAGCT TAAAAGTCAT
ATTTCAGATA TAAATTACGA AGAAGCAATA AAAAAAGAAA AAGAAGTTAG ACACGATGTT
ATGAGTCACG TTTATGCATA TGGACTTCAA TGTCCTTCAG CAAAAGGTAT CATACATTTA
GGAGCAACAA GCTGCTATGT TGGAGATAAT ACAGATGTAA TAATAATGAG AGATGCATTA
TTATTAATAA AGAAAAAAAT AGTTGCAGTT TTAAATAATT TAAAAAGATT TGCTTTAGAA
TATAAGGATA TGCCTACTCT AGGATTTACT CATTTCCAAC CAGCACAGCT TACTACTGTA
GGTAAGAGAG CTACATTATG GATGCAAGAT TTAGTAATGG ATATGGAGAA CATAGATTTT
CTATTATCAA CTTTAAAATT AAGAGGGGTA AAAGGAACTA CTGGTACTCA AGCAAGCTTT
ATGAATCTTT TTGAAGGTGA TGAAGAAAAG GTTAAGGCTT TAGATAAAAT CGTTGCAGAA
AAAATGGGAT TTAAAAAGAG TTTTGGAGTT ACAGGTCAAA CTTATCCAAG AAAATTAGAT
TCTATAATTT TAAATACATT ATCAGAAATT GCACAAAGTG CATATAAATT CTCAAATGAC
TTAAGATTAC TTCAAAGTAT GAAAGAAATA GAAGAACCAT TTGAAAAAAA TCAAATAGGA
TCATCAGCTA TGGCATATAA GAGAAATCCT ATGAGAAGTG AAAGAATGGG AGCATTAGCT
AGATATGTTA TAGTAGATGC ATTAAATCCA GCGATTACGG CTTCAACTCA ATGGTTTGAG
AGAACATTAG ATGATTCAGC TAATAAGAGA ATTGCAGTAG CAGAAGCTTT CTTAGCTTTA
GATGGAGTTT TAAATCTTTA TATAAATATT GCTGAGAATA TGGTAGTTTA TGATAAAGTT
ATTGAAGCTC ATGTAAATCA AGAATTACCT TTCATGGCAA CTGAAAATAT AATGATGGAA
TCAGTTAAAA AAGGCGGAGA TAGACAAGAA CTTCATGAAA GAATAAGAGT TCATTCTATG
GATGCTGCTC AAAGAGTTAA AGGAGAAGGG CTTAATAATG ATTTAATAGA AAGAATAATA
AATGATCCTT CATTTAATCT TTCTAAAGAA GAAATTATAG CTATAATAGA TCCAGTTAAA
TTTGTTGGTA GAGCTCCAAG CCAAGTTGTA GAGTTTATTG ATGAGTATGT AAACCCTATA
ATAGAAGCTA ATAAGGATGC AGCTAGCTTA AGTAGTGATA TAACAGTTTA A
 
Protein sequence
MKNLYSTPLN SRYASKEMSY IFSDDMKFST WRKLWVALAE GEKELGLNIT DEQIEELKSH 
ISDINYEEAI KKEKEVRHDV MSHVYAYGLQ CPSAKGIIHL GATSCYVGDN TDVIIMRDAL
LLIKKKIVAV LNNLKRFALE YKDMPTLGFT HFQPAQLTTV GKRATLWMQD LVMDMENIDF
LLSTLKLRGV KGTTGTQASF MNLFEGDEEK VKALDKIVAE KMGFKKSFGV TGQTYPRKLD
SIILNTLSEI AQSAYKFSND LRLLQSMKEI EEPFEKNQIG SSAMAYKRNP MRSERMGALA
RYVIVDALNP AITASTQWFE RTLDDSANKR IAVAEAFLAL DGVLNLYINI AENMVVYDKV
IEAHVNQELP FMATENIMME SVKKGGDRQE LHERIRVHSM DAAQRVKGEG LNNDLIERII
NDPSFNLSKE EIIAIIDPVK FVGRAPSQVV EFIDEYVNPI IEANKDAASL SSDITV