Gene CPF_0682 details


Gene Information

Locus tag: CPF_0682
Symbol: argH
ID: 4201776
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 812918
End bp: 814318
Gene Length: 1401 bp
Protein Length: 466 aa
Translation table: 11
GC content: 28%
IMG OID: 638081567
Product: argininosuccinate lyase
Protein accession: YP_695134
Protein GI: 110798800
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0165] Argininosuccinate lyase
TIGRFAM ID: [TIGR00838] argininosuccinate lyase
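
The start, end, gene length, and protein length fields in this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive (the record does not say so, but this reading reproduces the listed 1401 bp) and that the last codon is a stop codon not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


if __name__ == "__main__":
    length = gene_length(812918, 814318)
    print(length, protein_length(length))
```

With the values from this record, the two functions give 1401 bp and 466 aa, which agree with the Gene Length and Protein Length fields.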


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAATTAT GGGGCGGAAG ATTCACTCAC CAAGTTGATG ATCTAGTTAA CACTTTTAAT 
TCCTCTATTT CTTTTGATTC AAGAATGTAT AAAGAAGATA TAATTGGAAG TATAGCTCAT
GTTACTATGC TTGGTGAAGA AAAAATTATT CCAAAGGAAG ATAGCAAAAA AATTGCTTCT
GGTTTATATG AAATATTAAA TAAATTAAAT CAAGGAGTAT TAAAAATAGA TAACTCTTCA
GAAGATATAC ACAGTTTTAT AGAAAGTACT CTTACAGATT ACATTGGTGA AGAAGGAAAA
AAACTACATA CTGGTAGAAG TAGAAATGAT CAAGTAACCT TAGACACAAA ATTATATTTA
AAAGGATATA TTAAAATTTT AATATGTGAA ATTTTAAACC TTGAAAAAAC TCTATTAAAT
CTTTCTTCAG AAAATAAAGA AACTATTATG CCAGGATATA CCCATATGCA AAAGGCTCAA
CCTATTACAT TTGCTCATCA TATTTTAGCA TATAGTGAAA TGTTTAAAAG AGATATATCT
AGATTACTAG ATTGTTATAA AAGACTTGAT GAAATGCCTT TAGGCAGTGG TGCTTTAGCA
ACTACTACTT ACCCTATAAA TCGTGAAAAA GTTGCAAATC TACTAGGCTT TTCAAAAGTT
ACACTAAATA GTTTAGATTC TGTTTCTGAT AGGGATTATG CTATTGAAAC ACTTTCTTGC
CTCTCTTTAC TTATGATGCA TCTTTCTAGA TTTTCAGAGG AAATAATCAT CTGGTCTACT
GATGAATTTA AATTTATTGA ATTAGATGAT AGTTATAGTA CTGGAAGCAG TATTATGCCA
CAAAAAAAGA ATCCTGATGT TGCAGAATTA GTAAGAGGAA AAACAGGACG TGTTTATGGA
GATTTAATGA CGCTATTAAC TGTTATGAAG GGACTTCCTT TAGCTTATAA TAAGGATATG
CAAGAAGACA AAGAAGCTTT ATTTGATGGG TTAGATACTA CTCTACTTTC TATAAAAACT
TTTAATGGAA TGATAAAAAC AATGAAAATT AATAAGAGTA TTATGAAAAC TTCAGCTTCT
TCTGGATTTA CTAACGCCAC TGACGTCGCT GATTATCTAG TAAAAAAAGG GGTAGCTTTT
AGAGATGCTC ATGAGATTGT AGGAAATTTA ATTCTTTATT GTATAGATGA AGGGAAATCT
ATTGATAACT TATCTTTATC TGAATTTAAA ACTTTCTCAA ATAAGTTTGA AAATGATATA
TATAAAGCTA TTAATCTTTT AACTTGTATA GAAGAAAGAA AAGTAATAGG TGGACCAAGT
ATTTCATCTA TAAACATTCA AATTGAACAT TTAAATAATT TTATACAAGA AAGTAATGAA
AAACTTAATC TTCTAAAATA G
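
The gene sequence is printed in blocks of 10 bases, so the spacing has to be removed before any calculation. The sketch below is one way a GC percentage such as the one in the GC content field could be computed from text in this layout. It is only an illustration: the helper names are hypothetical, and how the database itself rounds the value is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # First two blocks of the gene sequence, used here as a short example.
    prefix = "ATGAAATTAT GGGGCGGAAG"
    print(clean_sequence(prefix), gc_content(prefix))
```

Pasting the full gene sequence into `gc_content` gives the percentage for the whole gene, which can then be compared with the listed value.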
 
Protein sequence
MKLWGGRFTH QVDDLVNTFN SSISFDSRMY KEDIIGSIAH VTMLGEEKII PKEDSKKIAS 
GLYEILNKLN QGVLKIDNSS EDIHSFIEST LTDYIGEEGK KLHTGRSRND QVTLDTKLYL
KGYIKILICE ILNLEKTLLN LSSENKETIM PGYTHMQKAQ PITFAHHILA YSEMFKRDIS
RLLDCYKRLD EMPLGSGALA TTTYPINREK VANLLGFSKV TLNSLDSVSD RDYAIETLSC
LSLLMMHLSR FSEEIIIWST DEFKFIELDD SYSTGSSIMP QKKNPDVAEL VRGKTGRVYG
DLMTLLTVMK GLPLAYNKDM QEDKEALFDG LDTTLLSIKT FNGMIKTMKI NKSIMKTSAS
SGFTNATDVA DYLVKKGVAF RDAHEIVGNL ILYCIDEGKS IDNLSLSEFK TFSNKFENDI
YKAINLLTCI EERKVIGGPS ISSINIQIEH LNNFIQESNE KLNLLK
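
The protein sequence corresponds to the gene sequence translated with translation table 11 (the bacterial, archaeal, and plant plastid code). Table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons, so a plain codon lookup is enough for a quick check. The following is a minimal Python sketch with a hypothetical `translate` helper. It assumes the sequence starts in frame at the first base and stops at the first stop codon.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (NCBI table 11 assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence from this record.
    print(translate("ATGAAATTAT GGGGCGGAAG ATTCACTCAC"))
```

For the first 30 bases of the gene this yields MKLWGGRFTH, the first block of the protein sequence above.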