Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0178 |
Symbol | |
ID | 4808666 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 214159 |
End bp | 215535 |
Gene Length | 1377 bp |
Protein Length | 458 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640105589 |
Product | argininosuccinate lyase |
Protein accession | YP_001036612 |
Protein GI | 125972702 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0165] Argininosuccinate lyase |
TIGRFAM ID | [TIGR00838] argininosuccinate lyase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACTTT GGGGCGGTAG ATTTGAAAAA AATACCGACA AATCGGTGGA TGATTTTAAT TCATCCATAA GATTTGACTG TCGTATGTAC AAACAAGATA TATTAGGAAG CATTGCCCAT GCCAAAATGC TTGGAAAATG TAAAATTATT TCCGAGGAAG ACTCCATTCT TATTCAAAAT ACTTTAAGGG AAATATTAAA AGACATTGAA GAAGGAAAAG TACAGTTTGA AATTGATGCC GAAGATATTC ATATGAACGT TGAAAAAATC CTCATATCGA GAATCGGGGA TGTGGGAAAG AAGCTTCACA CCGGAAGGAG CCGCAATGAC CAGGTTGCCC TCGATATAAG GATGTATCTT AGGGATGAGG TTGTTGAAAT CAGGAAATTG CTGGTGAACC TCGAAAGGAC GCTTATAGAA ATAGCAAAAA ACAATATTGA CACCATACTT CCGGGATATA CACACCTGCA GAGGGCTCAA CCCATAACTT TCGCCCATCA CATGATGGCA TATTTTCAGA TGTTCAAACG GGACATTGAA AGGCTTAACG ATTGCTATAA AAGAATAAAT GTAATGCCCC TGGGTTCCGG CGCCCTGGCC TCCACAACCT ATCCTCTTGA CAGATACATG GTGGCAAAAG AACTGGGATT TGACTCTATT ACCGAAAACA GCCTTGATGC GGTAAGCGAC AGGGATTTTG TCATTGAGCT TTCCGCCTGT CTTTCAATCC TTATGATGCA TCTTAGCCGG TTCAGTGAAG AAATCATTTT ATGGGCCTCC CATGAGTTTG GTTTTATTGA ATTGGATGAT GCATACAGTA CCGGAAGCAG TATAATGCCT CAAAAGAAAA ATCCGGATGT GGCAGAGCTG GTAAGGGGCA AAACCGGAAG GGTTTACGGC GACCTTATGG CACTTTTGTC AGTTATGAAA TCGCTTCCTC TAGCTTACAA CAAGGATATG CAGGAAGACA AGGAAGCCAT TTTTGACGCT GTGGATACTG TAAAAATGTG TCTGCCGGTG TTCACCAAAA TGATTGAAAC CATGAAAATA AAAAAAGAAA ACATGCTCCG GGCGGCTCAA GGCGGATTTA CAAATGCCAC CGACATGGCA GATTACCTTG TTAAAAAAGG TATTCCTTTC AGAAACGCCC ATGAAATTAT CGGGAAAATG GTTCTGTACT GCATCGAAAA CAACAAGGCT ATTGAGGAAC TTGACATGAG TGAGTTTAAA AGCTTCTCAG AGCTTATAGA AGAAGATGTG TATGAAGAAA TAAGCCTGTC AAAATGTGTT TCCGGCAGAA ATCTTCCCGG AGGACCGGCA AAGGAAAGTG TCATGGCTTC TATTGAAAAC GGGCTTAAGT TTTTGTCCAC ACAATAA
|
Protein sequence | MKLWGGRFEK NTDKSVDDFN SSIRFDCRMY KQDILGSIAH AKMLGKCKII SEEDSILIQN TLREILKDIE EGKVQFEIDA EDIHMNVEKI LISRIGDVGK KLHTGRSRND QVALDIRMYL RDEVVEIRKL LVNLERTLIE IAKNNIDTIL PGYTHLQRAQ PITFAHHMMA YFQMFKRDIE RLNDCYKRIN VMPLGSGALA STTYPLDRYM VAKELGFDSI TENSLDAVSD RDFVIELSAC LSILMMHLSR FSEEIILWAS HEFGFIELDD AYSTGSSIMP QKKNPDVAEL VRGKTGRVYG DLMALLSVMK SLPLAYNKDM QEDKEAIFDA VDTVKMCLPV FTKMIETMKI KKENMLRAAQ GGFTNATDMA DYLVKKGIPF RNAHEIIGKM VLYCIENNKA IEELDMSEFK SFSELIEEDV YEEISLSKCV SGRNLPGGPA KESVMASIEN GLKFLSTQ
|
| |