Gene Cthe_0178 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0178 
Symbol 
ID4808666 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp214159 
End bp215535 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content41% 
IMG OID640105589 
Productargininosuccinate lyase 
Protein accessionYP_001036612 
Protein GI125972702 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0165] Argininosuccinate lyase 
TIGRFAM ID[TIGR00838] argininosuccinate lyase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACTTT GGGGCGGTAG ATTTGAAAAA AATACCGACA AATCGGTGGA TGATTTTAAT 
TCATCCATAA GATTTGACTG TCGTATGTAC AAACAAGATA TATTAGGAAG CATTGCCCAT
GCCAAAATGC TTGGAAAATG TAAAATTATT TCCGAGGAAG ACTCCATTCT TATTCAAAAT
ACTTTAAGGG AAATATTAAA AGACATTGAA GAAGGAAAAG TACAGTTTGA AATTGATGCC
GAAGATATTC ATATGAACGT TGAAAAAATC CTCATATCGA GAATCGGGGA TGTGGGAAAG
AAGCTTCACA CCGGAAGGAG CCGCAATGAC CAGGTTGCCC TCGATATAAG GATGTATCTT
AGGGATGAGG TTGTTGAAAT CAGGAAATTG CTGGTGAACC TCGAAAGGAC GCTTATAGAA
ATAGCAAAAA ACAATATTGA CACCATACTT CCGGGATATA CACACCTGCA GAGGGCTCAA
CCCATAACTT TCGCCCATCA CATGATGGCA TATTTTCAGA TGTTCAAACG GGACATTGAA
AGGCTTAACG ATTGCTATAA AAGAATAAAT GTAATGCCCC TGGGTTCCGG CGCCCTGGCC
TCCACAACCT ATCCTCTTGA CAGATACATG GTGGCAAAAG AACTGGGATT TGACTCTATT
ACCGAAAACA GCCTTGATGC GGTAAGCGAC AGGGATTTTG TCATTGAGCT TTCCGCCTGT
CTTTCAATCC TTATGATGCA TCTTAGCCGG TTCAGTGAAG AAATCATTTT ATGGGCCTCC
CATGAGTTTG GTTTTATTGA ATTGGATGAT GCATACAGTA CCGGAAGCAG TATAATGCCT
CAAAAGAAAA ATCCGGATGT GGCAGAGCTG GTAAGGGGCA AAACCGGAAG GGTTTACGGC
GACCTTATGG CACTTTTGTC AGTTATGAAA TCGCTTCCTC TAGCTTACAA CAAGGATATG
CAGGAAGACA AGGAAGCCAT TTTTGACGCT GTGGATACTG TAAAAATGTG TCTGCCGGTG
TTCACCAAAA TGATTGAAAC CATGAAAATA AAAAAAGAAA ACATGCTCCG GGCGGCTCAA
GGCGGATTTA CAAATGCCAC CGACATGGCA GATTACCTTG TTAAAAAAGG TATTCCTTTC
AGAAACGCCC ATGAAATTAT CGGGAAAATG GTTCTGTACT GCATCGAAAA CAACAAGGCT
ATTGAGGAAC TTGACATGAG TGAGTTTAAA AGCTTCTCAG AGCTTATAGA AGAAGATGTG
TATGAAGAAA TAAGCCTGTC AAAATGTGTT TCCGGCAGAA ATCTTCCCGG AGGACCGGCA
AAGGAAAGTG TCATGGCTTC TATTGAAAAC GGGCTTAAGT TTTTGTCCAC ACAATAA
 
Protein sequence
MKLWGGRFEK NTDKSVDDFN SSIRFDCRMY KQDILGSIAH AKMLGKCKII SEEDSILIQN 
TLREILKDIE EGKVQFEIDA EDIHMNVEKI LISRIGDVGK KLHTGRSRND QVALDIRMYL
RDEVVEIRKL LVNLERTLIE IAKNNIDTIL PGYTHLQRAQ PITFAHHMMA YFQMFKRDIE
RLNDCYKRIN VMPLGSGALA STTYPLDRYM VAKELGFDSI TENSLDAVSD RDFVIELSAC
LSILMMHLSR FSEEIILWAS HEFGFIELDD AYSTGSSIMP QKKNPDVAEL VRGKTGRVYG
DLMALLSVMK SLPLAYNKDM QEDKEAIFDA VDTVKMCLPV FTKMIETMKI KKENMLRAAQ
GGFTNATDMA DYLVKKGIPF RNAHEIIGKM VLYCIENNKA IEELDMSEFK SFSELIEEDV
YEEISLSKCV SGRNLPGGPA KESVMASIEN GLKFLSTQ