Gene Information |
Locus tag | Cthe_0741 |
Symbol | |
ID | 4810359 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 903102 |
End bp | 904529 |
Gene Length | 1428 bp |
Protein Length | 475 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640106158 |
Product | adenylosuccinate lyase |
Protein accession | YP_001037169 |
Protein GI | 125973259 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0015] Adenylosuccinate lyase |
TIGRFAM ID | [TIGR00928] adenylosuccinate lyase |
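The coordinate and length fields above are mutually consistent, which a short check can confirm. This is a minimal sketch that assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the stop codon. The record does not state either convention, but both agree with the numbers shown. The function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 903102
END_BP = 904529
GENE_LENGTH_BP = 1428
PROTEIN_LENGTH_AA = 475


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: number of codons minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 1428
    print(expected_protein_length(GENE_LENGTH_BP))  # 475
```

Because the gene lies on the minus strand, the start and end values describe the span on the replicon, not the direction of transcription.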
| |
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00000408437 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAAAAATA CATATGAAAG TCCTTTAAAT TCAAGATATG CGAGCAAGGA GATGCAGGAA CTGTTTTCTC CCGACATGAA GTTTCGCACA TGGAGGAGGC TCTGGATTGC GCTCGCAGAG GCAGAAAAGG AACTGGGGCT TAACATTACC GATGAACAGA TAGAAGAGCT TAAGAAATAT AAAGATGACA TAAATTATGA TGTGGCTGAA ATGAAAGAAA AAGAGTTTCG CCATGACGTA ATGGCGCACA TACATGCTTA CGGAGAACAG TGTCCAAATG CCAGGCCCAT AATTCATTTG GGCGCTACAT CCTGCTATGT TGGCGACAAT ACCGATATTA TAATAATGAC GGAGGCTTTA AAACTCATAA AGAAAAAACT TCTTTGTGTA ATATCTAAAT TGTCCGATTT TGCGATGAAA TACAAAGATC TTCCCACTTT AGGGTATACA CATTTTCAGC CGGCACAGCT TGTTACGGTG GGGAAGAGAG CAACTTTGTG GATTCAGGAT TTGTTAATGG ATCTTGAAGA CTTGGATTAT ATTCTGTCCA ATATGAAACT TTTAGGTTCC AAAGGCACCA CGGGAACACA GGCAAGCTTT TTAAATCTTT TTGAAAACGA CCATGAAAAA GTAAAAAAAC TGGACATGCT TATAGCAAAG AAAATGGGCT TTGACAAGGT TTTCCCTGTA TCGGGACAAA CCTATACCAG AAAGCTTGAC AGCAGAATTC TAAATCTCTT AAGCTCAATT GCACAGAGTG CTTACAAGTT TGGCAATGAC TTAAGGCTTC TTCAGAGCAT GAAAGAAATT GAAGAACCTT TTGAAAAGCA TCAGATAGGC TCGTCTGCAA TGGCATACAA GAGAAATCCG ATGAGGTCCG AGAGGATTTG TGCTTTGGCA AGATATGTTA TTGTTAACGC TCTAAATCCC GCGATTACCG CATCCACCCA GTGGTTTGAA AGAACTTTGG ATGATTCGGC AAACAAACGT ATATGCATAC CGGAGGCTTT CCTTGCAGTG GATGCAATAC TGAACATATA TATAAATGTC GCAGACGGCA TGGTTGTGTA TCCAAAGGTT ATAGAAAAAC ACGTTTTGGA AGAACTTCCG TTTATGGCTA CGGAGAACAT AATGATGGAA GCCGTTAAAA AAGGCGGAGA CAGACAGGAG CTCCATGAAC GTATAAGGGT TCATTCAATG GAAGCTGCAA AACAGGTTAA GGTTGAAGGA AAGAAAAATG ACCTTATTGA AAGAATAGCG GCCGATGAAA TGTTTGGACT TAGCATTGAC GAACTGAATT CCGTTCTTGC TCCGGAAAAC TACGTTGGAA GAGCTCCGCA GCAGGTGGAG GAGTTTATCA ATGAATATGT AAAGCCTGTT CTTGAAAAGA ATAAGGTTGA GGATATAGAG GTTGAACTTA AGGTTTGA
|
Protein sequence | MKNTYESPLN SRYASKEMQE LFSPDMKFRT WRRLWIALAE AEKELGLNIT DEQIEELKKY KDDINYDVAE MKEKEFRHDV MAHIHAYGEQ CPNARPIIHL GATSCYVGDN TDIIIMTEAL KLIKKKLLCV ISKLSDFAMK YKDLPTLGYT HFQPAQLVTV GKRATLWIQD LLMDLEDLDY ILSNMKLLGS KGTTGTQASF LNLFENDHEK VKKLDMLIAK KMGFDKVFPV SGQTYTRKLD SRILNLLSSI AQSAYKFGND LRLLQSMKEI EEPFEKHQIG SSAMAYKRNP MRSERICALA RYVIVNALNP AITASTQWFE RTLDDSANKR ICIPEAFLAV DAILNIYINV ADGMVVYPKV IEKHVLEELP FMATENIMME AVKKGGDRQE LHERIRVHSM EAAKQVKVEG KKNDLIERIA ADEMFGLSID ELNSVLAPEN YVGRAPQQVE EFINEYVKPV LEKNKVEDIE VELKV
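The relationship between the two sequences above can be reproduced with a few lines of code. The following is a minimal sketch, not the pipeline used to produce this record. It assumes the gene sequence is shown 5' to 3' on the coding strand, which is consistent with it ending in the stop codon TGA. It also assumes that the first codon, TTG, is read as methionine because it is used as a start codon under translation table 11. The helper names (`clean`, `translate_cds`, `gc_percent`) are hypothetical.

```python
# Codon table in the compact NCBI layout (base order T, C, A, G).
# Table 11 uses the same amino acid assignments as the standard code;
# it differs in which codons may act as translation starts.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the displayed sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading the first codon as Met and stopping at a stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        # Assumption: the initiator codon (here TTG) is translated as methionine.
        protein.append("M" if pos == 0 else residue)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First three blocks of the gene sequence shown above.
    prefix = "TTGAAAAATA CATATGAAAG TCCTTTAAAT"
    print(translate_cds(prefix))  # MKNTYESPLN
```

Passing the full gene sequence to `translate_cds` and comparing the result with the protein sequence (after removing its spaces) gives a quick consistency check on the record. `gc_percent` can be compared in the same way with the GC content field.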