Gene Cthe_0741 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0741 
Symbol 
ID4810359 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp903102 
End bp904529 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content40% 
IMG OID640106158 
Productadenylosuccinate lyase 
Protein accessionYP_001037169 
Protein GI125973259 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0015] Adenylosuccinate lyase 
TIGRFAM ID[TIGR00928] adenylosuccinate lyase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000408437 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAAAAATA CATATGAAAG TCCTTTAAAT TCAAGATATG CGAGCAAGGA GATGCAGGAA 
CTGTTTTCTC CCGACATGAA GTTTCGCACA TGGAGGAGGC TCTGGATTGC GCTCGCAGAG
GCAGAAAAGG AACTGGGGCT TAACATTACC GATGAACAGA TAGAAGAGCT TAAGAAATAT
AAAGATGACA TAAATTATGA TGTGGCTGAA ATGAAAGAAA AAGAGTTTCG CCATGACGTA
ATGGCGCACA TACATGCTTA CGGAGAACAG TGTCCAAATG CCAGGCCCAT AATTCATTTG
GGCGCTACAT CCTGCTATGT TGGCGACAAT ACCGATATTA TAATAATGAC GGAGGCTTTA
AAACTCATAA AGAAAAAACT TCTTTGTGTA ATATCTAAAT TGTCCGATTT TGCGATGAAA
TACAAAGATC TTCCCACTTT AGGGTATACA CATTTTCAGC CGGCACAGCT TGTTACGGTG
GGGAAGAGAG CAACTTTGTG GATTCAGGAT TTGTTAATGG ATCTTGAAGA CTTGGATTAT
ATTCTGTCCA ATATGAAACT TTTAGGTTCC AAAGGCACCA CGGGAACACA GGCAAGCTTT
TTAAATCTTT TTGAAAACGA CCATGAAAAA GTAAAAAAAC TGGACATGCT TATAGCAAAG
AAAATGGGCT TTGACAAGGT TTTCCCTGTA TCGGGACAAA CCTATACCAG AAAGCTTGAC
AGCAGAATTC TAAATCTCTT AAGCTCAATT GCACAGAGTG CTTACAAGTT TGGCAATGAC
TTAAGGCTTC TTCAGAGCAT GAAAGAAATT GAAGAACCTT TTGAAAAGCA TCAGATAGGC
TCGTCTGCAA TGGCATACAA GAGAAATCCG ATGAGGTCCG AGAGGATTTG TGCTTTGGCA
AGATATGTTA TTGTTAACGC TCTAAATCCC GCGATTACCG CATCCACCCA GTGGTTTGAA
AGAACTTTGG ATGATTCGGC AAACAAACGT ATATGCATAC CGGAGGCTTT CCTTGCAGTG
GATGCAATAC TGAACATATA TATAAATGTC GCAGACGGCA TGGTTGTGTA TCCAAAGGTT
ATAGAAAAAC ACGTTTTGGA AGAACTTCCG TTTATGGCTA CGGAGAACAT AATGATGGAA
GCCGTTAAAA AAGGCGGAGA CAGACAGGAG CTCCATGAAC GTATAAGGGT TCATTCAATG
GAAGCTGCAA AACAGGTTAA GGTTGAAGGA AAGAAAAATG ACCTTATTGA AAGAATAGCG
GCCGATGAAA TGTTTGGACT TAGCATTGAC GAACTGAATT CCGTTCTTGC TCCGGAAAAC
TACGTTGGAA GAGCTCCGCA GCAGGTGGAG GAGTTTATCA ATGAATATGT AAAGCCTGTT
CTTGAAAAGA ATAAGGTTGA GGATATAGAG GTTGAACTTA AGGTTTGA
 
Protein sequence
MKNTYESPLN SRYASKEMQE LFSPDMKFRT WRRLWIALAE AEKELGLNIT DEQIEELKKY 
KDDINYDVAE MKEKEFRHDV MAHIHAYGEQ CPNARPIIHL GATSCYVGDN TDIIIMTEAL
KLIKKKLLCV ISKLSDFAMK YKDLPTLGYT HFQPAQLVTV GKRATLWIQD LLMDLEDLDY
ILSNMKLLGS KGTTGTQASF LNLFENDHEK VKKLDMLIAK KMGFDKVFPV SGQTYTRKLD
SRILNLLSSI AQSAYKFGND LRLLQSMKEI EEPFEKHQIG SSAMAYKRNP MRSERICALA
RYVIVNALNP AITASTQWFE RTLDDSANKR ICIPEAFLAV DAILNIYINV ADGMVVYPKV
IEKHVLEELP FMATENIMME AVKKGGDRQE LHERIRVHSM EAAKQVKVEG KKNDLIERIA
ADEMFGLSID ELNSVLAPEN YVGRAPQQVE EFINEYVKPV LEKNKVEDIE VELKV