Gene Cthe_1840 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1840 
Symbol 
ID4809386 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2184775 
End bp2185710 
Gene Length936 bp 
Protein Length311 aa 
Translation table11 
GC content46% 
IMG OID640107254 
Productcysteine synthase 
Protein accessionYP_001038254 
Protein GI125974344 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0031] Cysteine synthase 
TIGRFAM ID[TIGR01136] cysteine synthases
[TIGR01139] cysteine synthase A 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000000292185 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTAAAA TAGCTAAGAA TCTGACGGAA CTCATAGGAA ATACCCCGCT TTTGGAGTTG 
AGCAATTATA ACAGAGCAAA CAATTTGGAA GCTGTCCTGA TAGCAAAGCT CGAATACTTC
AATCCTGCAT CCAGTGTAAA GGACAGGATT GGTTATGCAA TGATAAAGGA CGCAGAAGAA
AAAGGAATAA TAAACAAAGA TACGGTTATT ATAGAGCCCA CAAGCGGAAA TACAGGTATT
GCCCTGGCTT TTGTGGCAGC TGCAAGAGGA TACAGGGTTA TACTTACAAT GCCAGAGACC
ATGAGTATTG AAAGAAGGAA TCTTTTAAAG GCTTTGGGTG CCGAGTTGGT GCTGACACCG
GGAGCCGACG GAATGGGAGG AGCGATCAGA AAGGCTGAGG AGCTTGCCCG TGAAATACCC
AACTCCTTTA TCCCTCAACA GTTCTCCAAT CCTGCAAATC CGGAGATTCA CAGAAGGACC
ACGGCAGAGG AAATCTGGAG AGACACCGAC GGACAGGTGG ATATATTTGT GGCGGGAGTT
GGAACAGGAG GAACAATTTC CGGTGTCGGT GAAGTGTTAA AGCAGCGCAA GCCGGATGTA
AAGATTGTTG CGGTGGAGCC TTTTGATTCA CCGGTTCTGT CCGGAGGAAC CAAAGGTCCT
CACAAGATAC AGGGAATAGG TGCCGGTTTT GTGCCGGATA ATTTCAACCG CGCAGTGGTG
GATGAAATAT TCAAGGTTAA AAATGAAGAG GCCTTTGAAA CATCCAGAAA GCTTGCAAGA
ACGGAAGGTC TTTTGGTGGG AATATCCTCG GGAGCTGCAG CTTTTGCAGC CACACAGATT
GCAAAAAGGC CTGAAAACAA AGGAAAGAAC ATTGTGGTTC TGCTTCCCGA TACAGGAGAG
AGATATTTGT CCACGGCATT ATTCCAGGAT GCATAG
 
Protein sequence
MAKIAKNLTE LIGNTPLLEL SNYNRANNLE AVLIAKLEYF NPASSVKDRI GYAMIKDAEE 
KGIINKDTVI IEPTSGNTGI ALAFVAAARG YRVILTMPET MSIERRNLLK ALGAELVLTP
GADGMGGAIR KAEELAREIP NSFIPQQFSN PANPEIHRRT TAEEIWRDTD GQVDIFVAGV
GTGGTISGVG EVLKQRKPDV KIVAVEPFDS PVLSGGTKGP HKIQGIGAGF VPDNFNRAVV
DEIFKVKNEE AFETSRKLAR TEGLLVGISS GAAAFAATQI AKRPENKGKN IVVLLPDTGE
RYLSTALFQD A