Gene Cthe_2204 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2204 
Symbol 
ID4811069 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2631796 
End bp2634444 
Gene Length2649 bp 
Protein Length882 aa 
Translation table11 
GC content43% 
IMG OID640107610 
Productcyanophycin synthetase 
Protein accessionYP_001038599 
Protein GI125974689 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0769] UDP-N-acetylmuramyl tripeptide synthase
[COG1181] D-alanine-D-alanine ligase and related ATP-grasp enzymes 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
[TIGR02068] cyanophycin synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0097589 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCGAATAC AAAGTATACA ATGTTTTGCA GGAAGAAATA TTTACAGTCA CAAGCCTGTC 
GTTAAGGTGA CATTGGATAT AGGAGAATTG TACAAATTAC CAACAAAGGA CCTTGGAGAT
TTTAATGAAA GACTTCTTGC CTTGTTTCCC GGATTAAAAA AACATTATTG TTCTTTGGGA
TATGAGGGCG GATTTGAAGA ACGTTTGAAA GAAGGCACAT ATATAGGCCA TGTAACGGAG
CATTTGATTA TAGAGCTTCA GAATATATTG GGATATGAAG TTAATTATGG CAAAACACGG
ATTGTTGAGG AACCGTCACT GTACTTTATA GTGTTTGAAT ATAAAAACGA AAAGTGCGCC
ATAGAGTGTG CAAGAGCGGC GGTAAACATT GTTTTAAAGC TTGTACGCAA TGAAGAGGTT
GATACGGAAG CAATTATAAA TAATTTAAGG GCTATTGCAG TGGAAACGGA TATGGGACCC
AGTACAAAAG CGATCTATGA AGAGGCAAAA AAGAGGGGAA TACCTGTAAC GAGAATAGGC
GACGGCAGTG TTCTAAGGCT GGGATACGGC AAGTATTCCC GGATTGTTCA GGCTTCTTTG
ACGGATTTTC CGAGCTGTAT CAATGTCGAT ATGGCAGGAA ACAAACAGCT TGCGAAACGC
CTCTTGGCGG AAAACAAAAT CCCTGTTCCC GACGGAGATA CGGCCTACAG TTTTGAAGGT
GCTTTGCAGA TAGCACGGGA GATAGGTTTT CCGGTTGTAA TAAAGCCGGT GGACAGCAAT
CAGGGAAAAG GAGTTACTCT TAATATTAAA GATGAGCAGG AAATGGAGAT TGCATATAAT
GAAGCCCGGA AATATTCAAG AGTGGTACTG GTGGAGAAAT ATGTAAAGGG AAAAGATTAC
AGAGTTTTGG TGGTTGGTGA CAGGGTTGCG GCGGTTGCCG AAAGAAGACC GCCTTTTGTA
ATTGGAGATG GCGTTCATAC GGTGGAGGAG CTTGTCGCAA TTGAAAATTT AAGCAGCTTA
AGGGGGGACG ACCATGAAAA GCCCCTTACA AAAATCAAGT TGGATGCCAC AGCATTAAAG
GTTTTGAAAG ATCAGGGCAT TGGCAAGGAC CATGTACCTT CTTTGGGTGA AAGAATATAT
CTAAGATACA ATGGAAACTT AAGCACAGGA GGTACCGCGA GGGAATGTAC TGATGAGATA
CATCCGTATA ATGCTGACAT TGCGGTAAAG GCTGCACAAA TTATAGGGCT TGATATTGCA
GGTGTGGATA TTACCACTGA GGATATATCG GTACCAATCA GCGAAAATGG TGGTGCCATA
ATTGAAATAA ATGCCGCGCC GGGACTTAGA ATGCATTTGT TTCCTTCGGA AGGGAAAACC
AGAAATGTGG CGGCGGATAT ACTGGATATG TTGTTTCCGG AAGCATCCCC CCATTCGATT
CCCATTGTAT CTGTTACCGG TACTAACGGA AAGACGACCA CCACGCGACT GATCGGGCAT
ACTCTTGCCT GCCTGGGGAA AAAGGTGGGA ATGACATCCA CCGGAGGCAT TTTCATTGGC
GATGAATGTG TCTTAAAAGG AGACAACACC GGGCCGACAA GTGCGGCAAT GGTATTGTCA
TCAAAAGAAG TGGAAGTTGC GGTACTTGAG ACTGCAAGAG GCGGAATTGT GAGAAAAGGT
CTGGGGTATG ACCTTGCCGA TGTGGGAGTT ATAACCAATA TTGCGGAAGA CCACCTTGGT
ATTGACGGGA TGAATACTCT TGAAGATTTG GCATTTGCCA AGTCATTGGT GGTTGAGGCG
GTAAAACCTG GCGGATATTC TGTGTTGAAT GCGGATGACA AGTTTGTGCG GTATTTTATG
GAGAGGGCAA AAGGAGAAAT TATTCTGTTC TCGAAAAATA ACAGCAATCC TGTTGTGAAG
GAGCATATGC AAAAAGGCGG AAAAGCGTTA TATGTGGACA AAGACTCAAT TTTCATATAC
AATGGAAAAA CTGCCGAAGT TCTTATGAGT GTAAAAGAGA TTCCAATAAC CTACGGAGGA
ATGATTGAAT GCAATATTGA AAATTCCCTT GCAGCTGCTT CGGCGCTGTA TGGCTTGAAT
CTTTCCATTG AAGCCATACG AAAAGGACTG GCAACCTTTA ATCCCGATAT GGAATCCAAC
CCGGGAAGGT TTAATATCGT GGATATGGGC GATTTCAAGG TGATGCTTGA TTACGGGCAC
AATCCGGCAG GATACCTTGA GGTTATGAAA TTTTTGGACA AAATTGATGC AAAAAGGCTT
GTAGGAGTTA TAGGTATGCC GGGCGACAGG GATGACTTAA GCATATACAA GGCAGGAGAA
ATATGCAGCA AATGTTTTTC TAAAATTTAT ATAAAAGAGG ATAGTGATCT TAGGGGAAGA
GAGCCGGGAG AAGTGGCGGG AATTCTCTAC GATGCCGTTA TCAGCAGCGG AACCAAAAAA
GAAAATGTGG AGATAATTTA TTCTGAGGAC AGAGCCCTTG AGAAAGCACT TCTTGACGCA
CAGCCCGGAG ATTTTGTGGT TATGTTTTAT GAGAATTTTG AAAGGGCGGC AGAGGTCGTG
GAACGGGTCC GCAGAGAGCT TCTGGAGAAT ACTGATACGG CCGGTCCGGT AATTCAGAAT
GTTGGATAA
 
Protein sequence
MRIQSIQCFA GRNIYSHKPV VKVTLDIGEL YKLPTKDLGD FNERLLALFP GLKKHYCSLG 
YEGGFEERLK EGTYIGHVTE HLIIELQNIL GYEVNYGKTR IVEEPSLYFI VFEYKNEKCA
IECARAAVNI VLKLVRNEEV DTEAIINNLR AIAVETDMGP STKAIYEEAK KRGIPVTRIG
DGSVLRLGYG KYSRIVQASL TDFPSCINVD MAGNKQLAKR LLAENKIPVP DGDTAYSFEG
ALQIAREIGF PVVIKPVDSN QGKGVTLNIK DEQEMEIAYN EARKYSRVVL VEKYVKGKDY
RVLVVGDRVA AVAERRPPFV IGDGVHTVEE LVAIENLSSL RGDDHEKPLT KIKLDATALK
VLKDQGIGKD HVPSLGERIY LRYNGNLSTG GTARECTDEI HPYNADIAVK AAQIIGLDIA
GVDITTEDIS VPISENGGAI IEINAAPGLR MHLFPSEGKT RNVAADILDM LFPEASPHSI
PIVSVTGTNG KTTTTRLIGH TLACLGKKVG MTSTGGIFIG DECVLKGDNT GPTSAAMVLS
SKEVEVAVLE TARGGIVRKG LGYDLADVGV ITNIAEDHLG IDGMNTLEDL AFAKSLVVEA
VKPGGYSVLN ADDKFVRYFM ERAKGEIILF SKNNSNPVVK EHMQKGGKAL YVDKDSIFIY
NGKTAEVLMS VKEIPITYGG MIECNIENSL AAASALYGLN LSIEAIRKGL ATFNPDMESN
PGRFNIVDMG DFKVMLDYGH NPAGYLEVMK FLDKIDAKRL VGVIGMPGDR DDLSIYKAGE
ICSKCFSKIY IKEDSDLRGR EPGEVAGILY DAVISSGTKK ENVEIIYSED RALEKALLDA
QPGDFVVMFY ENFERAAEVV ERVRRELLEN TDTAGPVIQN VG