Gene Information |
Locus tag | Cthe_2204 |
Symbol | |
ID | 4811069 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2631796 |
End bp | 2634444 |
Gene Length | 2649 bp |
Protein Length | 882 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640107610 |
Product | cyanophycin synthetase |
Protein accession | YP_001038599 |
Protein GI | 125974689 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0769] UDP-N-acetylmuramyl tripeptide synthase; [COG1181] D-alanine-D-alanine ligase and related ATP-grasp enzymes |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type; [TIGR02068] cyanophycin synthetase |
|
|
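The coordinate and length fields above can be cross-checked with a few lines of Python. This is a minimal sketch, not part of the record itself: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a stop codon that is not counted in the protein length. The variable names are illustrative.

```python
# Values copied from the Gene Information table.
start_bp = 2631796
end_bp = 2634444

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: unspliced CDS whose last codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 2649 882
```

Under these assumptions the computed values agree with the Gene Length (2649 bp) and Protein Length (882 aa) fields.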
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0097589 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCGAATAC AAAGTATACA ATGTTTTGCA GGAAGAAATA TTTACAGTCA CAAGCCTGTC GTTAAGGTGA CATTGGATAT AGGAGAATTG TACAAATTAC CAACAAAGGA CCTTGGAGAT TTTAATGAAA GACTTCTTGC CTTGTTTCCC GGATTAAAAA AACATTATTG TTCTTTGGGA TATGAGGGCG GATTTGAAGA ACGTTTGAAA GAAGGCACAT ATATAGGCCA TGTAACGGAG CATTTGATTA TAGAGCTTCA GAATATATTG GGATATGAAG TTAATTATGG CAAAACACGG ATTGTTGAGG AACCGTCACT GTACTTTATA GTGTTTGAAT ATAAAAACGA AAAGTGCGCC ATAGAGTGTG CAAGAGCGGC GGTAAACATT GTTTTAAAGC TTGTACGCAA TGAAGAGGTT GATACGGAAG CAATTATAAA TAATTTAAGG GCTATTGCAG TGGAAACGGA TATGGGACCC AGTACAAAAG CGATCTATGA AGAGGCAAAA AAGAGGGGAA TACCTGTAAC GAGAATAGGC GACGGCAGTG TTCTAAGGCT GGGATACGGC AAGTATTCCC GGATTGTTCA GGCTTCTTTG ACGGATTTTC CGAGCTGTAT CAATGTCGAT ATGGCAGGAA ACAAACAGCT TGCGAAACGC CTCTTGGCGG AAAACAAAAT CCCTGTTCCC GACGGAGATA CGGCCTACAG TTTTGAAGGT GCTTTGCAGA TAGCACGGGA GATAGGTTTT CCGGTTGTAA TAAAGCCGGT GGACAGCAAT CAGGGAAAAG GAGTTACTCT TAATATTAAA GATGAGCAGG AAATGGAGAT TGCATATAAT GAAGCCCGGA AATATTCAAG AGTGGTACTG GTGGAGAAAT ATGTAAAGGG AAAAGATTAC AGAGTTTTGG TGGTTGGTGA CAGGGTTGCG GCGGTTGCCG AAAGAAGACC GCCTTTTGTA ATTGGAGATG GCGTTCATAC GGTGGAGGAG CTTGTCGCAA TTGAAAATTT AAGCAGCTTA AGGGGGGACG ACCATGAAAA GCCCCTTACA AAAATCAAGT TGGATGCCAC AGCATTAAAG GTTTTGAAAG ATCAGGGCAT TGGCAAGGAC CATGTACCTT CTTTGGGTGA AAGAATATAT CTAAGATACA ATGGAAACTT AAGCACAGGA GGTACCGCGA GGGAATGTAC TGATGAGATA CATCCGTATA ATGCTGACAT TGCGGTAAAG GCTGCACAAA TTATAGGGCT TGATATTGCA GGTGTGGATA TTACCACTGA GGATATATCG GTACCAATCA GCGAAAATGG TGGTGCCATA ATTGAAATAA ATGCCGCGCC GGGACTTAGA ATGCATTTGT TTCCTTCGGA AGGGAAAACC AGAAATGTGG CGGCGGATAT ACTGGATATG TTGTTTCCGG AAGCATCCCC CCATTCGATT CCCATTGTAT CTGTTACCGG TACTAACGGA AAGACGACCA CCACGCGACT GATCGGGCAT ACTCTTGCCT GCCTGGGGAA AAAGGTGGGA ATGACATCCA CCGGAGGCAT TTTCATTGGC GATGAATGTG TCTTAAAAGG AGACAACACC GGGCCGACAA GTGCGGCAAT GGTATTGTCA TCAAAAGAAG TGGAAGTTGC GGTACTTGAG ACTGCAAGAG GCGGAATTGT GAGAAAAGGT CTGGGGTATG ACCTTGCCGA TGTGGGAGTT ATAACCAATA TTGCGGAAGA CCACCTTGGT ATTGACGGGA TGAATACTCT TGAAGATTTG GCATTTGCCA AGTCATTGGT GGTTGAGGCG 
GTAAAACCTG GCGGATATTC TGTGTTGAAT GCGGATGACA AGTTTGTGCG GTATTTTATG GAGAGGGCAA AAGGAGAAAT TATTCTGTTC TCGAAAAATA ACAGCAATCC TGTTGTGAAG GAGCATATGC AAAAAGGCGG AAAAGCGTTA TATGTGGACA AAGACTCAAT TTTCATATAC AATGGAAAAA CTGCCGAAGT TCTTATGAGT GTAAAAGAGA TTCCAATAAC CTACGGAGGA ATGATTGAAT GCAATATTGA AAATTCCCTT GCAGCTGCTT CGGCGCTGTA TGGCTTGAAT CTTTCCATTG AAGCCATACG AAAAGGACTG GCAACCTTTA ATCCCGATAT GGAATCCAAC CCGGGAAGGT TTAATATCGT GGATATGGGC GATTTCAAGG TGATGCTTGA TTACGGGCAC AATCCGGCAG GATACCTTGA GGTTATGAAA TTTTTGGACA AAATTGATGC AAAAAGGCTT GTAGGAGTTA TAGGTATGCC GGGCGACAGG GATGACTTAA GCATATACAA GGCAGGAGAA ATATGCAGCA AATGTTTTTC TAAAATTTAT ATAAAAGAGG ATAGTGATCT TAGGGGAAGA GAGCCGGGAG AAGTGGCGGG AATTCTCTAC GATGCCGTTA TCAGCAGCGG AACCAAAAAA GAAAATGTGG AGATAATTTA TTCTGAGGAC AGAGCCCTTG AGAAAGCACT TCTTGACGCA CAGCCCGGAG ATTTTGTGGT TATGTTTTAT GAGAATTTTG AAAGGGCGGC AGAGGTCGTG GAACGGGTCC GCAGAGAGCT TCTGGAGAAT ACTGATACGG CCGGTCCGGT AATTCAGAAT GTTGGATAA
|
Protein sequence | MRIQSIQCFA GRNIYSHKPV VKVTLDIGEL YKLPTKDLGD FNERLLALFP GLKKHYCSLG YEGGFEERLK EGTYIGHVTE HLIIELQNIL GYEVNYGKTR IVEEPSLYFI VFEYKNEKCA IECARAAVNI VLKLVRNEEV DTEAIINNLR AIAVETDMGP STKAIYEEAK KRGIPVTRIG DGSVLRLGYG KYSRIVQASL TDFPSCINVD MAGNKQLAKR LLAENKIPVP DGDTAYSFEG ALQIAREIGF PVVIKPVDSN QGKGVTLNIK DEQEMEIAYN EARKYSRVVL VEKYVKGKDY RVLVVGDRVA AVAERRPPFV IGDGVHTVEE LVAIENLSSL RGDDHEKPLT KIKLDATALK VLKDQGIGKD HVPSLGERIY LRYNGNLSTG GTARECTDEI HPYNADIAVK AAQIIGLDIA GVDITTEDIS VPISENGGAI IEINAAPGLR MHLFPSEGKT RNVAADILDM LFPEASPHSI PIVSVTGTNG KTTTTRLIGH TLACLGKKVG MTSTGGIFIG DECVLKGDNT GPTSAAMVLS SKEVEVAVLE TARGGIVRKG LGYDLADVGV ITNIAEDHLG IDGMNTLEDL AFAKSLVVEA VKPGGYSVLN ADDKFVRYFM ERAKGEIILF SKNNSNPVVK EHMQKGGKAL YVDKDSIFIY NGKTAEVLMS VKEIPITYGG MIECNIENSL AAASALYGLN LSIEAIRKGL ATFNPDMESN PGRFNIVDMG DFKVMLDYGH NPAGYLEVMK FLDKIDAKRL VGVIGMPGDR DDLSIYKAGE ICSKCFSKIY IKEDSDLRGR EPGEVAGILY DAVISSGTKK ENVEIIYSED RALEKALLDA QPGDFVVMFY ENFERAAEVV ERVRRELLEN TDTAGPVIQN VG
|
| |
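The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It is written under stated assumptions: the sequence is taken as listed (already in coding orientation, since its first bases translate to the start of the listed protein), the amino acid assignments are those of the standard code that translation table 11 shares, and the set of start codons is deliberately simplified to ATG, GTG and TTG. The helper names `clean` and `translate_cds` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: simplified set of bacterial start codons; each is read as Met
# when it is the first codon of the CDS.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks and the final nine bases of the gene sequence above.
print(translate_cds("GTGCGAATAC AAAGTATACA ATGTTTTGCA"))  # MRIQSIQCFA
print(translate_cds("GTTGGATAA"))  # VG
```

The two outputs match the first ten residues and the last two residues of the listed protein sequence. The GTG start codon is read as methionine only because it is the first codon; elsewhere in the sequence GTG encodes valine.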