Gene Information |
Locus tag | Cthe_0504 |
Symbol | |
ID | 4808304 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 614616 |
End bp | 615851 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640105917 |
Product | poly-gamma-glutamate biosynthesis protein |
Protein accession | YP_001036934 |
Protein GI | 125973024 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2843] Putative enzyme of poly-gamma-glutamate biosynthesis (capsule formation) |
TIGRFAM ID | |
|
|
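The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 614616
end_bp = 615851

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # 1236 411
```

Under these assumptions the result agrees with the listed gene length of 1236 bp and protein length of 411 aa.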
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000000299856 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAAAAA AATATGTAGC ATTGGCGGTA ATTGTTTTTC TTATAGGAAT TTCAGCTCTT GGTGGCTGTA ATTTCAGATC TGACAGTGTT GCCGTTGTAA ATGAGAAAAA TACCGGCACT TATGAAAATG ATTTGGATCA AAAGAAAGTA GCGGATGCTG CAAAAGACGG CACAGTTGGT ACTGTAAGTG AAACTGTTTA CAGCACCGTC CAACCCACGG CAACTCCCGA AAAAGAGTTA AAGATTGTTG CGGTGGGAGA CATTCTTTTG GGCCGGGGTG TGGGCATGAG GCTTAAAAAC GGCAATAAAG ACTTTACATA TCCTTTTCTT GAGGTCAGGG ACATCCTAAG GAAAGGGGAC GTTGTGTTTG GCAATCTGGA AGAGCCCATT ACATCAAGCA CTCATTCCCT TACAGGCATA AAAGAAGGCG GAAAATATGT GCTCAAAAAT GATGTTGAGG CGATAGAGGG GATTAAGTAT GCCGGATTTA ATTTGATGAA CCTTGCGAAC AACCACATAC TTGATTATTA TGAGCGCGGG CTGTTTGATA CGATGGATAT TTTGGATAAA AACGGTATCA AGTATGCCGG AGCGGGAAGA AATTTGGAAG AAGCCAGAAA GCCCGCAATA ATGGAAGTAA AGAGCATGAA AGTGGGAATG CTGGCTTACA CCGATATGGC GGAAATTGTG TACAAGGGCA ATCCGAACTA CAAGTTTGCG GCCGGAGAGG ACAAGCCGGG GGTTGCACCA AGACCTTTGA AATTTGACGA TTCCATAAAA AAAGACATAG AAGAGTTACG GAGCAAGGTG GATATTTTAA TTGTTTCACT TCACTGGGGA GTGGAGGAAA GCTTTGAAGT TCTGCCTGAA CAGAGGGAAT TTGCCCACAG TCTTATAGAT AACGGAGTGG ATGTAATATT GGGACACCAT CCCCACCAGT TCCAAGGTAT AGAAATCTAC AAGGGCAAAC CTGTTTTCTA CAGTCTGGGT AATTTTATTT TTGATCAGAA CGATCCCGAA AACCAGGAGT CCTTTATTGT GACACTTGAT TACAAAGGCA GCAGACTGAC AGGAATAGAG GCTGTACCCG TGAGAACAAT CGGAAAAATA CAGGTAGTTC CTCAAAAAGG AGATGAAGCA AAACCTATTT TGGAAAGAGA GAAAAATTTA TGTAATAGGC TTGATACAAA CTGCATTATA AAAGATGACA AATTATATTT TGAAATTGGA AAATAA
|
Protein sequence | MRKKYVALAV IVFLIGISAL GGCNFRSDSV AVVNEKNTGT YENDLDQKKV ADAAKDGTVG TVSETVYSTV QPTATPEKEL KIVAVGDILL GRGVGMRLKN GNKDFTYPFL EVRDILRKGD VVFGNLEEPI TSSTHSLTGI KEGGKYVLKN DVEAIEGIKY AGFNLMNLAN NHILDYYERG LFDTMDILDK NGIKYAGAGR NLEEARKPAI MEVKSMKVGM LAYTDMAEIV YKGNPNYKFA AGEDKPGVAP RPLKFDDSIK KDIEELRSKV DILIVSLHWG VEESFEVLPE QREFAHSLID NGVDVILGHH PHQFQGIEIY KGKPVFYSLG NFIFDQNDPE NQESFIVTLD YKGSRLTGIE AVPVRTIGKI QVVPQKGDEA KPILEREKNL CNRLDTNCII KDDKLYFEIG K
|
| |
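The gene and protein sequences above can be related with a short translation sketch. It is a minimal illustration, not the method used to build this record. It assumes the sequences are stored in space-separated blocks of ten characters, as shown above, and it relies on translation table 11 using the same codon-to-amino-acid assignments as the standard genetic code (the tables differ in which start codons are permitted). The function names `clean`, `translate` and `gc_fraction` are hypothetical helpers.

```python
# Standard codon assignments in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between ten-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence, copied from the record above.
gene_prefix = "ATGAGAAAAA AATATGTAGC ATTGGCGGTA"
print(translate(gene_prefix))  # MRKKYVALAV
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow the listed protein sequence and GC content to be compared against the nucleotide data.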