Gene Information |
Locus tag | Cthe_0375 |
Symbol | guaA |
ID | 4808452 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 471732 |
End bp | 473267 |
Gene Length | 1536 bp |
Protein Length | 511 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640105789 |
Product | GMP synthase |
Protein accession | YP_001036806 |
Protein GI | 125972896 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0518] GMP synthase - Glutamine amidotransferase domain; [COG0519] GMP synthase, PP-ATPase domain/subunit |
TIGRFAM ID | [TIGR00884] GMP synthase (glutamine-hydrolyzing), C-terminal domain or B subunit; [TIGR00888] GMP synthase (glutamine-hydrolyzing), N-terminal domain or A subunit |
|
|
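The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive and that the span includes the stop codon (the gene sequence in this record ends in TAG). The function name `cds_lengths` is hypothetical and used only for illustration.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a span that ends with a stop codon.
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    gene_length = end_bp - start_bp + 1      # inclusive range
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1    # drop the stop codon
    return gene_length, protein_length


# Coordinates taken from this record (Cthe_0375, strand +)
gene_len, protein_len = cds_lengths(471732, 473267)
print(gene_len, protein_len)  # 1536 511
```
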
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00216851 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAATAATG AATTGATTCT CGTTCTCGAC TTTGGAGGAC AGTATAATCA GCTTATTGCA AGAAGAGTCA GAGAGGCAAA TGTGTATTGC GAGGTTTTAC CCTACAACTC TTCTATTGAT AAAATAAAAT CAAAGAATCC GAAAGGGATT ATTTTTACGG GAGGTCCTGC ATCTGTACTG GATCCTAAAG CTCCAATCTG TGACAGGGAA GTTTTTGAGC TGGGGATTCC CATTTTGGGT ATCTGCTACG GTATGCAGCT GATGAGCCAT ATGCTTGGGG GAACTGTGGA AAAAGCAGAG CAGCGTGAGT ACGGAAAGGT AAATATTACA TTTGATACCT CAAGCATGCT CTTTGAAGGT ATTGAAAAGG AATCAACCTG CTGGATGAGT CATACCTATT ATGTCAATAA CCTTCCCGAA GGTTTTGTAA AATGTGCGGA TACTCCCAAT TGTCCTGTGG CGGCAATAGA AAACAGGGAG AAGAAATTGT ACGGTGTTCA ATTCCATCCG GAAGTGGTTC ATACTCCCAA GGGAAGAGAT ATACTAAACA ACTTCCTCTA CAAAATCTGC GGATGTTCCG GAGACTGGAA GATGGCTTCC TTTATAGAGC ATTCCATAAA CAGTATCAGG GAAAAAGTCG GAGATAAAAA GGTTTTATGC GCTCTTTCCG GAGGCGTTGA CTCTTCGGTG GCGGCGGTTC TGGTGCACAA AGCCGTTGGA AAGCAGCTTA CATGTATTTT TGTTGACCAT GGTCTTCTCA GAAAGTACGA AGGAGATCAG GTTGAGGAAG TGTTCAAAAA GCAGTTTGAC ATTAGTTTAA TAAGGGTTAA TGCGGAAGAC AGATTTTTGG AAAAACTTAA AGGAGTGACA GACCCCGAAA GAAAGAGAAA AATTATCGGG GAAGAGTTTA TCCGGGTGTT TGAAGAGGAA GCAAAGAAAA TCGGAACAGT TGACTTTCTG GTACAGGGAA CCATTTATCC GGATGTGATT GAAAGCGGAG TGGGAGATGC CGCTGTTATA AAGAGCCATC ACAATGTGGG CGGTCTTCCT GACTATATTG ATTTCAAAGA GATAATTGAA CCTCTCCGAA GCCTCTTTAA GGATGAGGTA AGGAAAGTGG GAATAGAGCT TGGAATACCC GAAGACATTG TTATGAGGCA GCCGTTCCCG GGACCTGGAC TGGCTGTCAG GGTTATCGGC GAGGTTACAA AGGAAAAAGT GGATATACTG AGGGATGCTG ATTATATTTT CAGAGAGGAA ATAAAAAATG CAGGACTGGA CAGGGAAATC AACCAGTACT TTGCAGTGCT CACGGGTATG AGAAGCGTCG GAGTCATGGG GGATGAAAGG ACTTATGACT ATACCCTTGC CCTAAGAGCG GTCACTACCA TCGACTTTAT GACTGCCGAC TGGGCAAAGA TTCCGTATGA TGTACTTGAA AAGGTATCGA ACAGAATTGT AAATGAAGTA AAGCACATTA ACAGGATAGT ATACGATATA ACAACCAAGC CGCCGGCAAC CATTGAGTGG GAGTAG
|
Protein sequence | MNNELILVLD FGGQYNQLIA RRVREANVYC EVLPYNSSID KIKSKNPKGI IFTGGPASVL DPKAPICDRE VFELGIPILG ICYGMQLMSH MLGGTVEKAE QREYGKVNIT FDTSSMLFEG IEKESTCWMS HTYYVNNLPE GFVKCADTPN CPVAAIENRE KKLYGVQFHP EVVHTPKGRD ILNNFLYKIC GCSGDWKMAS FIEHSINSIR EKVGDKKVLC ALSGGVDSSV AAVLVHKAVG KQLTCIFVDH GLLRKYEGDQ VEEVFKKQFD ISLIRVNAED RFLEKLKGVT DPERKRKIIG EEFIRVFEEE AKKIGTVDFL VQGTIYPDVI ESGVGDAAVI KSHHNVGGLP DYIDFKEIIE PLRSLFKDEV RKVGIELGIP EDIVMRQPFP GPGLAVRVIG EVTKEKVDIL RDADYIFREE IKNAGLDREI NQYFAVLTGM RSVGVMGDER TYDYTLALRA VTTIDFMTAD WAKIPYDVLE KVSNRIVNEV KHINRIVYDI TTKPPATIEW E
|
| |
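The gene sequence starts with TTG while the protein sequence starts with M, which follows from translation table 11: TTG is one of its alternative start codons and is read as methionine at the first position. The following is a minimal sketch of that translation and of a GC-content calculation, not the method used to build this record. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and the start-codon set is the one listed for NCBI table 11 as far as I am aware.

```python
from itertools import product

# Codon table in the usual T, C, A, G ordering. Table 11 shares the
# standard amino acid assignments and differs in its allowed start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Assumed start codons for translation table 11 (bacterial, archaeal, plastid)
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces used to group bases in blocks of ten and uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS, reading an alternative start codon as M and
    stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        codon = s[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is decoded as methionine
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record
print(translate("TTGAATAATG AATTGATTCT CGTTCTCGAC"))  # MNNELILVLD
```

Passing the full Gene sequence field to `translate` and `gc_percent` would allow a comparison against the Protein sequence and GC content fields of this record.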