Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0081 |
Symbol | |
ID | 4808776 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 111423 |
End bp | 112628 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640105490 |
Product | N-acetylglutamate synthase / glutamate N-acetyltransferase |
Protein accession | YP_001036515 |
Protein GI | 125972605 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1364] N-acetylglutamate synthase (N-acetylornithine aminotransferase) |
TIGRFAM ID | [TIGR00120] glutamate N-acetyltransferase/amino-acid acetyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAAACA TAATAGATGG AGGAGTTACG GCCCCAAAAG GGTTTAAAGC CGCAGGAGTC GCCTGCGGGC TTAAAAACAA CCAAAAAAAG GACATCGCAG TTGTTTGCTC CAGTGACTTG GCAGTTGCCG CCGGAGTATT TACAAAAAAT GTTGTAAAAG GACATTCGCT CCAGCTTACC ATGCAGCATA TAAAAAGCGG CCATGCCCGG GCATTGGTCA TAAACAGCGG TAATGCCAAT GCCTGTCTCG GAGAACAGGG CTACAAAGAT GCGGAGGAAA TGGCTTCTCT TGCCGCTCAG CTTCTAAATT GTGATGCCAA AAACGTCCTT GTCGGTTCCA CGGGAGTTAT CGGAATGCCG CTTGACATGC CGAAGGTGCG TTCCGGTATA AAGGAGGCAA TTTCAAAACT TTCCGAAGAA GGCGGTCACG ATGCGGCTGA GGCTATTATG ACCACAGACC TTGTTTTAAA GGAAATTGCC GTGGAATTTG AAATTCAGGG GCAAAAAGTA AGAATGGGAG CCATGGCAAA AGGCTCAGGA ATGATACATC CCAATATGGC AACAATGATA GGAGTCATAA CAACGGATGC AAATATTTCC AGAGAACTGC TGGACAAAGC GCTCAAAGAT GTAATATCCC ATACTTTCAA CCGGGTATCG GTTGACGGAG ACACCAGTGT TTGCGACATG GTTGTCATCC TTGCCAACGG AAAAGCAAAC AATGAAAATA TTGTCAAGGA GGATATTGAC TATTCCACTT TCAAATCCGC CCTTGAATAC GTCTGTACAC ACCTTTCCAA AATGATAGCA AAAGACGGAG AAGGGGCGAC CAAGCTTATT GAAGTTGTCG CCGAAGGTGC AAAAAGTGCT GAAGATGCTT ACAAAGCAGT AAGCGCAATT GCCAAATCCC CCCTTGTAAA AACAGCCATT TTCGGTGAGG ATGCAAACTG GGGAAGAATC ATAACGGCTG TCGGTTATTC CGGTGCGGAT TTTGACCCCA ATCTGGTTGA CATATACATC GGAGACCTTT TGGTATGCAA AAGCGGCGCC GCATTAAACT TTGACGAGGA AAAGGCAAAA GAAATACTTA AAGAAGATGA AGTCAGAATA AAAGTTGACT TTAACCAGGG AACCGCATCC GACAGAATCT GGACCTGTGA TTTTTCATAT GACTATGTAA AAATAAACGG AAGTTACAGA TCCTAA
|
Protein sequence | MINIIDGGVT APKGFKAAGV ACGLKNNQKK DIAVVCSSDL AVAAGVFTKN VVKGHSLQLT MQHIKSGHAR ALVINSGNAN ACLGEQGYKD AEEMASLAAQ LLNCDAKNVL VGSTGVIGMP LDMPKVRSGI KEAISKLSEE GGHDAAEAIM TTDLVLKEIA VEFEIQGQKV RMGAMAKGSG MIHPNMATMI GVITTDANIS RELLDKALKD VISHTFNRVS VDGDTSVCDM VVILANGKAN NENIVKEDID YSTFKSALEY VCTHLSKMIA KDGEGATKLI EVVAEGAKSA EDAYKAVSAI AKSPLVKTAI FGEDANWGRI ITAVGYSGAD FDPNLVDIYI GDLLVCKSGA ALNFDEEKAK EILKEDEVRI KVDFNQGTAS DRIWTCDFSY DYVKINGSYR S
|
| |