Gene Cthe_0081 details

Gene Information

Locus tag: Cthe_0081
Symbol:
ID: 4808776
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 111423
End bp: 112628
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 45%
IMG OID: 640105490
Product: N-acetylglutamate synthase / glutamate N-acetyltransferase
Protein accession: YP_001036515
Protein GI: 125972605
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1364] N-acetylglutamate synthase (N-acetylornithine aminotransferase)
TIGRFAM ID: [TIGR00120] glutamate N-acetyltransferase/amino-acid acetyltransferase
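
The length fields above follow from the coordinates. As a minimal sketch, assuming Start bp and End bp are 1-based inclusive positions (the reported 1206 bp is consistent with that) and that the CDS ends in a single stop codon, the arithmetic can be checked as follows. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(111423, 112628)
print(length_bp, protein_length(length_bp))  # 1206 401
```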


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATAAACA TAATAGATGG AGGAGTTACG GCCCCAAAAG GGTTTAAAGC CGCAGGAGTC 
GCCTGCGGGC TTAAAAACAA CCAAAAAAAG GACATCGCAG TTGTTTGCTC CAGTGACTTG
GCAGTTGCCG CCGGAGTATT TACAAAAAAT GTTGTAAAAG GACATTCGCT CCAGCTTACC
ATGCAGCATA TAAAAAGCGG CCATGCCCGG GCATTGGTCA TAAACAGCGG TAATGCCAAT
GCCTGTCTCG GAGAACAGGG CTACAAAGAT GCGGAGGAAA TGGCTTCTCT TGCCGCTCAG
CTTCTAAATT GTGATGCCAA AAACGTCCTT GTCGGTTCCA CGGGAGTTAT CGGAATGCCG
CTTGACATGC CGAAGGTGCG TTCCGGTATA AAGGAGGCAA TTTCAAAACT TTCCGAAGAA
GGCGGTCACG ATGCGGCTGA GGCTATTATG ACCACAGACC TTGTTTTAAA GGAAATTGCC
GTGGAATTTG AAATTCAGGG GCAAAAAGTA AGAATGGGAG CCATGGCAAA AGGCTCAGGA
ATGATACATC CCAATATGGC AACAATGATA GGAGTCATAA CAACGGATGC AAATATTTCC
AGAGAACTGC TGGACAAAGC GCTCAAAGAT GTAATATCCC ATACTTTCAA CCGGGTATCG
GTTGACGGAG ACACCAGTGT TTGCGACATG GTTGTCATCC TTGCCAACGG AAAAGCAAAC
AATGAAAATA TTGTCAAGGA GGATATTGAC TATTCCACTT TCAAATCCGC CCTTGAATAC
GTCTGTACAC ACCTTTCCAA AATGATAGCA AAAGACGGAG AAGGGGCGAC CAAGCTTATT
GAAGTTGTCG CCGAAGGTGC AAAAAGTGCT GAAGATGCTT ACAAAGCAGT AAGCGCAATT
GCCAAATCCC CCCTTGTAAA AACAGCCATT TTCGGTGAGG ATGCAAACTG GGGAAGAATC
ATAACGGCTG TCGGTTATTC CGGTGCGGAT TTTGACCCCA ATCTGGTTGA CATATACATC
GGAGACCTTT TGGTATGCAA AAGCGGCGCC GCATTAAACT TTGACGAGGA AAAGGCAAAA
GAAATACTTA AAGAAGATGA AGTCAGAATA AAAGTTGACT TTAACCAGGG AACCGCATCC
GACAGAATCT GGACCTGTGA TTTTTCATAT GACTATGTAA AAATAAACGG AAGTTACAGA
TCCTAA
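
The gene sequence is displayed in blocks of 10 bases, so the spaces and line breaks have to be removed before any computation. The following is a minimal sketch of how a GC percentage could be computed from such text; the helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first display row is used as input here.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First display row of the gene sequence; paste all rows to process the full gene.
first_row = "ATGATAAACA TAATAGATGG AGGAGTTACG GCCCCAAAAG GGTTTAAAGC CGCAGGAGTC"
seq = clean_sequence(first_row)
print(len(seq), round(gc_percent(seq), 1))
```

Applied to the full 1206 bp sequence, the result can be compared with the GC content reported in the Gene Information section.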
 
Protein sequence
MINIIDGGVT APKGFKAAGV ACGLKNNQKK DIAVVCSSDL AVAAGVFTKN VVKGHSLQLT 
MQHIKSGHAR ALVINSGNAN ACLGEQGYKD AEEMASLAAQ LLNCDAKNVL VGSTGVIGMP
LDMPKVRSGI KEAISKLSEE GGHDAAEAIM TTDLVLKEIA VEFEIQGQKV RMGAMAKGSG
MIHPNMATMI GVITTDANIS RELLDKALKD VISHTFNRVS VDGDTSVCDM VVILANGKAN
NENIVKEDID YSTFKSALEY VCTHLSKMIA KDGEGATKLI EVVAEGAKSA EDAYKAVSAI
AKSPLVKTAI FGEDANWGRI ITAVGYSGAD FDPNLVDIYI GDLLVCKSGA ALNFDEEKAK
EILKEDEVRI KVDFNQGTAS DRIWTCDFSY DYVKINGSYR S
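
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below assumes the standard codon assignments, which translation table 11 shares (table 11 differs in the start codons it permits, which does not matter here because this gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative, and unknown codons are mapped to `X` as a simplifying assumption.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna: str) -> str:
    """Translate a DNA string (display spacing allowed) up to the first stop codon."""
    dna = "".join(blocked_dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # "X" for codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, then its final 6 bases.
print(translate("ATGATAAACA TAATAGATGG AGGAGTTACG"))  # MINIIDGGVT
print(translate("TCCTAA"))  # S
```

The two outputs match the first ten residues and the final residue of the protein sequence above.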