Gene Cthe_1863 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1863 
Symbol 
ID4809414 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2208749 
End bp2209789 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content42% 
IMG OID640107282 
ProductN-acetyl-gamma-glutamyl-phosphate reductase 
Protein accessionYP_001038277 
Protein GI125974367 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0002] Acetylglutamate semialdehyde dehydrogenase 
TIGRFAM ID[TIGR01850] N-acetyl-gamma-glutamyl-phosphate reductase, common form 


Plasmid Coverage information

Num covering plasmid clones45 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAGTG TAGGAATTAT AGGAGCTACC GGTTATGTTG GAACAGAAAT TGTTCGACTT 
CTTCAAAATC ATCCGGATAT AAACATTACT TCTGTCGTAT CCCACAACTT TGCGGGGCAG
AAGATATCGG ACATATACCC AAATCTTAAG AATGTTTTTG AAATGGAATG CGATGAGCTT
GATATAGATA AAATTGCCGA CAAAGCTGAA GTGTTTGTCA CTGCGCTTCC TCACGGCATA
TCAAAGGAAG TGATACCCAA GCTTGTTGAA AAAGGTAAAA GAATAGTTGA CCACAGCGGC
GATTTTCGCT ACAAGTCTGT TGAAGTGTAT GAAAAATGGT ACAACGCTAC CCATGGAATG
CCGCATCTTT TGAAACTTTC GGCATATGGT CTGCCTGAGC TTCACAGAGA AGAAATAAAA
AATGCACAGA TAATAGGCAA TCCCGGCTGT TATCCGACTT GTTCGATACT GGCGCTGGCT
CCGTTAGTCA AAAACAGACT TGTTGACACA AAAAATATCA TAATTGACGC AGCTTCCGGA
GTTTCGGGAG CCGGAAGAAA AACCGATCTT CCCTACCAGT TCTGCGAGTG TGACGAAAAT
TTCAAAGCAT ACAGTGTTTC AAACCACAGG CATACCTCTG AAATTGAGCA GGAGCTCTCT
CTTTTGGCAG AAGAGGAAAT TACCGTTTCG TTCACTCCTC ATCTTGTACC AATGAAAAGA
GGAATGCTTG CAACCATTTA TGCAAATTTG AACTGTGAAA AATCAACATC GGAATTAATT
GAGCTGTATA AGGAATATTA TAAAAATGAA TATTTTGTGA GGATACTGGA TGAAGGCAAA
CTTCCTGAAA CCAAATTTGT AGCCGGATCA AACTTTATTG ACATCGGTCT TGTTGTGGAT
AAGCGTTTAA ACAGGGTTGT CATCCTCTCT GCCATTGACA ATTTGGGCAA AGGTGCTGCA
GGTCAAGCCG TCCAGGTTCT CAATATATTG TTCGGGCTTC CCGAGCACAG AGGTCTGACC
AATCCCGGTT TCTACCTATA A
 
Protein sequence
MASVGIIGAT GYVGTEIVRL LQNHPDINIT SVVSHNFAGQ KISDIYPNLK NVFEMECDEL 
DIDKIADKAE VFVTALPHGI SKEVIPKLVE KGKRIVDHSG DFRYKSVEVY EKWYNATHGM
PHLLKLSAYG LPELHREEIK NAQIIGNPGC YPTCSILALA PLVKNRLVDT KNIIIDAASG
VSGAGRKTDL PYQFCECDEN FKAYSVSNHR HTSEIEQELS LLAEEEITVS FTPHLVPMKR
GMLATIYANL NCEKSTSELI ELYKEYYKNE YFVRILDEGK LPETKFVAGS NFIDIGLVVD
KRLNRVVILS AIDNLGKGAA GQAVQVLNIL FGLPEHRGLT NPGFYL