Gene Moth_2290 details

Gene Information

Locus tag: Moth_2290
Symbol:
ID: 3831322
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 2402588
End bp: 2403628
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 61%
IMG OID: 637830210
Product: N-acetyl-gamma-glutamyl-phosphate reductase
Protein accession: YP_431120
Protein GI: 83591111
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0002] Acetylglutamate semialdehyde dehydrogenase
TIGRFAM ID: [TIGR01850] N-acetyl-gamma-glutamyl-phosphate reductase, common form
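
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 2402588
end_bp = 2403628

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the gene length includes the terminating stop codon,
# which does not contribute a residue to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # compare with the Gene Length field
print(protein_length, "aa")   # compare with the Protein Length field
```

Under these assumptions the computed values agree with the listed Gene Length (1041 bp) and Protein Length (346 aa).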


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.587268
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGATTAAAG CCGGAATTAT CGGTGCTACC GGTTACACGG GAGCCGAACT GGTCCGCATT 
TTGAGCCGGC ACCCGGAAGT AGAGCTGGTA GCCCTTACCT CACGCAGTTA CGCCGGGGAA
GGGATGGCCG GCGTTTACCC GTCCCTTACC GGCTATACCA ACCTCACCTG TGAGAATTTG
ACTCCCGATG AGGTTATGGA CCGGGCGGAA GTTATCTTTA TCGCCCTGCC CCACGGCCAC
GCCGTCCCGG TAGCCACCCG AGCCAGGGAA CGGGGGATCA AAGTAATTGA CCTGGGCGCC
GACTGGCGCT TCCGTAACGC CAGGACTTAC GAAGAATGGT ATAAAATCCA GCACGGCAAC
CACGAGCTGG CGGCCCGGGC CGTCTACGGG CTGCCGGAGA TTCACCGGGA GGCCATCCGT
AGCGCCGGCC TGGTGGCCAA TCCCGGTTGT TACCCCACCA GCGCCATCCT GGGCCTGGCT
CCCCTGCTTA AGGGGGGGTA CATTGACCCG GCGACCATCA TAATCGACGC CAAGTCAGGG
GTTTCCGGGG CCGGCCGGGA GGCCAGGGTT ACCAGCCTCT TTGTTGAGTG CAACGAAAGC
ATTAATCCCT ACGGCGTCGC CAGTCACCGT CATACCCCGG AGATCGAACA GGAACTCAGC
GCCCTGGCCG GCAAAGAGGT TAAAGTAACC TTTACCCCCC ACCTGCTTCC CATCAGCAGG
GGGATCTTGA GTACCATGTA CGCCACCCTG GTACGGCCGG CATCGACGGA GGAACTGCGA
AGGGTATATG AAAAATTTTA TGCCGGTGAG CCCTTCGTCC ACCTCCTACC CCCCGGCCAG
TGGCCCCACA CCCGCTGGGT ATATGGCAGC AACAACTGCC ACCTTAATCT CGCCGTAGAT
ACCCGCACCG GCCGGGTGGT GGTGGCCAGC GCCATCGACA ACCTGACCAA AGGCGCTTCC
GGCCAGGCGG TGCAGAACCT CAACCTTATG TGCGGTTTCC CGGAGACCAT GGCCCTGGAA
GTACCAGGAT TGTGTCCATA A
 
Protein sequence
MIKAGIIGAT GYTGAELVRI LSRHPEVELV ALTSRSYAGE GMAGVYPSLT GYTNLTCENL 
TPDEVMDRAE VIFIALPHGH AVPVATRARE RGIKVIDLGA DWRFRNARTY EEWYKIQHGN
HELAARAVYG LPEIHREAIR SAGLVANPGC YPTSAILGLA PLLKGGYIDP ATIIIDAKSG
VSGAGREARV TSLFVECNES INPYGVASHR HTPEIEQELS ALAGKEVKVT FTPHLLPISR
GILSTMYATL VRPASTEELR RVYEKFYAGE PFVHLLPPGQ WPHTRWVYGS NNCHLNLAVD
TRTGRVVVAS AIDNLTKGAS GQAVQNLNLM CGFPETMALE VPGLCP
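
The gene sequence begins with TTG rather than ATG, while the protein sequence begins with M. This is consistent with translation table 11, which permits alternative initiation codons that are read as methionine when they start a coding sequence. The sketch below illustrates that relationship on the first and last rows of the gene sequence. It is a hand-rolled illustration, not the database's own method: the `translate` helper and its names are hypothetical, and it assumes the input is in frame and begins at the start codon when one is present.

```python
from itertools import product

# Codon assignments in the conventional T, C, A, G ordering. Table 11 uses
# the same assignments as the standard code; it differs in its start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Hypothetical helper: whitespace is ignored, and a permitted start codon
    in the first position is read as methionine.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence, copied from the record.
first_row = "TTGATTAAAG CCGGAATTAT CGGTGCTACC GGTTACACGG GAGCCGAACT GGTCCGCATT"
last_row = "GTACCAGGAT TGTGTCCATA A"

print(translate(first_row))
print(translate(last_row))
```

The last row starts at base 1021, which falls on a codon boundary, so it can be translated on its own. Note that the internal TTG in that row is read as leucine, since only the first codon of a coding sequence is treated as an initiator.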