Gene Moth_1764 details

Gene Information

Locus tag: Moth_1764
Symbol:
ID: 3831056
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1819191
End bp: 1820519
Gene Length: 1329 bp
Protein Length: 442 aa
Translation table: 11
GC content: 57%
IMG OID: 637829689
Product: L-glutamine synthetase
Protein accession: YP_430608
Protein GI: 83590599
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0174] Glutamine synthetase
TIGRFAM ID: [TIGR00653] glutamine synthetase, type I
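
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic, assuming the coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon. Neither assumption is stated in the record, and the variable names are illustrative only.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 1819191
end_bp = 1820519

# Inclusive coordinates: both endpoints count toward the length.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon that is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 1329 bp
print(protein_length, "aa")  # 442 aa
```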


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.705801
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGGATA GCCGGGAAGC GATTTTACAG AAAGCTACCG ACCTGGGAGT CAAGTTTATT 
CGCCTCCAGT TTACCGATAT CTTTGGGGTG CTGAAAAATG TCGCCATCAC CCTGGACCAG
CTTCCCAAAG CCCTCAATAA CGAGTTGATG TTCGACGGGT CCTCTATTGA GGGTTTCGTC
CGGATTGAAG AATCTGATAT GTACCTGCGC CCGGACCCAT CCACCTTTAC CATCTTTCCC
TGGAAGCCCA ATGGCGACGC CGTGGCGCGG TTGATCTGCG ACGTTTATAA TGCCGACGGC
ACGCCTTTTA TCGGGTGCCC ACGCGGGACC TTAAAGCGGG TTATCGCTGA AGCCGAGGCC
CTGGGTTATA CCATGAATGT CGGCCCGGAG GCGGAGTTTT TCCTCTTCCA TACCGACGCC
ACCGGCCGGC CGACCCTGGA AACCCACGAT CGGGCCGGTT ATTTTGACCT GACCCCGGTA
GACCTGGGGG AAGACGCCCG GAGGGATATG GTCCTGACCC TGCAGCAGAT GGGGTTTGAA
ATCGAGGCCT CCCACCATGA GGTGGCTCCC GGCCAGCATG AGATAGATTT TAAATACGCC
GACGCCTTAA GGACGGCCGA CAATATCGCT ACTTTCAAGT TTGTGGTGCG GACCATCGCC
CAGCGCCACG GCCTCCATGC CACCTTTATG CCCAAGCCCA TTTACGGCAT TGCCGGTTCG
GGCATGCACT GCAATATCTC TCTTTTCCGC GCCGGCCAAA ATGCCTTTTA CGACCCTGAC
GACGATCTCC AGCTGAGCCA GGTTGCCTAC TACTTTATCG GCGGCCTGAT AGCCCACGCC
CGGGGCATGA CGGCAATTAC CAATCCCACT GTTAACTCAT ACAAGCGTCT GGTACCTGGT
TACGAAGCCC CGGTGTATAT AGCCTGGTCA CCTCAAAACC GTAGCCCCCT GATCCGTATC
CCCGCCAAGC GGGGCCTGTC AACGCGGCTG GAGGTACGCC ATCCCGACCC CTCGACCAAC
CCCTACCTGG CCATTGCCGT CATGTTGAAG GCCGGCCTGG ACGGCATTAA GAACCGTATC
CAACCGCCGG CGCCCATCAA TGCCAATATT TTTGACATGG AAGCGACTAG ACTACGGGCC
GAAGGTATCG ACCTCCTGCC GGGCACCCTG GATGAAGCCC TGGATGCATT AGAAAAAGAC
CCCGTTATCC GCGAGGCCCT GGGGCCCCAT ATCTACCAGC GGTTCATGGA GGCGAAACGC
ATCGAATGCG AGGAGTACCG CACCCGGGTG CACCAGTGGG AAATCGAACA TTACCTGACT
AAGTTCTAA
 
Protein sequence
MVDSREAILQ KATDLGVKFI RLQFTDIFGV LKNVAITLDQ LPKALNNELM FDGSSIEGFV 
RIEESDMYLR PDPSTFTIFP WKPNGDAVAR LICDVYNADG TPFIGCPRGT LKRVIAEAEA
LGYTMNVGPE AEFFLFHTDA TGRPTLETHD RAGYFDLTPV DLGEDARRDM VLTLQQMGFE
IEASHHEVAP GQHEIDFKYA DALRTADNIA TFKFVVRTIA QRHGLHATFM PKPIYGIAGS
GMHCNISLFR AGQNAFYDPD DDLQLSQVAY YFIGGLIAHA RGMTAITNPT VNSYKRLVPG
YEAPVYIAWS PQNRSPLIRI PAKRGLSTRL EVRHPDPSTN PYLAIAVMLK AGLDGIKNRI
QPPAPINANI FDMEATRLRA EGIDLLPGTL DEALDALEKD PVIREALGPH IYQRFMEAKR
IECEEYRTRV HQWEIEHYLT KF
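
As a rough cross-check of the two sequences above, the gene sequence can be translated codon by codon and compared with the protein sequence. The sketch below is a minimal illustration, not the method the database uses. It relies on NCBI translation table 11 sharing its codon-to-amino-acid assignments with the standard code (the two differ only in permitted start codons), and it does not handle alternative start codons such as GTG or TTG, which is harmless here because the gene begins with ATG. The function names `clean`, `translate`, and `gc_percent` are hypothetical helpers.

```python
# Compact codon table in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Drop the spaces and line breaks used in the display and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three 10-base groups of the gene sequence, and its final groups.
first_groups = "ATGGTGGATA GCCGGGAAGC GATTTTACAG"
last_groups = "CACCAGTGGG AAATCGAACA TTACCTGACT AAGTTCTAA"

print(translate(first_groups))  # MVDSREAILQ
print(translate(last_groups))   # HQWEIEHYLTKF
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way would allow a comparison with the protein sequence and the GC content reported in the record.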