Gene Moth_2299 details


Gene Information

Locus tag: Moth_2299
Symbol:
ID: 3831331
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 2413292
End bp: 2414719
Gene Length: 1428 bp
Protein Length: 475 aa
Translation table: 11
GC content: 55%
IMG OID: 637830219
Product: L-glutamine synthetase
Protein accession: YP_431129
Protein GI: 83591120
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0174] Glutamine synthetase
TIGRFAM ID: [TIGR00653] glutamine synthetase, type I
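
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes inclusive 1-based coordinates and a coding sequence that ends in a single stop codon (which yields no amino acid).

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2413292
END_BP = 2414719
GENE_LENGTH_BP = 1428
PROTEIN_LENGTH_AA = 475


def span_length(start: int, end: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    return end - start + 1


def coding_protein_length(gene_length_bp: int) -> int:
    """Expected protein length for a CDS whose last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per amino acid, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print("span matches gene length:",
          span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print("codons minus stop match protein length:",
          coding_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, both checks agree with the values listed in the record.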


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.00444261
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTAAGA CACCAGGCGA AGTACTTGAG ATGGCCCGAA AGAACAATAT CCAGATGGTC 
GATCTCAAAT TTATCGACCT GCCAGGGACC TGGCAGCACT TCAGTGTGCC CTTAGACGAA
TTCGATGAAG GCGCTTTTAC CAGCGGCGTG GGTTTTGACG GCTCCAGCAT TCGCGGTTTT
AAAACCATTA ACGAGAGCGA TATGATCTTA GTTCCCGACC CCGATACCGC CTTTATCGAT
CCCTTCTGTG ATGTGCCCAC CCTCAGCCTG GTATGCAATG TCTACGACCC TATGACCGGG
CAGAATTACA ACCGCGATCC CCGGTTTGTG GCCAAAAAGG CTGAAGCCTA TCTAAAGGAG
ACCGGCATTG CCGATACCAG TTACTGGGGA CCCGAGGCCG AGTTCTTTAT CCTCGATCAT
GTCCGCTTTG ACCAGAGCCA GTACGCTGGT TACTATTTCC TGGATTCTGA TGAAGGCTTC
TGGAACTCGG GCGTGGAAAT GAACGGTCAT CCCAACCTCG GCTACCGGCC GCGCTATAAA
GAAGGTTATT TCCCGGTACC ACCTACCGAC ACCCTGCAAA ACTTGCGGAC GGAGATGGTC
CTGCTGCTGA AGCAGATGGG CATCGCCGTG GAAGCCCATC ACCACGAGGT GGCTACTGCC
GGCCAGGGCG AGATCGATAT GAAGTACGCC CCCCTGACCC GGATGGCCGA CCAGCTGATG
ATGTTCAAAT ATGTGGTCAA GAATGTGGCC ATCAAGCATA ATAAGACGGC TACCTTCATG
CCCAAACCGG TTTTCCAGGA CAACGGCTCG GGCATGCACG TCCACCAGAG CCTGTGGAAG
GGTAACGAAC CCCTCTTCTA CGATGCCAAT GGTTATGCGG GGCTTAGCGA ACTGGCCCTG
TATTATATCG GCGGTTTGTT AAAACACGCC CCCGTCCTGA CGGCCTTCTG CAGCCCGACC
ACCAACTCCT TCAAACGATT GGTGCCCGGC TTTGAAGCCC CCGTCAACCT GGTTTACTCC
CAGCGTAACC GCAGCGCGGC CATCCGCATT CCCATGTACT CCAGCAGCCC GGCCGCCAAG
AGGATCGAAT ACCGGCCGCC CGATCCTTCC TGCAACCCGT ACCTGGCCTT TGCCGCCCTG
TTAATGGCCG GCCTGGACGG TATTAAGAAC AAGATCCACC CCGGCGAGCC CCTGGATAAA
GATATCTATG ACCTGCCGCC GGAAGAAGCG GCCAGGGTGA AGTCCCTGCC CGACTCCCTG
GAGGAAGCCA TCGCCGCCCT GGAAAAAGAC CACGAGTTCC TGCTTCAGGG AGGCGTCTTC
GACGAGGACC TCATCAACGC CTGGATTGAA TACAAGCAAA AGCGGGAGAT CAACCAGATC
AAGCTGCGTC CCCATCCCTA TGAGTTCGTT TTGTATTATG ATGTTTAA
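
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before its length or GC content can be computed. The following is a minimal sketch with hypothetical helper names. How the database rounds its reported GC percentage is not stated in this record, so the result of this calculation may differ slightly from the listed value.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence above; paste the full block to
    # evaluate the whole gene.
    raw = "ATGAGTAAGA CACCAGGCGA AGTACTTGAG"
    seq = clean_sequence(raw)
    print(len(seq), "bp, GC fraction:", round(gc_fraction(seq), 3))
```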
 
Protein sequence
MSKTPGEVLE MARKNNIQMV DLKFIDLPGT WQHFSVPLDE FDEGAFTSGV GFDGSSIRGF 
KTINESDMIL VPDPDTAFID PFCDVPTLSL VCNVYDPMTG QNYNRDPRFV AKKAEAYLKE
TGIADTSYWG PEAEFFILDH VRFDQSQYAG YYFLDSDEGF WNSGVEMNGH PNLGYRPRYK
EGYFPVPPTD TLQNLRTEMV LLLKQMGIAV EAHHHEVATA GQGEIDMKYA PLTRMADQLM
MFKYVVKNVA IKHNKTATFM PKPVFQDNGS GMHVHQSLWK GNEPLFYDAN GYAGLSELAL
YYIGGLLKHA PVLTAFCSPT TNSFKRLVPG FEAPVNLVYS QRNRSAAIRI PMYSSSPAAK
RIEYRPPDPS CNPYLAFAAL LMAGLDGIKN KIHPGEPLDK DIYDLPPEEA ARVKSLPDSL
EEAIAALEKD HEFLLQGGVF DEDLINAWIE YKQKREINQI KLRPHPYEFV LYYDV