Gene Moth_2245 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2245 
Symbol 
ID3831291 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2347311 
End bp2349131 
Gene Length1821 bp 
Protein Length606 aa 
Translation table11 
GC content62% 
IMG OID637830165 
Productglutamine--fructose-6-phosphate transaminase 
Protein accessionYP_431075 
Protein GI83591066 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0449] Glucosamine 6-phosphate synthetase, contains amidotransferase and phosphosugar isomerase domains 
TIGRFAM ID[TIGR01135] glucosamine--fructose-6-phosphate aminotransferase (isomerizing) 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGCGGTA TCGTAGGTTA TCTGGGCCCG CGCCCGGCGG TCCCCATATT GGTTCAGGGA 
CTGGAAAGGC TGGAATACCG GGGCTATGAT TCTGCCGGGA TAGCGGTATT GAATGGTGGA
GGATTGGTAG TCGAAAAGAG CGCGGGCAAG CTACATGTCC TGAAGAGCCG TCTCAATGGC
AATCTCGCCG GTGCCCGGGT GGGGATTGGT CACACCCGCT GGGCCACCCA TGGCCGGCCG
TCCGATGTCA ACGCCCATCC CCATCTAGAT TGCACGGGCA GGATTGCCGT CGTCCATAAC
GGGATCATCG AAAACTACCA GGAACTGCGC CAGGAACTGG CGGCTAAAGG CCATCGCTTT
ATATCCGAAA CCGATACAGA AGTCCTGGCC CACCTGGTGG AAGAGTTCTA TACCGGCGAT
CTCCTCCAGG CGGTATTCAA GATGCTGCCC GTTCTTCGAG GCTCCTACGC CCTGGCTGTC
ATGAGCGCCG ATCATCCCCG GGAACTGGTA GGCGCGCGCC AGGACAGCCC CTTAATCGTC
GGCCTGGCTG CAGGGGAAAC CTACCTGGCC TCCGATATCC CGGCCCTGCT GCCTTATACC
CGGGATAATT ACATCCTGGA GAACGGTGAA GTGGCCTGGA TCACCCCCGG GGAAGTGACC
GTTTACGACG CCGACGGCAG CCGCAAGTCC AAGGAGGTCT TCCACGTGGC CTGGGACCTC
CAGGCGGCGG AAAAGGGCGG TTATGCCCAC TTTATGCTGA AAGAGATCCA CGAGCAGCCG
CGGGCCTTAA GGGATACCCT GGCTGGCCGC CTGCAAGATG ACGGGCGGGT ACGCCTGGAA
GGAGTTAACT TCACACCGGA GGAAGCGGCG GCCCTGGAAA AAGTGGCCAT CATTGCCTGC
GGTACGGCCC ACTACGCCGG CATGGTGGGT AAATATCTCC TGGAGAAGCT CCTGCGCCTG
CCGGTGGAAG ACGACGTGGC CTCGGAATTT CGCTACCGGG AACCGATCCT CAACGAGCAT
ACCCTGGGCC TGGTCATCAG CCAATCGGGC GAAACGGCCG ACACCCTGGC CAGCCTGCGG
GAGGCTAAAA AAGCCGGGGC ACCGGTGCTG GCCATTACCA ACGTCGTCGG CAGTTCGGTA
GCCCGGGAAG CCGATCACGT CATCTATACC TGGGCCGGCC CGGAGATTGC CGTGGCGTCC
ACTAAAGCGT ATCTCACCCA GGTAGCATCG CTGTACCTCC TGGCCCTGCA CCTGGGCGAG
AAAAGGAGCC ATGCTCCCTG GGCGCAGGAA ATCGCCGGGG GCCTTAAGGA CCTGCCCGGC
CAGGTGGAAG AAGTTTTAAA GCTGGAACCA CGGATAAAGG AACTGGCCGG CCGGATAGCA
CCCCACGAGC ACGCCTTCTT TATCGGCCGC GGCCTGGATT ACCCCGTCTC CCTGGAGGGA
TCCCTGAAGC TAAAGGAGAT CTCCTACCTG CACGCCGAGG CTTATGCGGC CGGGGAACTG
AAGCACGGCA CCCTGGCATT GATCGAAGAG GGTACGCCCG TTATCGCCCT GGCCACCCAG
GCCGAGCTCC TAGAGAAGAT GCTCAGCAAC ATCAAGGAAG TCAAGGCCCG GGGCGCCTGG
GTCCTGGCCC TCACCCAGGA GGGTAACACA GCCGTGGCCG AAGAGGCCGA CGCCGTCCTC
TACCTGCCGC CGGTACCATC TATCCTGGCG CCGGCGGTGA CGGTGGTGCC CCTGCAACTC
CTGGCCTACT ACACCGCCGT CGCCCGCGGC TGCGACGTCG ACAAACCCCG AAACCTGGCC
AAGAGCGTGA CGGTGGAGTA G
 
Protein sequence
MCGIVGYLGP RPAVPILVQG LERLEYRGYD SAGIAVLNGG GLVVEKSAGK LHVLKSRLNG 
NLAGARVGIG HTRWATHGRP SDVNAHPHLD CTGRIAVVHN GIIENYQELR QELAAKGHRF
ISETDTEVLA HLVEEFYTGD LLQAVFKMLP VLRGSYALAV MSADHPRELV GARQDSPLIV
GLAAGETYLA SDIPALLPYT RDNYILENGE VAWITPGEVT VYDADGSRKS KEVFHVAWDL
QAAEKGGYAH FMLKEIHEQP RALRDTLAGR LQDDGRVRLE GVNFTPEEAA ALEKVAIIAC
GTAHYAGMVG KYLLEKLLRL PVEDDVASEF RYREPILNEH TLGLVISQSG ETADTLASLR
EAKKAGAPVL AITNVVGSSV AREADHVIYT WAGPEIAVAS TKAYLTQVAS LYLLALHLGE
KRSHAPWAQE IAGGLKDLPG QVEEVLKLEP RIKELAGRIA PHEHAFFIGR GLDYPVSLEG
SLKLKEISYL HAEAYAAGEL KHGTLALIEE GTPVIALATQ AELLEKMLSN IKEVKARGAW
VLALTQEGNT AVAEEADAVL YLPPVPSILA PAVTVVPLQL LAYYTAVARG CDVDKPRNLA
KSVTVE