Gene Moth_2047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2047 
Symbol 
ID3831193 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2138044 
End bp2139441 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content63% 
IMG OID637829976 
Productamidophosphoribosyltransferase 
Protein accessionYP_430886 
Protein GI83590877 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0034] Glutamine phosphoribosylpyrophosphate amidotransferase 
TIGRFAM ID[TIGR01134] amidophosphoribosyltransferase 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.330153 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGTCCT GGCACGAGGA GTGCGGTGTC TTTGGCATCT ACGCTCCGGG CCAGGACGTG 
GCCCGGCTGG CCTACTACGG ACTCTTTGCC CTCCAGCACC GCGGCCAGGA GAGCGCTGGT
ATCGCCGTGG CCAACGGCCG CCATATCGCC GTCCACAAGG GTATGGGGCT GGTGGCGGAG
GTCTTTAACC GGGACAACCT TCGGGCTTTA CATGGTGACG TGGCCATCGG CCACGTGCGT
TACTCCACCA CTGGTGCCAG TTCCCTGGTC AACGCCCAGC CCCTGGTCTT CCGCTACCTC
AGGGGCATGG TGGCCATCGC CCATAACGGT AACCTGACCA ACGCCAGCGA GCTGCGGCGG
GAGCTTGGAG CCAGCGGGTC TATCTTCCAG TCCTCCACCG ATAGTGAAAT CATCGTTAAC
CTTATCGCCC GCCACAGCCA GGAGCCCGTC GAAGCAGCCT TGCTCCATTG CCAGGAAGAG
CTTCGCGGTG CTTATTCCCT GGTGGTCATG ACCGAGGAAC AACTCATCGG CGTCCGTGAT
CCCCATGGTG TCCGGCCCCT GTGCCTGGGC AGGATGGATG GGGCCTGGAT CCTGGCCTCG
GAGTCCTGCG CCCTGGATAC CCTGGGGGCC GATTTTGTCC GCGATCTGGA ACCCGGGGAG
ATTGTCATTA TCGACAGCCG GGGGGTGCGT TCCCTCCAGG GACCCCGGGC GGCCCACCGG
GCCCACTGCA TTTTTGAATA TGTCTACTTT GCCCGGCCGG ATAGCATCCT GGACGGCGAG
ACCGTCAACC TGGTGCGGCG GGAACTGGGC CGGAATCTAG CCCGGGAATA CCGGGTGGCG
GCCGACGCCG TCATTCCGGT ACCCGACTCC GGTATTGCCG CCGCCGCCGG CTATGCCGAG
GTGGCCGGCC TGCCCTTTGT GGAGGGGTTG ATGAAAAACC GCTACGTCGG CCGGACCTTC
ATCCAGCCCA CCCAGGAGAT GCGGGACCTG GGGGTGCGCT TGAAGCTCAA CCCTATCAAG
CCCATCTTAA AGGATAAAAG GGTTATTATA ATTGATGACT CCCTGGTCCG GGGAACCACC
AGCCGGAGGA TAGTAGCCAT GCTGCGCCAG GCCGGGGTCC GGGAGGTGCA CCTGCTGGTG
GCCTCGCCGC CGGTCCTGTA TCCCTGTTAC TACGGCATTG ATACCAGCGC CCGGGGAGAG
CTCATTGCCG CCCGGTATCC CCTGGAGGAC ATCCGCCGCC ATGTGGATGC CGACAGTCTC
CACTACCTCA GCCTGGAAGG GTTGTTTCGT TCCGTGCAGA GGGGGATGGA AGACTTCTGC
GCCGCCTGCT TCACCGGCCG CTACCCCATC CCCATCCCTT CCCCGGAGGA GGCTACCAAG
TACAGCCTGG AAGGGTAG
 
Protein sequence
MSSWHEECGV FGIYAPGQDV ARLAYYGLFA LQHRGQESAG IAVANGRHIA VHKGMGLVAE 
VFNRDNLRAL HGDVAIGHVR YSTTGASSLV NAQPLVFRYL RGMVAIAHNG NLTNASELRR
ELGASGSIFQ SSTDSEIIVN LIARHSQEPV EAALLHCQEE LRGAYSLVVM TEEQLIGVRD
PHGVRPLCLG RMDGAWILAS ESCALDTLGA DFVRDLEPGE IVIIDSRGVR SLQGPRAAHR
AHCIFEYVYF ARPDSILDGE TVNLVRRELG RNLAREYRVA ADAVIPVPDS GIAAAAGYAE
VAGLPFVEGL MKNRYVGRTF IQPTQEMRDL GVRLKLNPIK PILKDKRVII IDDSLVRGTT
SRRIVAMLRQ AGVREVHLLV ASPPVLYPCY YGIDTSARGE LIAARYPLED IRRHVDADSL
HYLSLEGLFR SVQRGMEDFC AACFTGRYPI PIPSPEEATK YSLEG