Gene Moth_1935 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1935 
Symbol 
ID3832427 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2009767 
End bp2010783 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content58% 
IMG OID637829866 
Productgalactose-1-phosphate uridylyltransferase 
Protein accessionYP_430776 
Protein GI83590767 
COG category[C] Energy production and conversion 
COG ID[COG1085] Galactose-1-phosphate uridylyltransferase 
TIGRFAM ID[TIGR00209] galactose-1-phosphate uridylyltransferase, family 1 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.0280643 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000283662 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCCGAAT TACGCCAGGA CCCGGTGAGC CAGCGCTGGG TAATCATTGC CACCGAAAGG 
GCCAAAAGGC CTTCGGACTT CAAGCCTCCG CATCAAGAAA AGAATGGCAG CACCGGCTGT
CCTTTCTGCC CCGGCCATGA AAGAGAAACC CCGCCGGAGG TGCTGGCCTT CCGGGTCGCC
GGCACCGCCC CCGACACCCC CGGCTGGCGG GTACGGGTTG TCCCCAATAA ATTCGCCGCC
CTGGCCCCCG GCGACGATGT AACCATAGAA AATAGGGGTT TATACCGGAC CATGAGCGGC
ACCGGTGCCC ACGAAGTCAT AATCGAGGGT CCCGATCACA ACACCTTTTT CCCCGATATG
GCCCCGGATC ATGCCGTCCA GGTCTTTAAG GCCTGGCGCC AGCGTTATCT GCAATTGAGC
CGGGATAAGA AACTACAATA TATCCAGCTC TTTAAAAACC ATGGCCGCAC GGCCGGAGCC
TCCCTGGAAC ACCCCCACAG CCAGTTAATC GCCACTCCCC TGGTACCAGT TACCGTCAGC
CAGGAAATGG ATAGATTTAA GTCCTACTGG CAGGAACAAG AGAGCTGCCT CCTCTGCGAT
GTCGTAGAGG CCGAACTGGA AGCCGGGGCC AGGGTCACCG GCTTCAATAG TGAGTTCCTG
GCCTTTTGTC CCTTTGCCTC CCGGTTTCCC ATGGAGACCT GGATCGTACC CCGCAGGCAC
CAGACAGGCT TCGGCGACTG CGACGACGTG CAGCTTAAGC AACTGGGCGC CATCGTACAG
GAAACCCTGG GCCGGCTGAA AAAGGCGGCT GGTGACCCGC CCTTTAACCT GGTCCTCCAT
ACGGCCCCCC TACATCAGGA CGACGTTATC TATCACTGGC ATTTGGAACT ATTGCCCCGG
CTGGCCATCG TCGCCGGGTT TGAGTGGGGA ACAGGTATCT ATATCAACCC CACCCCGCCG
GAAATTGCGG CCCAGTCTCT CAATGAAATT AAACTGGATC AGGAGGCACT GGCTTAA
 
Protein sequence
MPELRQDPVS QRWVIIATER AKRPSDFKPP HQEKNGSTGC PFCPGHERET PPEVLAFRVA 
GTAPDTPGWR VRVVPNKFAA LAPGDDVTIE NRGLYRTMSG TGAHEVIIEG PDHNTFFPDM
APDHAVQVFK AWRQRYLQLS RDKKLQYIQL FKNHGRTAGA SLEHPHSQLI ATPLVPVTVS
QEMDRFKSYW QEQESCLLCD VVEAELEAGA RVTGFNSEFL AFCPFASRFP METWIVPRRH
QTGFGDCDDV QLKQLGAIVQ ETLGRLKKAA GDPPFNLVLH TAPLHQDDVI YHWHLELLPR
LAIVAGFEWG TGIYINPTPP EIAAQSLNEI KLDQEALA