Gene Information |
Locus tag | Moth_1935 |
Symbol | |
ID | 3832427 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 2009767 |
End bp | 2010783 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637829866 |
Product | galactose-1-phosphate uridylyltransferase |
Protein accession | YP_430776 |
Protein GI | 83590767 |
COG category | [C] Energy production and conversion |
COG ID | [COG1085] Galactose-1-phosphate uridylyltransferase |
TIGRFAM ID | [TIGR00209] galactose-1-phosphate uridylyltransferase, family 1 |
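
The coordinate and length fields above are consistent with one another. The following is a minimal sketch of that check, assuming 1-based inclusive coordinates and a gene length that includes the stop codon (both are assumptions, not stated in the record). The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start/end coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above.
length = gene_length_bp(2009767, 2010783)
print(length, protein_length_aa(length))
```

With the record's values this reproduces the listed gene length (1017 bp) and protein length (338 aa).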
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.0280643 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.00000283662 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | ATGCCCGAAT TACGCCAGGA CCCGGTGAGC CAGCGCTGGG TAATCATTGC CACCGAAAGG GCCAAAAGGC CTTCGGACTT CAAGCCTCCG CATCAAGAAA AGAATGGCAG CACCGGCTGT CCTTTCTGCC CCGGCCATGA AAGAGAAACC CCGCCGGAGG TGCTGGCCTT CCGGGTCGCC GGCACCGCCC CCGACACCCC CGGCTGGCGG GTACGGGTTG TCCCCAATAA ATTCGCCGCC CTGGCCCCCG GCGACGATGT AACCATAGAA AATAGGGGTT TATACCGGAC CATGAGCGGC ACCGGTGCCC ACGAAGTCAT AATCGAGGGT CCCGATCACA ACACCTTTTT CCCCGATATG GCCCCGGATC ATGCCGTCCA GGTCTTTAAG GCCTGGCGCC AGCGTTATCT GCAATTGAGC CGGGATAAGA AACTACAATA TATCCAGCTC TTTAAAAACC ATGGCCGCAC GGCCGGAGCC TCCCTGGAAC ACCCCCACAG CCAGTTAATC GCCACTCCCC TGGTACCAGT TACCGTCAGC CAGGAAATGG ATAGATTTAA GTCCTACTGG CAGGAACAAG AGAGCTGCCT CCTCTGCGAT GTCGTAGAGG CCGAACTGGA AGCCGGGGCC AGGGTCACCG GCTTCAATAG TGAGTTCCTG GCCTTTTGTC CCTTTGCCTC CCGGTTTCCC ATGGAGACCT GGATCGTACC CCGCAGGCAC CAGACAGGCT TCGGCGACTG CGACGACGTG CAGCTTAAGC AACTGGGCGC CATCGTACAG GAAACCCTGG GCCGGCTGAA AAAGGCGGCT GGTGACCCGC CCTTTAACCT GGTCCTCCAT ACGGCCCCCC TACATCAGGA CGACGTTATC TATCACTGGC ATTTGGAACT ATTGCCCCGG CTGGCCATCG TCGCCGGGTT TGAGTGGGGA ACAGGTATCT ATATCAACCC CACCCCGCCG GAAATTGCGG CCCAGTCTCT CAATGAAATT AAACTGGATC AGGAGGCACT GGCTTAA
Protein sequence | MPELRQDPVS QRWVIIATER AKRPSDFKPP HQEKNGSTGC PFCPGHERET PPEVLAFRVA GTAPDTPGWR VRVVPNKFAA LAPGDDVTIE NRGLYRTMSG TGAHEVIIEG PDHNTFFPDM APDHAVQVFK AWRQRYLQLS RDKKLQYIQL FKNHGRTAGA SLEHPHSQLI ATPLVPVTVS QEMDRFKSYW QEQESCLLCD VVEAELEAGA RVTGFNSEFL AFCPFASRFP METWIVPRRH QTGFGDCDDV QLKQLGAIVQ ETLGRLKKAA GDPPFNLVLH TAPLHQDDVI YHWHLELLPR LAIVAGFEWG TGIYINPTPP EIAAQSLNEI KLDQEALA
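
The protein sequence can be derived from the gene sequence by codon-wise translation. The sketch below is an illustration under stated assumptions, not the pipeline that produced this record: it uses the standard codon assignments, which translation table 11 shares for internal codons, and it does not handle the alternative start codons that table 11 allows (this gene starts with ATG, so that does not matter here). The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Standard codon assignments, enumerated in TCAG order for each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a + strand CDS, stopping at the first stop codon.

    Whitespace is ignored, so the 10-base blocks used in the record
    can be pasted in directly.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGCCCGAAT TACGCCAGGA CCCGGTGAGC"))
```

Applied to the first 30 bases of the gene sequence, this yields MPELRQDPVS, which matches the first block of the protein sequence.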