Gene Moth_1369 details


Gene Information

Locus tag: Moth_1369
Symbol:
ID: 3832292
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1414715
End bp: 1415659
Gene Length: 945 bp
Protein Length: 314 aa
Translation table: 11
GC content: 58%
IMG OID: 637829305
Product: UDP-glucose pyrophosphorylase
Protein accession: YP_430225
Protein GI: 83590216
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1210] UDP-glucose pyrophosphorylase
TIGRFAM ID: [TIGR01099] UTP-glucose-1-phosphate uridylyltransferase
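
The length fields in this record agree with the coordinates if Start bp and End bp are read as 1-based, inclusive positions. That convention is an assumption here, since the record does not state it. A minimal sketch of the arithmetic, with hypothetical helper names `gene_length` and `protein_length`:

```python
# Coordinates copied from the record above.
START_BP = 1414715
END_BP = 1415659


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in a single stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 945 314
```

Both values match the Gene Length (945 bp) and Protein Length (314 aa) fields.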


Plasmid Coverage information

Num covering plasmid clones: 49
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATAGAC GTGAAAATAA GAACGAGGTG ATATCTTTGA CCAGGATTCA GAAAGCTATC 
ATTCCCGCTG CCGGCTGGGG TACCCGTTTC CTGCCGGCCA CCAAAGCCCA GCCCAAGGAA
ATGTTACCCA TTGTTGACAA ACCGGCCATT CAATTCATCG TCGAAGAAGC CGTTAATGCC
GGTGCGGAGG ATATCCTGAT TATCACCGGT AAAAATAAAC GAGCCATCGA GGACCACTTT
GACCGTTCCC TGGAACTGGA GAACCTCTTG CGGGAAAAGG GAAAGGACGA ACTCCTGGCC
CTGGTGGAGG GGATTGCTGA ACTCGCCGAT ATCCACTACA TCCGCCAGAA GGAGCAACTG
GGCCTGGGGC ACGCCGTTTA CTGTGCCCGC AAATTTATCG GCCAGGAACC CTTTGCCGTC
CTCCTGGGGG ATGATATTAT CGTCAACTCC CCCTCCTGCC TGGAACAGAT GCTCGCCGTT
TATGAAGAAG TCGGGGCTAC CATAGTCGCC GTCCAGGAGG TGCCCCGGGA GGAAGTAAAC
CGTTACGGTG TCATTGACCC CCTGGAAGTC GACGGCTCCC TCATCCGGGT CAGGGACCTG
GTTGAGAAGC CGCGCCCCGA GGAAGCCCCT TCCAATCTGG CCGTCATTGG TCGTTACATC
CTGGTGCCGG CGATCTTCCC TCTCCTGGAG AAAGTAAAAC CGGGCGCTGG CGGCGAGATC
CAGCTGACCG ACGCCCTGCG CCTGCTGGCC CGCCAGGACC GTGTCTATGC CTACCGTTTC
CAGGGTAAGC GCTACGACAT CGGTGACAAG CTGGGCTTTC TCCAGGCCAC GGTGGAATTT
GCCCTAGCCC GCCCGGATCT CGCCGGCCCC TTTAGCGAGT ACCTGGCCGG CATCATTACT
TCCACTGCGG ACAAGTTGCA GGTTGCCGCC ACCAGGGAAG GATAA
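
The GC content field can be checked against the gene sequence by counting G and C bases. The sketch below uses a hypothetical helper, `gc_content`, that ignores the spaces and line breaks used to group the sequence into 10-base blocks. The record does not say how its 58% figure was rounded, so the function returns an unrounded percentage.

```python
def gc_content(dna: str) -> float:
    """Percentage of G and C bases; whitespace in the input is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First 10-base block of the gene sequence above: 3 of 10 bases are G or C.
print(gc_content("ATGAATAGAC"))  # 30.0
```

Passing the full gene sequence as a single string, spaces and newlines included, gives the value for the whole gene.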
 
Protein sequence
MNRRENKNEV ISLTRIQKAI IPAAGWGTRF LPATKAQPKE MLPIVDKPAI QFIVEEAVNA 
GAEDILIITG KNKRAIEDHF DRSLELENLL REKGKDELLA LVEGIAELAD IHYIRQKEQL
GLGHAVYCAR KFIGQEPFAV LLGDDIIVNS PSCLEQMLAV YEEVGATIVA VQEVPREEVN
RYGVIDPLEV DGSLIRVRDL VEKPRPEEAP SNLAVIGRYI LVPAIFPLLE KVKPGAGGEI
QLTDALRLLA RQDRVYAYRF QGKRYDIGDK LGFLQATVEF ALARPDLAGP FSEYLAGIIT
STADKLQVAA TREG
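
The protein sequence follows from the gene sequence by codon-by-codon translation. Translation table 11 assigns the same amino acids to codons as the standard table and differs only in which codons may act as initiators. This gene starts with ATG, so a plain lookup is enough here. The following is a minimal sketch, not the database's own method; `CODON_TABLE` and `translate` are hypothetical names.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a CDS; whitespace is ignored and a final stop codon is dropped."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = [CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3)]
    if protein and protein[-1] == "*":
        protein.pop()
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAATAGAC GTGAAAATAA GAACGAGGTG"))  # MNRRENKNEV
```

The output matches the first ten residues of the protein sequence. The last five codons of the gene, ACC AGG GAA GGA TAA, translate to TREG followed by the stop codon.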