Gene Moth_0669 details


Gene Information

Locus tag: Moth_0669
Symbol:
ID: 3832156
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 701458
End bp: 702804
Gene Length: 1347 bp
Protein Length: 448 aa
Translation table: 11
GC content: 54%
IMG OID: 637828607
Product: undecaprenyl-phosphate galactosephosphotransferase
Protein accession: YP_429537
Protein GI: 83589528
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2148] Sugar transferases involved in lipopolysaccharide synthesis
TIGRFAM ID: [TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase
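
The start, end, gene length and protein length fields in this record agree with one another, and a short check makes the arithmetic explicit. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(701458, 702804)
print(length, protein_length(length))  # prints: 1347 448
```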


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000000381309
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
TTGCGCTGGA CAAAGGTGGT AAAAATAGCC GGGGATCTTT TTTTAGTTAA CTTCAGCTTC 
TGGTTGGCTT TCTGGCTCCG CTTCGGCGGC AGCATTCCTG CTGTCAACTG GCACGCCTAC
CGGACGATCT CCATCTGGAT TACCCTGGCG GCCTTTGTCC TTTTTTACAG CTACGGGCTC
TACGTCGGCG GCCGTTACCG CTGGATAGAG ATCTTTGCCG CCCTGGTCTG GGTGGTGGCT
TTGACTATCC TCAGCGGCCT GGGCATTTCA TACATGCTGC AAAAGTATGC CTTCCCCCGC
TCGGTATTTC TCCTGACTGC ACCTATCCAG CTCGCGCTTT TGAGTATCTG GCGGTATGCT
GTGTGGCGCT TTTCTATCTG GTTACAGGGT ACCCTCACGC TGGTTGTCAT TGGTCCTACA
GAGACAGCCT GTCAGCGGGC GCGGGAGGTT ACCCGGGAAG ACAATCGCCT TTATCAGGTG
GCGGGTCTGG TAGTCGAGGG CGGTTCGACA ACGTCTGAGA TAGAGTTTCC AGTGCTGGGA
ACTTATCATG AACTTCCTCA AGCCCTTGAT GCCAGCCGGC CCGGAGCGGT ACTATTTTGC
GATGGGATTC CCTTGGAATA TCGCGAAATG ATGCTGAAAG AAATTATGGC TCGTAATCTG
CCCGCTTTCA TAGTTCCTGA TATCTATGAA ATCTTTTTAG CCCAGGCCCG CCTGGGACAG
CTCGACGGCA TCCCTGTTTT TCGGGTGGAC GGCTTTATAG CTGTGCCTTC GCGCGCCTGG
AAGCGGGCCT TCGATATTGC TTTGTCCTTA TGTCTATCCA TAATAGCCAT ACCCCTGATT
TTGCTGGCGG CCCTGGCGAT TAAGATTGAA TCTCCCGGCG GACCGGTATT TTACCGCCAG
CAGCGGGTGG GCCAGGGTGG TCGGGTGTTC CAGTTGATCA AACTGCGAAC CATGGTACCT
GATGCGGAGA AGATTACCGG CCCGGTACTG GCCACCGATA AGGACCCGCG TATTACCAGG
GTGGGCCGGA TTCTAAGGGC CACTCGTATT GACGAACTAC CCCAGCTCTG GAATGTTCTC
AAAGGGGAGA TGAGCTTCAT CGGCCCCCGG CCGGAACGCC CTTTCTTTGT AGAGCAATTC
AAGAAAGAAG TGCCGGGCTA CGACTGGCGC CATCAGTTGA AGGTGGGTAT CACCGGCCTG
GCCCAGGTCC AGGGGCGCTA CAGCACCACT CCGGCCGATA AACTTCGTTA CGACCTGCTC
TATGCTAAAA CCATCTCACC TTTAAGCGAC GCCCAGATTC TTTTACATAC CCTGAAAGTC
ATGCTCATGC GGGACAAAGC ATCATAG
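
The GC content reported for this gene (54%) can be recomputed from the sequence above. The following is a minimal sketch, assuming the sequence is pasted as a string with its 10-base blocks still separated by spaces or newlines. `gc_content` is a hypothetical helper, not a function of the source database.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace in the sequence."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First 10-base block of the gene sequence above.
print(gc_content("TTGCGCTGGA"))  # prints: 60.0
```

Passing the full 1347 bp sequence instead of the first block gives the figure for the whole gene.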
 
Protein sequence
MRWTKVVKIA GDLFLVNFSF WLAFWLRFGG SIPAVNWHAY RTISIWITLA AFVLFYSYGL 
YVGGRYRWIE IFAALVWVVA LTILSGLGIS YMLQKYAFPR SVFLLTAPIQ LALLSIWRYA
VWRFSIWLQG TLTLVVIGPT ETACQRAREV TREDNRLYQV AGLVVEGGST TSEIEFPVLG
TYHELPQALD ASRPGAVLFC DGIPLEYREM MLKEIMARNL PAFIVPDIYE IFLAQARLGQ
LDGIPVFRVD GFIAVPSRAW KRAFDIALSL CLSIIAIPLI LLAALAIKIE SPGGPVFYRQ
QRVGQGGRVF QLIKLRTMVP DAEKITGPVL ATDKDPRITR VGRILRATRI DELPQLWNVL
KGEMSFIGPR PERPFFVEQF KKEVPGYDWR HQLKVGITGL AQVQGRYSTT PADKLRYDLL
YAKTISPLSD AQILLHTLKV MLMRDKAS
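
The protein sequence follows from the gene sequence under translation table 11, the bacterial, archaeal and plant plastid code. The gene starts with TTG, which table 11 allows as an initiation codon and which is read as methionine at the first position. A minimal sketch of that translation, assuming the input is already in frame on the coding strand, is shown below. The names `translate`, `CODON_TABLE` and `START_CODONS` are illustrative.

```python
BASES = "TCAG"
# Amino acids for table 11 in NCBI order (first base T, C, A, G; same for second and third).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons permitted by table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(seq: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # any permitted start codon initiates with Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("TTGCGCTGGA CAAAGGTGGT AAAAATAGCC"))  # prints: MRWTKVVKIA
```

The output matches the first ten residues of the protein sequence listed above, and translating the final 27 bases of the gene gives the closing residues MLMRDKAS before the TAG stop codon.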