Gene Moth_0218 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0218 
Symbol 
ID3831369 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp214821 
End bp215870 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content58% 
IMG OID637828154 
Productglycosyl transferase family protein 
Protein accessionYP_429096 
Protein GI83589087 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0472] UDP-N-acetylmuramyl pentapeptide phosphotransferase/UDP-N-acetylglucosamine-1-phosphate transferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones48 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGATATCC TGGCCTTGGC ACTAGCCGGG ATAGTCAGTT TCTTACTGAC ACCTTTGCTT 
TGCCGGATAG CACCCCGCCT GGGGGCTGTA GATAAACCCA ACGCCCGCAA GATTCATCAT
ACCCTTATGC CCCGCCTGGG AGGGGTGGCC ATTTATGCGG GGTTTATGCT GGCTTACTGG
CTGGGAGGGT ATCGCCATCA GGAATACCTG GGGCTCTTCC TTGCCGGGAC CTTTATCATG
CTTGTGGGTA TAATTGATGA TATCCGCTCC TTGAGCCCCC GGCTGAAGCT CCTGGGGCAG
ATCATAGCGG CCGTCATCCT GGTGGCCTTC GGTGTCCGGG TGGATTTCCT GACCAACCCC
TTCGATGGCC TTTTTATCCT GGGGAAACTG GCTATCCCGG TTACTATTTT CTGGCTGGTA
GGCGTTACCA ACGCCTTGAA CCTTGTCGAC GGACTGGACG GGCTGGCAGC AGGAACCTCT
TTGATAGCGG CCGTAACCAT CGCCGTTGTC GCCTGGTTCA ACGGCGAGCT GGTAGTAGCT
TTTCTCTCCC TGGCTCTGGC GGCAGCAGTC CTGGGTTTCC TGCCCTTCAA CTTTCACCCG
GCGCGGATTT TTATGGGCGA TAGCGGGTCC ATGTTCCTGG GCTTTAACCT GGCGGCCCTG
GCTACCATCG GCCTGACCAA GAGCGCTACG GTAATATCCC TCTTTATCCC GGTGGTAATC
CTTGGACTGC CAATTCTGGA CACCATGTTT GCCATTGTCC GTCGCTTCCT CAATCACCGG
CCCATCTTTG CCCCGGATAA GGGGCACCTG CACCACCGCT TGCTGGCCCA GGGTTTGAGC
CAGCGCCAGG CAGTAGGGGT TATTTACCTC GTTGATGCCT GCCTGGGTGG CAGCGCCATC
CTCCTTAGCC GGGTGGCTAC GGACCAGGGG GTGCTGATCC TGATCGGGCT GGCGGTAATT
ATCCTGGTGG GTTGCGATAA ATTGGGGATT ATTGGCAGGG GCGGCCTGGC CAGGGCTAAA
ACCCGGCATA ATACCCTGCA CATGTTCTAG
 
Protein sequence
MDILALALAG IVSFLLTPLL CRIAPRLGAV DKPNARKIHH TLMPRLGGVA IYAGFMLAYW 
LGGYRHQEYL GLFLAGTFIM LVGIIDDIRS LSPRLKLLGQ IIAAVILVAF GVRVDFLTNP
FDGLFILGKL AIPVTIFWLV GVTNALNLVD GLDGLAAGTS LIAAVTIAVV AWFNGELVVA
FLSLALAAAV LGFLPFNFHP ARIFMGDSGS MFLGFNLAAL ATIGLTKSAT VISLFIPVVI
LGLPILDTMF AIVRRFLNHR PIFAPDKGHL HHRLLAQGLS QRQAVGVIYL VDACLGGSAI
LLSRVATDQG VLILIGLAVI ILVGCDKLGI IGRGGLARAK TRHNTLHMF