Gene Moth_0996 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0996 
Symbol 
ID3830872 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1023579 
End bp1024946 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content58% 
IMG OID637828925 
ProductUDP-N-acetylmuramyl tripeptide synthase 
Protein accessionYP_429854 
Protein GI83589845 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0769] UDP-N-acetylmuramyl tripeptide synthase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.00110541 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.028804 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGCGGC GTATCCTGGC CATCCTGGCC GGCCGCCTGG TAGCTTTTTT ATGCCGCTGC 
TTAAAAAAAG GCGGCACTTC CCTTCCGGGT TTTCTGGCCC TGAAGTTGGA CCCGGATTTG
ATAGAGGGCT TAATAGCAGG CTACCGCAAG GTAATCATAG TCACAGGTAC CAACGGTAAA
ACGACCACCA CCAACCTCCT GGCCAGCATC CTGCGGGCAT CAGGATTAAG AGTCGTTTCC
AATAGCGAAG GGGCCAATAT GCCCGCCGGC GTTGCTACGG CCCTGCTAGG ACAGCGGGGC
GAAGTGGCTG TGCTGGAGGT TGATGAAGGA TCCCTGGCCC TTGTTACCGG CCAGGTGCAA
GCTGATGTTG TGGTTGTAAC TAATCTGTTG CGGGATCAAC TGGATCGTTA CCATGAATTG
GAACAACTGG CCGAGGCCAT AAAAATAGCC CTGGCCCATA CCCCGGATGC CGCCCTGGTC
CTTAACGCCG ATGATTCCCT GGTAACCTCC TTGGGAGATG GGCGTAGTAC CGTGCGTTAC
TTCGGCCTGG CACGGACCCC CTGGAGCCAG GAGACCACCA GGGAGGTGCT GGAGGGGCAT
ATATGTCCCC ATTGCCACCA GCCCTTGGGG TTTAAATTTT ACCACTACAG CCACCTGGGG
GATTATTTCT GCCCCCGCTG CTCCTACCGG CGTCCCCGAG CCGAGTATGA AGGCCGGGGG
CTCCAGTTAA AACCGGGAGG AACTGACTTT AACCTGGTGC ATCCAGGTGG TGCCCTGTTC
CTCCATACTC CCATGCCGGG GATTTATAAC GTCTACAATA TCCTGGCCGC CGCAGGTACA
GCGCTATATC TGGGGGTTGA GCCGGCAACC ATTGTCCGGG TGGTTGCTTC CTTCCTTCCC
GGCCAGGGCC GGGCTGAAGC CTTCAACCTC CGGGACAGAC GTATTACTTT AATGCTGGTC
AAAAACCCCA CCGGCATGGG AGTGGCCCTC CGGACCCTGG CTACGGGCCG GCAAAAAATG
GCTTTTCTCC TGGCCATTAA TGACCTGGCC GCCGACGGCC GGGATGTCTC CTGGCTCTGG
GACGCCGACC TTACTCCCCT GCTGGCAATC CCTGGCAATC CCATCATTTG CGCCGGCCTC
CGGGCCGGAG ATATGGCCAT CTGCCTTAAA TACCAGGGGA TAAAGGAGGC CGACCTGGAA
GTAATCCCCG ACCCCGCAGC CAGTATAGAG TGTCTCCTGG CCAAACCGGT CAAGGAAGCC
CTAATCCTCT GTACCTATAC CAACCTGGCC GTTTACCGCC GTCTTTTGCA GCATAGGGGG
GCTCGCAGTG AAGTTACACC TGGGACATCT CTACCCGGAG TTTCTTAA
 
Protein sequence
MLRRILAILA GRLVAFLCRC LKKGGTSLPG FLALKLDPDL IEGLIAGYRK VIIVTGTNGK 
TTTTNLLASI LRASGLRVVS NSEGANMPAG VATALLGQRG EVAVLEVDEG SLALVTGQVQ
ADVVVVTNLL RDQLDRYHEL EQLAEAIKIA LAHTPDAALV LNADDSLVTS LGDGRSTVRY
FGLARTPWSQ ETTREVLEGH ICPHCHQPLG FKFYHYSHLG DYFCPRCSYR RPRAEYEGRG
LQLKPGGTDF NLVHPGGALF LHTPMPGIYN VYNILAAAGT ALYLGVEPAT IVRVVASFLP
GQGRAEAFNL RDRRITLMLV KNPTGMGVAL RTLATGRQKM AFLLAINDLA ADGRDVSWLW
DADLTPLLAI PGNPIICAGL RAGDMAICLK YQGIKEADLE VIPDPAASIE CLLAKPVKEA
LILCTYTNLA VYRRLLQHRG ARSEVTPGTS LPGVS