Gene Moth_1140 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1140 
Symbolddl 
ID3833238 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1170723 
End bp1171652 
Gene Length930 bp 
Protein Length309 aa 
Translation table11 
GC content58% 
IMG OID637829070 
ProductD-alanine--D-alanine ligase 
Protein accessionYP_429997 
Protein GI83589988 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1181] D-alanine-D-alanine ligase and related ATP-grasp enzymes 
TIGRFAM ID[TIGR01205] D-alanine--D-alanine ligase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.000909084 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATTG CCGTCCTAAT GGGTGGCCCC TCTTCCGAAA GGGAGATTTC TTTAAAAAGC 
GGGTCTGCCG TTGCCGCCGC CCTGTCCGGC CTGGGTCACC AGGTGATAAC TATAGATCTG
AACAGGGAGG TAGTAGCCAG GTTAAAAAAC TTCGCCCCCG ACGTTGTCTT TAACGCCCTC
CACGGTAAAC CCGGGGAAGA CGGTTCTGTC CAGGGCCTGC TGGAGGTCCT GGGCCTGCCT
TATACTGGCA GCCGCGTCCT GGCCAGTGCC ATAACAATGG ATAAAATTAT GACTAAACGC
GTCCTGCTCC AGGCCGGTAT CCCCACCCCC AAATTTTTAG CCTGGACCGG TGCTGAATAC
GCCACCGGCA AGAAAGAGAT AAAGGCGGCG ATATTAAAGG AACTAGGTTT GCCGGTGGTC
ATTAAGGCCC CGACCCAGGG TTCCACCATT GGCACCTTTA TCGTCAGGGA AGAAGGGGAA
CTGGAGCCGG CCATAGCCGG GGCCCTGAAA TATGACCTTT CCTTTATGGC CGAAGCCTAC
CTGGCAGGCC CGGAGATCAC GGCTGCCGTC CTGGGGAACC GGAAACCCCA GGTCTTGCCT
TTAATAGAAA TCGTCTCCCA TACCGGCTTT TATGATTATC AGGCCAAGTA CACCCCCGGC
CTGAGTGATC ATATTATCCC GCCCCGATTG CCGGATGACG TCCTGGCAGC AGCTACCTCC
CTGGCCGGCC GGACCTATGC CCTCCTGGGT TGCCGCGGTT TCGCCCGGGT GGATTTTATC
GTGGCGGGGG GCCGGGAGCC CCAGGTCATT GAAGTCAATA GCGTCCCGGG GATGACCGCC
ACCAGCCTGG TACCGGACGC CGCCCGGGCG GCAGGATTGG ATTTTCCGGA TCTGGTCCAG
AAAATCGTCG ACCTGGCCCT GGAGCCTTGA
 
Protein sequence
MKIAVLMGGP SSEREISLKS GSAVAAALSG LGHQVITIDL NREVVARLKN FAPDVVFNAL 
HGKPGEDGSV QGLLEVLGLP YTGSRVLASA ITMDKIMTKR VLLQAGIPTP KFLAWTGAEY
ATGKKEIKAA ILKELGLPVV IKAPTQGSTI GTFIVREEGE LEPAIAGALK YDLSFMAEAY
LAGPEITAAV LGNRKPQVLP LIEIVSHTGF YDYQAKYTPG LSDHIIPPRL PDDVLAAATS
LAGRTYALLG CRGFARVDFI VAGGREPQVI EVNSVPGMTA TSLVPDAARA AGLDFPDLVQ
KIVDLALEP