Gene Emin_1090 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_1090 
Symbol 
ID6263823 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp1183030 
End bp1184091 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content42% 
IMG OID642611570 
ProductD-alanine/D-alanine ligase 
Protein accessionYP_001875979 
Protein GI187251497 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1181] D-alanine-D-alanine ligase and related ATP-grasp enzymes 
TIGRFAM ID[TIGR01205] D-alanine--D-alanine ligase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones97 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTAAGT TAACAATAGC TGTCGTATAC GGAGGCAAAA GCGTTGAGCA TGAGGTTTCC 
GTCCATTCCG CCGCGGATGT GTGCGGCATT CTTTCCAAAA AATATAATGT TATAAATATA
TTTATAAGTA AAGAGGGGCT TTGGTTTGCA CAGGAAAAGT GCGGCCCCGC CCAGCCTTCC
GACATAAATG TAACGCCTGT ATTTTCAAAC GACTGTATGT TTCAAACCGC AGACGGCGGC
ACAATAAGCG CGGACGTTCT TTTCCCTGTT TTGCACGGTA CTAAAGGGGA AGACGGCTGC
ATACAAGGTT TATTTGAACT TATGGAAACG CCTTACGTAG GCTGCGGCGT TATGGCTTCG
GCCATAGGTA TGGATAAAGA AGTAAGCAAA GTTTTAGCCG CTTATGCGGG CGTTCCCACT
TTGCCTTATA TAACTTTTAA AAAAGGGGAC AGCGTTACCA AAGATTTTAA GGCCAAAACA
GCCAAACTCG GCTATCCTGT TTTTGTAAAA CCGGTTAATT TAGGCTCTTC CATAGGCATT
ACAAAAGTTA AGGAAGAGGC CTTTTTGGAA AAGGCCATAG AGTTCGCGTT AAAATTTGAT
AATGATATTT TGATTGAAAA AGGCGTGCAA TCCCCCAAAG AAATATTTTG CGCCGTTTGC
GGCGAGGGCA ACAATACCGA ATCCTCGTCT TGCGGGGAAC TTATTCCCAA CGGGCATGAA
TTTTTTGACT ACCACTCAAA ATATATTGAC CCCAACGGAT GTACGGTTAA GGTGCCGGCC
TCGGTGTCTG AAGAAGAGGA CAAACTTATG CAAAAATATT CCCGCATGAT ATTTAAAGTT
TTTAAAGCCT GCGGTTTTGC CAGGGTAGAT TTTTTATTGG GCAAGGACGG GCAAATTTAC
TTTTCCGAAA TAAATACTTT GCCCGGTATG TCTAATGCAA GTTTATTTCC GCAGTTATGG
CGCGCGGCGG GAAAAAAGTA TGAAGATGTT TTAGATACAT TAATAAATTT GGCGCTAAAA
CGCCGTGAAC AAATAAAAAC CCTTAAAACG GATAAAGAAT GA
 
Protein sequence
MGKLTIAVVY GGKSVEHEVS VHSAADVCGI LSKKYNVINI FISKEGLWFA QEKCGPAQPS 
DINVTPVFSN DCMFQTADGG TISADVLFPV LHGTKGEDGC IQGLFELMET PYVGCGVMAS
AIGMDKEVSK VLAAYAGVPT LPYITFKKGD SVTKDFKAKT AKLGYPVFVK PVNLGSSIGI
TKVKEEAFLE KAIEFALKFD NDILIEKGVQ SPKEIFCAVC GEGNNTESSS CGELIPNGHE
FFDYHSKYID PNGCTVKVPA SVSEEEDKLM QKYSRMIFKV FKACGFARVD FLLGKDGQIY
FSEINTLPGM SNASLFPQLW RAAGKKYEDV LDTLINLALK RREQIKTLKT DKE