Gene Emin_1014 details


Gene Information

Locus tag: Emin_1014
Symbol:
ID: 6263191
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 1106341
End bp: 1107570
Gene Length: 1230 bp
Protein Length: 409 aa
Translation table: 11
GC content: 42%
IMG OID: 642611494
Product: L,L-diaminopimelate aminotransferase
Protein accession: YP_001875904
Protein GI: 187251422
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0436] Aspartate/tyrosine/aromatic aminotransferase
TIGRFAM ID: [TIGR03542] LL-diaminopimelate aminotransferase
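
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1106341
end_bp = 1107570

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # compare with "Gene Length"
print(protein_length_aa)  # compare with "Protein Length"
```

Under these assumptions the computed values agree with the listed gene length of 1230 bp and protein length of 409 aa.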


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAAAAC TTAATGAAAA CTATTTAAAA TTACAAAGCA GCTATCTTTT TTCAACTATA 
GCTAAAAAAG TGGCTGCTTA CAAACAGGAA AATCCTTCGG CGGAAATTAT ACGTTTAGGT
ATCGGCGACG TTACTCTGCC ATTGCCTTCG GCAGTTATAG AAGCCATGCA CAAAGCGGTT
GACGAAATGG CGGTTTCCTC AAGTTTTAAA GGCTACGGGC CTGATTACGG ATATGATTTT
TTAAGGCAAA AAATTGTTGA AACCGATTAT TTGGCAAGAG GCGTACAAAT TACCGAAGAT
GAGGTTTTTA TAAGCGACGG CGCTAAGAGC GACGTGGGTA ACTTTCAGGA AATTTTTGAC
GCCAAAGCGT CCGTTGCTAT TACGGACCCG GTGTACCCGG TGTATCTTGA CACAAATGTT
ATGGCTGGCA GAACGGGCGC TTTTAAAAAA GGTAAATACA GTAAAATAGT TTATTTGCCC
TGTACGGCAA AAAATAATTT TATTCCTTTA TTGCCTAAAA AACATGTTGA TTTAATTTAT
ATTTGCTCCC CAAATAACCC GACGGGCACC TGCTTAAACA AAGAAGAACT TTCCAAATGG
GTTGAGTACG CGTTAAACAA TAAATCCGTT ATTTTATTTG ACTCCGCTTA CGAGGCTTTT
ATAAGCGAGC CTGACATTCC GCATTCAATT TTTGAAATCC CCGGGGCGGA AAAAGTAGCC
GTTGAGTTCC GCTCATTCTC AAAAACCGCG GGGTTTACAG GTACAAGATG CGCTTATACT
GTTGTTCCCA AAGCGCTCAA GGTATTTGAT AAAGAGGGGG GAGAACATTC TTTAAACTCT
TTATGGGGCC GCAGGCAGTC AACAAAATTT AACGGCGTTC CTTATATAGT GCAAAAAGGG
GCGGAGGCCG TCTATTCCCC GGAAGGGCAA AAGCAGATTA AAGAAAATAT AGCCTATTAT
ATGGAAAACG CCAAAATTAT ACGTGAAGGT TTGCGTTCTT TAGGTTTAAA AATATTTGGC
GGCGTTAACG CGCCTTATAT TTGGATTAAA CTTCCTAAAG GGGTAACCTC GTGGGATTTC
TTTGGCAAAC TGCTTAAAGA AGCAAACGTG GTAGGCACTC CGGGCGCGGG CTTTGGCCCT
TGCGGCGAAG GCTGCTTTAG GCTTACGGCA TTTGGCAGCA GGGAAAATAC AATTAAGGCC
GTTGAAAGAA TTAAACAGTT AAAATTATAA
 
Protein sequence
MTKLNENYLK LQSSYLFSTI AKKVAAYKQE NPSAEIIRLG IGDVTLPLPS AVIEAMHKAV 
DEMAVSSSFK GYGPDYGYDF LRQKIVETDY LARGVQITED EVFISDGAKS DVGNFQEIFD
AKASVAITDP VYPVYLDTNV MAGRTGAFKK GKYSKIVYLP CTAKNNFIPL LPKKHVDLIY
ICSPNNPTGT CLNKEELSKW VEYALNNKSV ILFDSAYEAF ISEPDIPHSI FEIPGAEKVA
VEFRSFSKTA GFTGTRCAYT VVPKALKVFD KEGGEHSLNS LWGRRQSTKF NGVPYIVQKG
AEAVYSPEGQ KQIKENIAYY MENAKIIREG LRSLGLKIFG GVNAPYIWIK LPKGVTSWDF
FGKLLKEANV VGTPGAGFGP CGEGCFRLTA FGSRENTIKA VERIKQLKL
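
The gene and protein listings are printed in space-separated blocks of ten characters, so they need cleaning before any downstream use. The sketch below is one possible approach, not the database's own tooling: it strips the block spacing, translates the first row of the gene sequence, and compares the result with the start of the protein sequence. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. The codon table is built from the standard amino-acid assignments, which translation table 11 shares with the standard code; the two differ only in which start codons are permitted.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third base varies fastest).
# Assumption: amino-acid assignments of table 11 equal the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons from position 0; '*' marks a stop codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First row of the gene sequence and the first 20 residues of the protein,
# copied from the listings above.
first_gene_row = (
    "ATGACAAAAC TTAATGAAAA CTATTTAAAA TTACAAAGCA GCTATCTTTT TTCAACTATA"
)
protein_start = "MTKLNENYLK LQSSYLFSTI"

print(translate(first_gene_row) == clean(protein_start))
```

Passing the full gene sequence to `gc_fraction` gives a value to compare with the GC content field, and `translate` on the full sequence should end in `*` for the terminal stop codon.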