Gene Emin_0286 details


Gene Information

Locus tag: Emin_0286
Symbol:
ID: 6262849
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 304952
End bp: 306217
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 42%
IMG OID: 642610751
Product: O-acetylhomoserine/O-acetylserine sulfhydrylase
Protein accession: YP_001875184
Protein GI: 187250702
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2873] O-acetylhomoserine sulfhydrylase
TIGRFAM ID: [TIGR01326] OAH/OAS sulfhydrylase
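
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS that ends with a single untranslated stop codon; neither assumption is stated in this record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming the CDS ends with one stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Start bp and End bp as listed for Emin_0286
length = gene_length(304952, 306217)
print(length, protein_length(length))
```

With the values listed for Emin_0286, this gives 1266 bp and 421 aa, matching the Gene Length and Protein Length fields.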


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 60
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAAAAC AACATATTGA AACTTTGGCC GTATCATACG GCTTTGAAAT TGACGAAACG 
GGATCAAGCA ACCCGCCGTT ATATCTTTCA AACGCTTATA AATTTAATGA CGCTAAACAT
GCAAAAGACC TGTTTGACCT CAAAGCCCCG GGTTATATTT ACACAAGGCT AAACAACCCG
ACAAACAATT TTTTGGAAGA AAGAATTAAC GCTCTCGAAG GCGGCGCGGG AACATTGGTA
ACGGCTTCGG GCCATTCGGC CGAGTTTATG ACAATATGCG CCCTTGCCGA AACGGGCGAC
GAAATAATTT CCTCTAACGC TTTATACGGC GGCACATTTA ACATGTTTTC CCATTCGCTC
CGCCGTTTGG GCATAAAAGT AAAATTTGCC GATGTTTCAA ACCCCGCCGA GTTTGAAAAC
CTGGTAACGG ATAAAACAAA AGCCATTTTT GTTGAGTCCA TAAGCAACCC CGGCTGTGAG
ATACCTGACT TTGAGCAACT TTCCAAAATA GCTAAAAAAC ATAAAATCCC CTTTATAGTT
GATAATACCT GCATGACCCC ATACCTTTTT AAACCCAAAG ATTTCGGCGC GGATATAATA
ATACATTCAA CCACAAAGTT TTTGTCGGGC CACGCGGCGG TGATGGGCGG CTCTGTAACG
GATTGCGGCA CTTTTGACTG GACAAGCGGG CGTTTCCCCT CTTTTTGCAA CCCCGACCCA
AGCTACCACA ATATAGTTTA CGCCAAAGAT TTTGCACAAA ACGCTTTTAT AGTAAAACTG
CGCACCCAGG TTTTAAGAGA TATCGGAGCG TGCCAAAGCC CTTTTAACTC TTATCTTACA
TTGCAAGGCA TACAAACTCT TCATGTGCGT ATGGACAGAC ATTTGGAAAA CACTCTAAAA
CTTATTGATT ACTTAAAAAA TAATCCCAAA ATAGCGTGGG TAAAATACCC GCTTGTAGAA
GGGAATCCTT TTAAACAAAC GGCTGAAAAA TATTTTAAAA AAGGTTGCGG GTCGCTTTTT
TCCTTTGGGT TAAAAGGCGG TTATGAAGCG GGAAAGAAAC TAATAGAAAA CGTAACCCTT
TGCCTGCACG CCACAAATTT AGGAGACGTA AGGACAATAG TCACTCACCC GGCCAGCACA
ACGCACAGCC AGTTAACGAA GGAAGAAAAA CAAAAAACGG CAATAGGCGA TGACCTTATA
AGGATTTCCG TAGGCATTGA AAATATTGAT GACATTATAG CCGACTTGGA GAAAGCGCTT
ATATGA
 
Protein sequence
MTKQHIETLA VSYGFEIDET GSSNPPLYLS NAYKFNDAKH AKDLFDLKAP GYIYTRLNNP 
TNNFLEERIN ALEGGAGTLV TASGHSAEFM TICALAETGD EIISSNALYG GTFNMFSHSL
RRLGIKVKFA DVSNPAEFEN LVTDKTKAIF VESISNPGCE IPDFEQLSKI AKKHKIPFIV
DNTCMTPYLF KPKDFGADII IHSTTKFLSG HAAVMGGSVT DCGTFDWTSG RFPSFCNPDP
SYHNIVYAKD FAQNAFIVKL RTQVLRDIGA CQSPFNSYLT LQGIQTLHVR MDRHLENTLK
LIDYLKNNPK IAWVKYPLVE GNPFKQTAEK YFKKGCGSLF SFGLKGGYEA GKKLIENVTL
CLHATNLGDV RTIVTHPAST THSQLTKEEK QKTAIGDDLI RISVGIENID DIIADLEKAL
I