Gene Elen_0295 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_0295 
Symbol 
ID8414579 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp389562 
End bp390842 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content68% 
IMG OID645023272 
ProductO-acetylhomoserine/O-acetylserine sulfhydrylase 
Protein accessionYP_003180675 
Protein GI257790069 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2873] O-acetylhomoserine sulfhydrylase 
TIGRFAM ID[TIGR01326] OAH/OAS sulfhydrylase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAAT CCATCAGCAC CACCTGCGTA CAGGGCGGCT ACCGTCCCGG CGACGGCGAG 
CCGCGCCAGA TTCCCATCTA CCAGTCCACC ACCTGGAAGT ACGACACGAG CGAGCACATG
GGCCGCCTGT TCGACCTCGA GGAGGCCGGG TACTTCTACA CGCGTCTGGC GAACCCCACG
AACGACTTCG TGGCGGCCAA AATCGCCGAG CTCGAGGGCG GCACGGCGGC CATGCTCACG
TCGTCGGGCC AGGCTGCGAA CTTCTTCGCC GTGTTCAACA TCGCCGGCGC GGGCGATCAT
GTCGTAGCCA GCTCGGCCAT CTACGGCGGC ACGTACAACC TGCTGGCCGT GACCATGAAG
CGCATGGGCC TGGAATGCAC GTTCGTTTCG CCCGACTGCA CCGACGAGGA GCTGGAGGCC
GCGTTCCGGC CGAACACGAA GGCCGTGTTC GGCGAGACCA TCGCGAATCC GGCGCTGGCG
GTGCTCGACA TCGAGCGCTT CGCCGCCGCC GCGCACGCGC ACGGGGTGCC GCTGATCGTG
GACAACACGT TCCCCACGCC GGTGAACTGC CGTCCCATCG AGTGGGGCGC CGACATCGTG
ACGCACTCCA CCACGAAGTA CATGGACGGC CATGGCGCCA GCGTGGGCGG GGCCATCGTG
GACTCCGGGA AGTTCGATTG GACGGCGCAT GCCGACAAGT TCCCCGGCCT GTGCGAGCCC
GACGAGAGCT ACCACGGAGT GACGTACACC GAGCGCTTCG GCTTGGGCGG CGCGTTCATC
ACGAAGGCGA CGGCCCAGCT CATGCGCGAC TTCGGCGCCA TCCAGTCGCC GCAGAACGCC
TACCTCGTCA ACCTGGGGCT GGAAAGCCTG CACCTGCGCA TGGCGCAGCA CAGCAAGAAC
GGCCTGGCGT TGGCGCAGCA TCTGGCCGCG CATCCGAAGA TCGCCTGGGT GCGCTACCCC
GGCTTGCCGG GCGACGACCA ATACGAGCTG GCGCAGAAGT ACCTGCCGAA CGGCGCCAGC
GGCGTGGTGA GCTTCGGCGT GGCCGGCGGG CGCGCCGCGG CCGAGACGTT CATGGCGAAC
CTCAAGCTGG CGCAGATAGC CACGCACGTG GCCGATGCGC GCACCTGCGT GCTGCATCCG
GCCAACGCGA CGCATCGCCA GATGAACGAC GCGGAGCTTG CGGCCGCGGG CATCACGCCC
GACCTCATCC GGCTGTCGTG CGGCATCGAG GCCACCGAGG ACCTCGTCGC CGACATAGAC
CAGGCGCTGG CGGCCGTGTA G
 
Protein sequence
MSESISTTCV QGGYRPGDGE PRQIPIYQST TWKYDTSEHM GRLFDLEEAG YFYTRLANPT 
NDFVAAKIAE LEGGTAAMLT SSGQAANFFA VFNIAGAGDH VVASSAIYGG TYNLLAVTMK
RMGLECTFVS PDCTDEELEA AFRPNTKAVF GETIANPALA VLDIERFAAA AHAHGVPLIV
DNTFPTPVNC RPIEWGADIV THSTTKYMDG HGASVGGAIV DSGKFDWTAH ADKFPGLCEP
DESYHGVTYT ERFGLGGAFI TKATAQLMRD FGAIQSPQNA YLVNLGLESL HLRMAQHSKN
GLALAQHLAA HPKIAWVRYP GLPGDDQYEL AQKYLPNGAS GVVSFGVAGG RAAAETFMAN
LKLAQIATHV ADARTCVLHP ANATHRQMND AELAAAGITP DLIRLSCGIE ATEDLVADID
QALAAV