Gene Apar_0372 details

Gene Information

Locus tag: Apar_0372
Symbol:
ID: 8413221
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 427063
End bp: 428343
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 50%
IMG OID: 645021940
Product: O-acetylhomoserine/O-acetylserine sulfhydrylase
Protein accession: YP_003179394
Protein GI: 257784177
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2873] O-acetylhomoserine sulfhydrylase
TIGRFAM ID: [TIGR01326] OAH/OAS sulfhydrylase
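
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported gene length includes the stop codon (the gene sequence in this record ends in TAG, which is consistent with that). The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(427063, 428343)
print(length)                     # 1281
print(protein_length_aa(length))  # 426
```

Both values agree with the Gene Length and Protein Length fields of the record.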


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGAGA GCATCGAGAC TCGCTGCGTA CAAGGCGGTT ATCAGCCTGG TTGTGGCGAG 
CCTAGGCAAG TGCCTATCAT TCAATCAACC ACGTTCAAAT ATGACGAGTC GATGCAGCTT
GGAGAGTTGT TCGACCTCAA AGCTGCTGGC TATTTTTATT CTCGTGTTCA GAATCCAACG
CTTGATAATG TAGCGTCAAA GATTTGTGCT CTTGAGGGCG GCACTGCTGC AATGCTGACA
TCCTCTGGTC AAGCAGCAAA CTTCTTTGCC GTCTTTAATA TTGCTGAGGC TGGAGATCAT
TTTATTGCGC TTTCCACCAT TTATGGCGGC ACCTTTAACC TGTTTGCTAT CACCCTCAAG
AAGATGGGTG TGGAGTGTAC GTTTATCTCG CCAGATGCAA CTGACGAGGA GATCAATGCT
GCCTTTAGGT CAAATACTAA GTGCGTTTTT GGCGAGACTA TTGCAAATCC TGCACTGGTT
GTTCTTGATA TTGAACGCTG GGCAAAGGCT GCTCATGACC ATGGTGTGCC GCTGATTGTT
GACAATACCT TTGCTACACC TGTGAACTGT CGTCCTCTTG AGTGGGGAGC AGATATTGTG
ACCCATTCTA CTACCAAGTA TATGGACGGT CACGGCTGTG CTGTTGGCGG TGCAATTGTT
GATGGCGGCA ACTTTGATTG GGCTACTCAT GCAGACAAGT TCCCAGGTCT GACCCAGCCC
GACCCTTCGT ATCACAACCT TGTTTATACC GATACCTTTG GCAACGGCGG CGCTTTTATT
ACGAAGGCAA CTGTTCAGCT TATGCGCGAC TTTGGCTCCA TTCAGTCACC TCAGAGCGCT
TTCTATCTCA ACCTTGGCCT TGAGTCTCTA CACGTTCGTA TGGCTCAGCA TTGCAAGAAT
GGCCAAGCAG TTGCCGAGGC ACTTGCTAAT AACCCTAAGG TTGCCCATGT AAGCTATCCA
GATCTTCCGG GTGATACGTA CTATGACCTG GCTCAGAAGT ACCTTCCTGG TGGGTCCTGT
GGTGTTATCA CCGTTGATGT TGCGGGTGGT CGCGAGGCTG CCGAGAAATT CCTGGGTAAC
CTCAAAGTCT TCTCCATTGC AACACATGTT GCAGACGCTC GTTCTTGCTG TCTGCACCCA
GCTTCTTCAA CGCATCGTCA GCTTACTGAC GAGGAGCTTG TAGCAGCAGG TATTACTCCA
GGTACCGTTC GCTTAAGCTG TGGTATTGAG GGCACTGAGG ACCTTATCAA CGACGTCGAG
CAGGCACTTG CTGCTCTGTA G
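
The sequence above is laid out in space-separated blocks of 10 bases, 60 bases per line. As a rough sketch of how the GC content field could be recomputed from such a listing, the snippet below strips the whitespace and counts G and C bases. The helper names are hypothetical, and only the first line of the sequence is used as sample input here.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a block-formatted nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence from this record, used as sample input.
first_line = "ATGTCTGAGA GCATCGAGAC TCGCTGCGTA CAAGGCGGTT ATCAGCCTGG TTGTGGCGAG"
print(len(clean_sequence(first_line)))   # 60
print(round(gc_percent(first_line), 1))  # GC percentage of this line only
```

Passing the full multi-line listing to `gc_percent` gives the value for the whole gene.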
 
Protein sequence
MSESIETRCV QGGYQPGCGE PRQVPIIQST TFKYDESMQL GELFDLKAAG YFYSRVQNPT 
LDNVASKICA LEGGTAAMLT SSGQAANFFA VFNIAEAGDH FIALSTIYGG TFNLFAITLK
KMGVECTFIS PDATDEEINA AFRSNTKCVF GETIANPALV VLDIERWAKA AHDHGVPLIV
DNTFATPVNC RPLEWGADIV THSTTKYMDG HGCAVGGAIV DGGNFDWATH ADKFPGLTQP
DPSYHNLVYT DTFGNGGAFI TKATVQLMRD FGSIQSPQSA FYLNLGLESL HVRMAQHCKN
GQAVAEALAN NPKVAHVSYP DLPGDTYYDL AQKYLPGGSC GVITVDVAGG REAAEKFLGN
LKVFSIATHV ADARSCCLHP ASSTHRQLTD EELVAAGITP GTVRLSCGIE GTEDLINDVE
QALAAL
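
The protein sequence corresponds to a translation of the gene sequence with translation table 11. The sketch below shows one way to check this on short fragments. It assumes the amino acid assignments of table 11 match the standard code (the two tables differ in their permitted start codons, which this sketch ignores) and that translation stops at the first stop codon. The function and variable names are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence in this record.
print(translate("ATGTCTGAGA GCATCGAGAC TCGCTGCGTA"))  # MSESIETRCV
print(translate("CAGGCACTTG CTGCTCTGTA G"))           # QALAAL
```

The two fragments reproduce the first ten and the last six residues of the protein sequence listed above.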