Gene Acid345_4499 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4499 
Symbol 
ID4070177 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5340405 
End bp5341748 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content60% 
IMG OID637986538 
ProductO-acetylhomoserine/O-acetylserine sulfhydrylase 
Protein accessionYP_593573 
Protein GI94971525 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2873] O-acetylhomoserine sulfhydrylase 
TIGRFAM ID[TIGR01326] OAH/OAS sulfhydrylase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.675979 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.356601 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGAGCA ATAACGGAAA AGAAGCACCG AAACCGCATT TAGCTACGCT GGCGGTCCAT 
GGCGGCCAGG AACGCGATCC CGCTACCAAG TCGCGCGCCG TCCCCATCTA CCAGACCACC
TCGTACCTCT TCGACGACGC CGACCACGCC GCCCGCCTCT TCGCCCTCCA GGAATTCGGC
AACATCTACA CCCGCATCAT GAACCCCACC ACCGACGTCT TCGAGAAGCG CATTGCCGCA
CTAGAAGGCG GCGCCGCCGG TCTTGCGACG GCCTCAGGCC AGGCCGCCGA GACGCTGACG
ATTATCACCC TCGCCAATGC TGGCGATGAG ATCGTCTCAA CCACGTCGCT TTACGGTGGA
ACCTACAACC TCTTCCACTA CACGTTCCCG AAGCTCGGCA TCAATGTGAA GTTCGTGGAT
GCCGACGACT TCGACGGCCT GCGCAAGGCC ATCACGCCGA AGACGAAAGC GGTCTTTGCG
GAAACGCTTG GCAATCCCAA GCTCGACGTG ACCGACATCG AAACGATTGC AAAGATCGCA
CACGAGAACG GCCTTCCGTT CATCATCGAC AACACGTCGG CTTCACCCGC GCTGCTGCGT
CCGATCGAGT GGGGCGCCGA CATCGTGATC AACTCGGCGA CGAAATTCAT CGGCGGCCAC
GGCACCACCA TCGGCGGCAT CATCGTGGAT GCTGGCAAGT TCGACTGGAA GGCCAGCGGC
CGCTTCCCGG ATTTCGTAAA CCCCGACCCG TCGTATCACG GTCTCAGCTT CTGGGACGCT
TTCGGTCCGT TGGCGTTCAT CCTCAAGGCG CGCGTGCAAG GCTTGCGTGA TACCGGCGCG
GCGCTCTCGC CGTTCAATTC GTTCCTGCTG CTGCAAGGCA CGGAAACACT GCACCTTCGT
TTGCAGCGAC ACTCCGAGAA TGCGCTCAAA GTTGCGAAGC ATCTCGAGGA GCATCCGGCG
ATCGAGTGGG TGAACTATCC CGGACTGAAG TCGAGCAAGT ACTACGCCCG CGCGCAGAAG
TATCTGCCTG ATGGCCAGGG CGCGCTGCTC ACCTTTGGCA TCAAGGGCGG ATTCGAGGCC
GGCAAGAAGC TGATCAACTC GCTGAAGTTG TTTAGCCTCG TCGCTAACAT CGGAGACTCG
AAGTCCCTCG TCATCCATCC GTCGTCAACA ACCCACCAAC AGTTATCCGA AGCAGAACAG
AAAGATACCG GTGTCACACC CGAGCTCGTT CGTCTCAGCG TCGGCATCGA GGACGTCCGC
GACATCATCG CCGACCTCGA CCAGGCTCTT GAGGTTGCAA CCGGTGTCTC CAACCAACTC
CAACCAGCAG GAAGTGCACG ATGA
 
Protein sequence
MSSNNGKEAP KPHLATLAVH GGQERDPATK SRAVPIYQTT SYLFDDADHA ARLFALQEFG 
NIYTRIMNPT TDVFEKRIAA LEGGAAGLAT ASGQAAETLT IITLANAGDE IVSTTSLYGG
TYNLFHYTFP KLGINVKFVD ADDFDGLRKA ITPKTKAVFA ETLGNPKLDV TDIETIAKIA
HENGLPFIID NTSASPALLR PIEWGADIVI NSATKFIGGH GTTIGGIIVD AGKFDWKASG
RFPDFVNPDP SYHGLSFWDA FGPLAFILKA RVQGLRDTGA ALSPFNSFLL LQGTETLHLR
LQRHSENALK VAKHLEEHPA IEWVNYPGLK SSKYYARAQK YLPDGQGALL TFGIKGGFEA
GKKLINSLKL FSLVANIGDS KSLVIHPSST THQQLSEAEQ KDTGVTPELV RLSVGIEDVR
DIIADLDQAL EVATGVSNQL QPAGSAR