Gene Acid345_3918 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3918 
Symbol 
ID4071301 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4631745 
End bp4632782 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content56% 
IMG OID637985944 
Producthomoserine O-acetyltransferase 
Protein accessionYP_592992 
Protein GI94970944 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2021] Homoserine acetyltransferase 
TIGRFAM ID[TIGR01392] homoserine O-acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.759741 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0489209 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTATCGAT GGTTATTAAT CTGCGCTACG GTACTCATGC TTGGTGGATT TGCGCTTGCC 
GAGGGTGAGC AGCAATTCGC GGATCTTGGG CAGTGCAAGG TCGAGAGCGG CGAGACTATC
CAGAATTGTC GCATCGGGTA TCGCACTTGG GGCAAGTTGA ACGCGGAGCA GTCGAATATC
GTGGTCCTGC TGACGTGGTT CACCGGAACG AGCGAGCAGC AGGCCGGGAG CGTGGGCGCC
GATAAATACG TGGATCCCGC GCACTATTAC GTTGTCGCGA TTGATGCGCT GGCGAATGGC
GTGAGTTCAT CACCGTCAAA CAGCAAAGCG CAGCCGAGAA TGAAGTTTCC GCAGATCACC
ATCGCCGACA TGGTGGAATC GCAGCATCGG TTGTTGACCG AGACGCTGAA GTTGAAGCAC
ATTCGCGCTG TGCTCGGTGG TTCCATGGGC GGGATGCAGG CGTTTCAATG GGCGGTGCAA
TATCCGGATT ACATGGACGC GGTGATCTCC ATCGTGGGCA CCACGCAGAT GACAGCACAC
GACCTGTTGC TGTGGCGCGC GGAGAAGAAT GCGATTCTCG AAAACAAGAA CTTTAACGAT
GGAGATTACA AGGCGGGCTT GCTGATTCCG TCGGTGGCAG ACATTCACCA CTTGGAGTTG
ACGACGCCGG ACAGAATCAA CGACGACACG CTGCCAAAGA ACTTTCCGAC GGCAGCGGAG
AAGATCGAAG CGTCAGAGAC GATGGACCCA TGCGATCGGT TGCGTCAACT CGATGCGATG
ATGACGCACG ACATCTCGAT GCGATTCAAC GGACAGATGT CTGGCGCGGC GAAGGCGGTG
AAGGCACACA TGCTGATTAT CGTGTCGAAC AGTGACCACA TGGTCAACCC GCATCCGGCC
ATGGTGTTTG CCGAGCTATT GCTGAATGTC CCGATGCAGC TCGATTCTAC TTGCGGCCAT
CTTGCGCCTG GATGCCGCGA AGAACAAGTG GTACCGGCAG TTCACCGAGC TCTCGAACTG
AAGTCATTCT TGCAATGA
 
Protein sequence
MYRWLLICAT VLMLGGFALA EGEQQFADLG QCKVESGETI QNCRIGYRTW GKLNAEQSNI 
VVLLTWFTGT SEQQAGSVGA DKYVDPAHYY VVAIDALANG VSSSPSNSKA QPRMKFPQIT
IADMVESQHR LLTETLKLKH IRAVLGGSMG GMQAFQWAVQ YPDYMDAVIS IVGTTQMTAH
DLLLWRAEKN AILENKNFND GDYKAGLLIP SVADIHHLEL TTPDRINDDT LPKNFPTAAE
KIEASETMDP CDRLRQLDAM MTHDISMRFN GQMSGAAKAV KAHMLIIVSN SDHMVNPHPA
MVFAELLLNV PMQLDSTCGH LAPGCREEQV VPAVHRALEL KSFLQ