Gene Acid345_4498 details


Gene Information

Locus tag: Acid345_4498
Symbol:
ID: 4070176
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 5339269
End bp: 5340408
Gene Length: 1140 bp
Protein Length: 379 aa
Translation table: 11
GC content: 61%
IMG OID: 637986537
Product: homoserine O-acetyltransferase
Protein accession: YP_593572
Protein GI: 94971524
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2021] Homoserine acetyltransferase
TIGRFAM ID: [TIGR01392] homoserine O-acetyltransferase
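The length fields in this record agree with its coordinates if the start and end positions are read as inclusive, 1-based coordinates. The record does not state that convention, so it is an assumption here. A minimal sketch of the arithmetic, using hypothetical helper names `gene_length` and `protein_length`:

```python
# Coordinates copied from the record above.
START_BP = 5339269
END_BP = 5340408


def gene_length(start: int, end: int) -> int:
    """Feature length for inclusive, 1-based coordinates (assumed convention)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Amino acid count, assuming an unspliced CDS whose last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length = gene_length(START_BP, END_BP)
print(length)                  # 1140
print(protein_length(length))  # 379
```

Both values match the Gene Length (1140 bp) and Protein Length (379 aa) fields.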


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.202595
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.356601
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCACAG GCACATGTAC CGTGAGCGCG GGTGAACCGA TTCCGGCCCC GCGCTCTCAA 
CGCAATCTTC ACCTTATCCA GGGCGCGTTC ACCTTCGCCG ATGAAGGCTT CCCCCTGGAT
AACGGTGGCT CCCTTCGGCC CGTCACCATT CGCTATGCGC AATACGGCGA GCCCAACGCG
AAGGCCGACA ATGTCGTTCT CGTCTGCCAC GCTCTCTCCG GATCCGCCAA GGTTGACGAC
TGGTGGCCCG AACTCTTCGC CGAAGGCGGA TTGCTCGACC TCGATAAATT CTGCGTGATC
GGCACCAACA TCCTCGGCTC CTGCTACGGC TCTACCGGCC CGAATTCCAT CAACGCCGAA
ACCGGACAGC CCTATGGTGC AGATTTTCCG CTCGTCACCA TCAGCGACAT CGTGCGCGCC
CAGGCGAAGC TCCTCGATCA TCTCGGCATC AAGAAGTTGA AGCTTGCCAT CGGCGGTTCG
ATCGGCGGCA TGCAGGCCCT GCACTGGGCC ATGGATTATC CCGATCGCGT CGAGCAGGCG
ATTGCCATCG GCACCGCGCC GCTGGGCGCC CTCGGCCTCG CGCTTAACCA TATCCAGCGC
CAGGTTATCC GCCTCGATCC CAAGTGGAAT GCCGGCTCCT ACTCGCACGA GAATTCGCCA
AGCCAAGGCA TCTCCATCGC GCGCCAGATC GCCATGCTCT CCTACAAATC CGCGGAGTTG
TTCGACGAGC GCTATGGCCG CAAGCTCAAC CGCAACGGCG AAGATCCGTA CACGCACCAT
GAGGCACGCT TCGATGTCGG CGGCTATCTC GATCACCAGG GCGAGAAATT CGTCCAGCGC
TTCGATGCGA ACTCGTACGT TTCGATCACG CGCACCATGG ACACGTTCGA CCCCGTCCGC
AAATACCGCA GTGCCAAAGC CGCGTACAGT CGCATCAAGG CGAAGATCAC GCTGGTAGGG
ATTTCGTCCG ACTGGCTCTT CCCACCGGAA GACGTTCGCA AACTCGCGCA AGAAATGATC
GCTGCCGGAG CCAGCTGCGA TTATCGCGAA ATCATCTCCG CCCACGGCCA CGACGCATTC
TTAGCCGAAC CGGAGAAACT CCTCGAAGTC CTAAGCGACG CCCACGCCCG CCCGGTTTAG
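The GC content field in this record (61%) is presumably the share of G and C bases in the gene sequence, although the record does not say how it was computed. A minimal sketch under that assumption, with hypothetical helpers `clean_sequence` and `gc_content`; the first helper strips the spaces and line breaks used to display the sequence in blocks of 10:

```python
def clean_sequence(text: str) -> str:
    """Remove display whitespace (spaces, line breaks) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First display line of the gene sequence, copied from the record.
first_line = "ATGAGCACAG GCACATGTAC CGTGAGCGCG GGTGAACCGA TTCCGGCCCC GCGCTCTCAA"
print(len(clean_sequence(first_line)))  # 60 bases: six blocks of 10
print(f"{gc_content(first_line):.0%}")  # GC share of this one line only
```

Passing the full gene sequence to `gc_content` gives the figure to compare against the reported value.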
 
Protein sequence
MSTGTCTVSA GEPIPAPRSQ RNLHLIQGAF TFADEGFPLD NGGSLRPVTI RYAQYGEPNA 
KADNVVLVCH ALSGSAKVDD WWPELFAEGG LLDLDKFCVI GTNILGSCYG STGPNSINAE
TGQPYGADFP LVTISDIVRA QAKLLDHLGI KKLKLAIGGS IGGMQALHWA MDYPDRVEQA
IAIGTAPLGA LGLALNHIQR QVIRLDPKWN AGSYSHENSP SQGISIARQI AMLSYKSAEL
FDERYGRKLN RNGEDPYTHH EARFDVGGYL DHQGEKFVQR FDANSYVSIT RTMDTFDPVR
KYRSAKAAYS RIKAKITLVG ISSDWLFPPE DVRKLAQEMI AAGASCDYRE IISAHGHDAF
LAEPEKLLEV LSDAHARPV
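The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is illustrative and not the database's own pipeline. It builds the codon table from the standard NCBI amino acid string, on the understanding that translation table 11 assigns the same amino acids as the standard code and differs only in which codons may act as starts. The name `translate` is hypothetical, and alternative start codons are not handled because this gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence up to, and not including, the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop display spaces and line breaks
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence, copied from the record.
print(translate("ATGAGCACAG GCACATGTAC CGTGAGCGCG"))  # MSTGTCTVSA
```

The output matches the first block of the protein sequence. The last twelve bases of the gene, `CGCCCGGTTTAG`, translate to `RPV` followed by a stop, which matches the end of the protein.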