Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_4498 |
Symbol | |
ID | 4070176 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 5339269 |
End bp | 5340408 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637986537 |
Product | homoserine O-acetyltransferase |
Protein accession | YP_593572 |
Protein GI | 94971524 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2021] Homoserine acetyltransferase |
TIGRFAM ID | [TIGR01392] homoserine O-acetyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.202595 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.356601 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACAG GCACATGTAC CGTGAGCGCG GGTGAACCGA TTCCGGCCCC GCGCTCTCAA CGCAATCTTC ACCTTATCCA GGGCGCGTTC ACCTTCGCCG ATGAAGGCTT CCCCCTGGAT AACGGTGGCT CCCTTCGGCC CGTCACCATT CGCTATGCGC AATACGGCGA GCCCAACGCG AAGGCCGACA ATGTCGTTCT CGTCTGCCAC GCTCTCTCCG GATCCGCCAA GGTTGACGAC TGGTGGCCCG AACTCTTCGC CGAAGGCGGA TTGCTCGACC TCGATAAATT CTGCGTGATC GGCACCAACA TCCTCGGCTC CTGCTACGGC TCTACCGGCC CGAATTCCAT CAACGCCGAA ACCGGACAGC CCTATGGTGC AGATTTTCCG CTCGTCACCA TCAGCGACAT CGTGCGCGCC CAGGCGAAGC TCCTCGATCA TCTCGGCATC AAGAAGTTGA AGCTTGCCAT CGGCGGTTCG ATCGGCGGCA TGCAGGCCCT GCACTGGGCC ATGGATTATC CCGATCGCGT CGAGCAGGCG ATTGCCATCG GCACCGCGCC GCTGGGCGCC CTCGGCCTCG CGCTTAACCA TATCCAGCGC CAGGTTATCC GCCTCGATCC CAAGTGGAAT GCCGGCTCCT ACTCGCACGA GAATTCGCCA AGCCAAGGCA TCTCCATCGC GCGCCAGATC GCCATGCTCT CCTACAAATC CGCGGAGTTG TTCGACGAGC GCTATGGCCG CAAGCTCAAC CGCAACGGCG AAGATCCGTA CACGCACCAT GAGGCACGCT TCGATGTCGG CGGCTATCTC GATCACCAGG GCGAGAAATT CGTCCAGCGC TTCGATGCGA ACTCGTACGT TTCGATCACG CGCACCATGG ACACGTTCGA CCCCGTCCGC AAATACCGCA GTGCCAAAGC CGCGTACAGT CGCATCAAGG CGAAGATCAC GCTGGTAGGG ATTTCGTCCG ACTGGCTCTT CCCACCGGAA GACGTTCGCA AACTCGCGCA AGAAATGATC GCTGCCGGAG CCAGCTGCGA TTATCGCGAA ATCATCTCCG CCCACGGCCA CGACGCATTC TTAGCCGAAC CGGAGAAACT CCTCGAAGTC CTAAGCGACG CCCACGCCCG CCCGGTTTAG
|
Protein sequence | MSTGTCTVSA GEPIPAPRSQ RNLHLIQGAF TFADEGFPLD NGGSLRPVTI RYAQYGEPNA KADNVVLVCH ALSGSAKVDD WWPELFAEGG LLDLDKFCVI GTNILGSCYG STGPNSINAE TGQPYGADFP LVTISDIVRA QAKLLDHLGI KKLKLAIGGS IGGMQALHWA MDYPDRVEQA IAIGTAPLGA LGLALNHIQR QVIRLDPKWN AGSYSHENSP SQGISIARQI AMLSYKSAEL FDERYGRKLN RNGEDPYTHH EARFDVGGYL DHQGEKFVQR FDANSYVSIT RTMDTFDPVR KYRSAKAAYS RIKAKITLVG ISSDWLFPPE DVRKLAQEMI AAGASCDYRE IISAHGHDAF LAEPEKLLEV LSDAHARPV
|
| |