Gene Clim_1824 details

Gene Information

Locus tag: Clim_1824
Symbol:
ID: 6355164
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 1999279
End bp: 2000355
Gene Length: 1077 bp
Protein Length: 358 aa
Translation table: 11
GC content: 55%
IMG OID: 642669427
Product: homoserine O-acetyltransferase
Protein accession: YP_001943842
Protein GI: 189347313
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2021] Homoserine acetyltransferase
TIGRFAM ID: [TIGR01392] homoserine O-acetyltransferase
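
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1999279
end_bp = 2000355
gene_length_bp = 1077
protein_length_aa = 358

# Assumption: Start bp and End bp are 1-based, inclusive coordinates,
# so the span covers (end - start + 1) bases.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted
# in the reported protein length.
codons, remainder = divmod(gene_length_bp, 3)

print("span matches gene length:", span == gene_length_bp)
print("whole number of codons:", remainder == 0)
print("codons minus stop matches protein length:", codons - 1 == protein_length_aa)
```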


Plasmid Coverage information

Num covering plasmid clones: 38
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGGGATT ACAGGGAGCT CATTTCAGAG AAGACTCGAT ATTTTGTATC GCAGAAACCG 
TTTGCAACAG AGTCTGGCGG CGTGCTGCCC GAACTGCGTA TCGCTTACAG AACATGGGGA
AAACCTGATC AGGAGAAGAG TAACGTTATT CTGATCTGCC ATGCGTTGAC TGGTTCGGCC
GATGCCGATG TATGGTGGGA CGGCATGTTC GCCGAAGGGG GTGCGTTCGA TGAGGCGAAA
GATTTCATTA TCTGCTGCAA TGTGCTTGGA AGCTGTTACG GCACAACCGG TCCGCTGTCG
CTGAATCCGC TGACAGGCCG ACATTACGGG CCTGATTTTC CCCGAATCAC CATCCGCGAC
ATGGTGCATG CCCAGAGGCT GCTGCTTGAC GAATTCGGTA TCGATCGCAT TCGTCTTGTG
GTCGGCGCTT CACTCGGCGG CATGCAGGTG CTCGAGTGGG GATTCCTTTA CCCGAAAATG
GTGCAGGCCA TGATGCCGAT GGGGGTTTCC GGGCGACATT CGTCATGGTG CATTGCCCAG
AGTGAGGCTC AGCGTCAGGC TATCTATGCC GATCGCGACT GGAACGGCGG CTGGTATGCG
GCAGATTGTC CGCCGGCTTC GGGTCTGGCG GCTGCGAGGA TGATGGCCAT GTGCAGCTAC
CGGAGTTTCG AGAATTTCCA GTCCCGTTTC GGGCGTGATG TTCAGGATGA CGGGTTGTTC
CGGGTGGAGA GCTATCTGCA CTATCAGGGG CGGAAGCTGG TTGACCGGTT TGATGCCAAC
ACCTATGTGA CCCTGACGAA AGCCATGGAT ATGCATGATC TTTCGAGGGG AAGAGGCGTG
TATGAAGAGG TTCTCGGCTC ATTGCAGATA CCGGTGGAAA TTCTCTCCAT CATCAGTGAT
GTGCTCTATC CGAAAGAGGA GCAGGAGGAG CTCGGACGGC TCATGCAGCA TTCACGGGTG
ATCTATCTCG ACGAACCTTA CGGCCATGAC GCTTTTCTTA TCGATGTCGA AAAGGTAGGC
CGGATGGTCA GGGAGTTCAA GGATGAACGG GCAGTCAAGG CGCACAGCGC AGCCTGA
 
Protein sequence
MRDYRELISE KTRYFVSQKP FATESGGVLP ELRIAYRTWG KPDQEKSNVI LICHALTGSA 
DADVWWDGMF AEGGAFDEAK DFIICCNVLG SCYGTTGPLS LNPLTGRHYG PDFPRITIRD
MVHAQRLLLD EFGIDRIRLV VGASLGGMQV LEWGFLYPKM VQAMMPMGVS GRHSSWCIAQ
SEAQRQAIYA DRDWNGGWYA ADCPPASGLA AARMMAMCSY RSFENFQSRF GRDVQDDGLF
RVESYLHYQG RKLVDRFDAN TYVTLTKAMD MHDLSRGRGV YEEVLGSLQI PVEILSIISD
VLYPKEEQEE LGRLMQHSRV IYLDEPYGHD AFLIDVEKVG RMVREFKDER AVKAHSAA
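
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before they can be used programmatically. The sketch below shows one way to do that, then compute a GC fraction and translate the coding sequence. It assumes the codon-to-amino-acid assignments of translation table 11 are the same as the standard code (the two differ in which codons may act as start codons, which does not matter here because this gene begins with ATG). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and convert to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence, copied from the record above.
first_line = "ATGAGGGATT ACAGGGAGCT CATTTCAGAG AAGACTCGAT ATTTTGTATC GCAGAAACCG"
print(translate(first_line))
print(round(gc_fraction(first_line), 3))
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence is a quick way to confirm that the two blocks in the record agree.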