Gene Cpha266_1766 details


Gene Information

Locus tag: Cpha266_1766
Symbol:
ID: 4570110
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides DSM 266
Kingdom: Bacteria
Replicon accession: NC_008639
Strand:
Start bp: 2007411
End bp: 2008490
Gene Length: 1080 bp
Protein Length: 359 aa
Translation table: 11
GC content: 49%
IMG OID: 639766349
Product: homoserine O-acetyltransferase
Protein accession: YP_912207
Protein GI: 119357563
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2021] Homoserine acetyltransferase
TIGRFAM ID: [TIGR01392] homoserine O-acetyltransferase
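
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information record above.
START_BP = 2007411
END_BP = 2008490


def gene_length(start_bp, end_bp):
    """Return the gene length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_bp):
    """Return the residue count, assuming an unspliced CDS ending in a stop codon."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_bp // 3 - 1


print(gene_length(START_BP, END_BP))                  # 1080
print(protein_length(gene_length(START_BP, END_BP)))  # 359
```

Both results agree with the Gene Length (1080 bp) and Protein Length (359 aa) fields of the record.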


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGAGAGACT ACAGAGAGCT TATTTCAGAT AGAACACTTT ATTTTGCCTC CGATGTGCCG 
TTTCAGACCG AACTTGGAGC TTTGTTACCA GAAGTTCGTA TTGCCTATCG TACATGGGGA
ACCCTTAATG CATCGAAAAA CAATGTTATT CTGATCTGTC ACGCCCTCAC CGGTTCAGCC
GATGCCGATT TCTGGTGGAA GGGTATGTTT GCAGAAGGAG GAGCATTCGA TGAAACGGAA
AATTTTATCA TTTGCAGCAA TGTTCTCGGG AGCTGTTACG GTACGACTGG ACCAATTTCA
CCTAACCCGT TGACCGGGCG CCGTTACGGT TCAGATTTTC CTCTGATCAC CATTCGTGAT
ATGGTGAACC TTCAGCATCG CCTGCTCGAC GAACTTGGCA TTGCAGAACT TCAACTTGTT
GTCGGAGCCT CTCTGGGAGG TATGCAGGTG CTTGAATGGG GTTTTCTTTA TCCTGGCATG
GTTAAGGCTA TGATGCCAAT GGGTATTTCG GGGCGACATT CAGCATGGTG TATTGCGCAG
AGTGAGGCGC AACGCCAGGC TATCTATGCT GATCGGGATT GGCGTGACGG ATGGTATCGG
GAGGATGCGC CTCCTGCCGG AGGGTTTGCT GCGGCAAGGA TGATGGCCAT GTGCTCCTAT
AGAAGTTACG AGAATTTTCA GACACGGTTT GGACGCAATC AGACCGAAAG CTCTCTTTAT
GAGGTTGAAA ACTACCTTCA CCATCAGGGA GAAAAGCTGG TAGAACGCTT TGATGCCAAT
ACCTACATTA CCCTGACAAA AGCAATGGAT ACGCATGATG TCGCAAGGGG AAGGGGGGCT
TATGAGGATG TGCTTGGATC GTTACGGATT CCTGTTGAAA TTCTTTCAAT CAATTCCGAT
GTTCTATACC CGAAAGAGGA GCAGGAGGAG CTGGCCAGGC TTATTCCAAC CGCAGGCATC
ATCTACCTTG AAGAGCCATA CGGACATGAC GCTTTTCTTA TTGATACCGA AAAGGTCAGC
CGCATGGTAA GGGAATTTAT GAATGATCGG GCGTCGGGCG GAGATCAGGA TGTTAGCTGA
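
The gene sequence is displayed in blocks of ten bases, so the spaces and line breaks have to be removed before it can be analyzed. As a rough sketch, the helpers below strip that layout and compute a GC percentage of the kind reported in the GC content field. The names `clean_sequence` and `gc_percent` are hypothetical, and the rounding used by the source database is not known.

```python
def clean_sequence(text):
    """Remove display whitespace (spaces, line breaks) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(text):
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First display row of the gene sequence above: six blocks of ten bases.
first_row = "ATGAGAGACT ACAGAGAGCT TATTTCAGAT AGAACACTTT ATTTTGCCTC CGATGTGCCG"
print(len(clean_sequence(first_row)))  # 60
```

Passing the full 1080 bp sequence to `gc_percent` gives a value to compare with the 49% shown in the record.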
 
Protein sequence
MRDYRELISD RTLYFASDVP FQTELGALLP EVRIAYRTWG TLNASKNNVI LICHALTGSA 
DADFWWKGMF AEGGAFDETE NFIICSNVLG SCYGTTGPIS PNPLTGRRYG SDFPLITIRD
MVNLQHRLLD ELGIAELQLV VGASLGGMQV LEWGFLYPGM VKAMMPMGIS GRHSAWCIAQ
SEAQRQAIYA DRDWRDGWYR EDAPPAGGFA AARMMAMCSY RSYENFQTRF GRNQTESSLY
EVENYLHHQG EKLVERFDAN TYITLTKAMD THDVARGRGA YEDVLGSLRI PVEILSINSD
VLYPKEEQEE LARLIPTAGI IYLEEPYGHD AFLIDTEKVS RMVREFMNDR ASGGDQDVS