Gene Cagg_0969 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0969 
Symbol 
ID7268043 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1197231 
End bp1198367 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content57% 
IMG OID643565818 
Producthomoserine O-acetyltransferase 
Protein accessionYP_002462323 
Protein GI219847890 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2021] Homoserine acetyltransferase 
TIGRFAM ID[TIGR01392] homoserine O-acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0768373 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.579179 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGCAA TTGCTCCGGC GCCAACCTCC GAAGGGGTTG GCTTCGTGCG TACCCAGCGC 
ATGCATTGGA CGACACCGCT GACCCTGACG AGCGGCGTGA CATTGGGGCC GCTGACGATT
GCCTACGAGA CTTACGGTGA ACTAGCGCCC GACCGCTCGA ATGCCATTCT GATTCTGCAC
GCTTTGTCGG GTGATGCACA CGCAGCCGGT TATCATAGTC CTACTGATCG CAAACCGGGC
TGGTGGGATG GAATGATCGG GCCGGGCCGC GCATTTGATA CCAACCGCTA CTTTGTGATC
TGCTCGAATG TGATCGGTGG CTGCCGCGGT TCGACCGGCC CGTCGAGTCC ACATCCGGTT
GATGGGAAGC CCTACGGTTC ACGGTTTCCG ATCATTACCA TTGAGGATAT GGTACACGCC
CAACAACGCC TGATTGATGC CCTTGGGATT GATACCTTGT TGGCCGTAGC CGGTGGTTCG
ATGGGTGGAT TTCAGGCATT GGCATGGGCG GTCGAGTATC CGCAGCGTGT GCGTGGTGCG
ATCTTGTTGG CGACGAGTGC GCGTTCGAGT CCGCAGACGG TAGCGTGGAA TTATATCGGC
CGGCGTGCAA TTATGGCCGA TCCGCGTTGG CGTGGTGGCG ACTACTACGA TGGTGAGCCG
CCACGTGATG GTTTGGCGGT AGCACGCATG CTCGGTCATA TTACGTATCT CTGTGAGCCA
AAGTTGGAGC AGCGGTTTGG GCGGCGCGGT GATCCCGGAC CGCTTGACCT TGGGCCACAT
TTTGCGATTG AACACTATCT TGAGCATCAG GCGGCACGTT TCAACGAACG GTTTGATGCC
AATTCTTATT TGACGATCAC GCGCGCTATG GACAGTTGGG ATCTTGCGGC GCGCTACGGC
TCCTTAACAG CGGCGTTTGA TCTGGCACGA GCACGGTTTT TGGCGTTGGC CTACAGCAGC
GATTGGCTCT ATCCACCGAG CGAGACGTAT CACATGGCAG TAGCGGCACA GGCTGCCGGG
CGGTCGTTTA CAACGCATCT GATCATGACT GACGCCGGCC ACGATGCGTT TCTGACCGAT
ATAGCTGCCC AGAGTGTTGT CATTCGGGAA TTTTTGGATC GGTTAGGGTC GGAATAG
 
Protein sequence
MEAIAPAPTS EGVGFVRTQR MHWTTPLTLT SGVTLGPLTI AYETYGELAP DRSNAILILH 
ALSGDAHAAG YHSPTDRKPG WWDGMIGPGR AFDTNRYFVI CSNVIGGCRG STGPSSPHPV
DGKPYGSRFP IITIEDMVHA QQRLIDALGI DTLLAVAGGS MGGFQALAWA VEYPQRVRGA
ILLATSARSS PQTVAWNYIG RRAIMADPRW RGGDYYDGEP PRDGLAVARM LGHITYLCEP
KLEQRFGRRG DPGPLDLGPH FAIEHYLEHQ AARFNERFDA NSYLTITRAM DSWDLAARYG
SLTAAFDLAR ARFLALAYSS DWLYPPSETY HMAVAAQAAG RSFTTHLIMT DAGHDAFLTD
IAAQSVVIRE FLDRLGSE