Gene Caci_6242 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_6242 
Symbol 
ID8337605 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp7179277 
End bp7180512 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content67% 
IMG OID644959343 
Producthomoserine O-acetyltransferase 
Protein accessionYP_003116937 
Protein GI256395373 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2021] Homoserine acetyltransferase 
TIGRFAM ID[TIGR01392] homoserine O-acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0424143 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.823707 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACAAAC CCGCCACGGT GCCGCCGACC GGCTCGGTCG GCACCGTCGA GACGCGGTTC 
CTCGACCTGC CCGAACCGGT CCCGCTGGAA TGCGGCCGGG AACTGAGCGG AGTGCGGGTC
GCCTACGAGA CGTACGGGAG CCTGTCCCCC GCGCGTGACA ACGTCATTCT GATCTGTCAC
GCCTTGAGCG GGGACGCCCA CGCCGCCGGG ATATCCGCGG CCGCAACCAC GACGGCAGGC
ACGCGCGACG GCTTCGCCGC CGAGGACCGC GACGGCAGCG CCGGAGCGAG CCTGGGCTGG
TGGGACGGGA TGATCGGGCC CGGCAAGGCC TTCGACACCG AGCGATACTT CATCATCTCC
ACCAACTTGC TGGGCGGATG CCGCGGGACG ACCGGACCGC GATCGACGAA CCCTGGCACC
GGGCTCCCCT ACGGACCGGA CTTCCCGGTG ATCACCGTCG CGGACATGGT GCGGACCCAG
CGACGCTTTC TGGACCGGCT CGGCATCGAA CGCCTCGCGG CGGTCGCGGG CGGATCCCTT
GGCGGTATGC AGGCGCTGGA ATGGGCTGTG CTGTTCCCGG ATCAGGTCGA CGCGATCGTG
GTCATAGCGT CCACGCATGC CCTGCATCCG CAAGGGGTGG CGTGGAACGC AATCGCCCGC
GAAGCCATCA TGGGCGACCC GGCGTGGCAG GGTGGCCGCT ACCACGGGAC CGGCCGGACG
CCTGACGCCG GCATGGGTGT GGCGCGCATG GTCGGGCATG TCACCTACCT GTCGGGTCCT
GCGCTGGAGG CGAAGTTCGC CCGGCGGTTG CAGGCCTCCG AGCAGATCCG CCACACCCTC
ACCGAGCCTG AGTTCGCGGT TGAGAGCTAT CTGAACCATC AGGCTGCCTC GTTCGTGAAG
CGGTTTGATG CGAACACTTA TCTATACATG TCGCGCGCGC TGACGTACTT CGACCTGGCG
CGCCAGCACG GCGACGGCTC GTTGAAGCAC GCGCTGGAAG GCGTCTTGGC GCGGACGCTG
CTCATCGCGT TCAGCTCGGA CTGGCTGTAT CCGCCTTCGG CTTCGGACGA GATCGCCGAT
GCGTTGCGCT CGCTCGGCAA GCCGGTGGAC TACCACTTGA TCGAGGCGCC GTACGGGCAC
GACAGTTTCC TGCTTGAGGA AGCACGCCAG ATTCCCATCG TCCGCCAGTT CCTGGAGGAT
GGGATCCAGA CGACGATGAG GACTGCGACT CCATGA
 
Protein sequence
MHKPATVPPT GSVGTVETRF LDLPEPVPLE CGRELSGVRV AYETYGSLSP ARDNVILICH 
ALSGDAHAAG ISAAATTTAG TRDGFAAEDR DGSAGASLGW WDGMIGPGKA FDTERYFIIS
TNLLGGCRGT TGPRSTNPGT GLPYGPDFPV ITVADMVRTQ RRFLDRLGIE RLAAVAGGSL
GGMQALEWAV LFPDQVDAIV VIASTHALHP QGVAWNAIAR EAIMGDPAWQ GGRYHGTGRT
PDAGMGVARM VGHVTYLSGP ALEAKFARRL QASEQIRHTL TEPEFAVESY LNHQAASFVK
RFDANTYLYM SRALTYFDLA RQHGDGSLKH ALEGVLARTL LIAFSSDWLY PPSASDEIAD
ALRSLGKPVD YHLIEAPYGH DSFLLEEARQ IPIVRQFLED GIQTTMRTAT P