Gene Caul_0385 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0385 
SymbolmetX 
ID5897659 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp424650 
End bp425801 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content68% 
IMG OID641560870 
Producthomoserine O-acetyltransferase 
Protein accessionYP_001682020 
Protein GI167644357 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2021] Homoserine acetyltransferase 
TIGRFAM ID[TIGR01392] homoserine O-acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value0.859111 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGAAC TCGACTTCGC GGGACCGGTG CTGGCAGAGG GCGGAACCTG GCGCTTCCCG 
CCCGATCGCC CGCTGCCGCT GGATTCTGGC GCCAAGCTGG AAAACCTCGA GATCGCCTAT
CGCACCTGGG GCACGCTGAA TGCGGACGGC ACGAACGCCG TCCTGATCTG TCACGCCCTG
ACCGGCGACC AGCACGTGGC CGGCAGCCAT CCCACGACCG GCAAGCCTGG CTGGTGGGCG
CGCCTCGTCG GCCCCGGCAG GCCGCTTGAT CCGGCCAGGC ATTTCATCAT CTGTTCGAAT
GTGGTCGGCG GCTGCATGGG ATCGACCGGT CCGGCCTCGA TCAACCCGGC CACCGGCAAG
GTCTACGGCC TGACCTTCCC GGTGATCACC ATCGCCGACA TGGTGCGCGC CCAGGCGATG
CTGGTCGAGG CGCTGGGCGT CCAGACCCTG CTGGCCGTGG TCGGCGGTTC GATGGGCGGC
ATGCAGGTCC AGCAATGGGC GGCCGACTAT CCTGGCAAGC TGTTCAGCGC GGTGATCGTC
GCCTCGGCGT CACGCCACTC CGCCCAGAAC ATCGCCTTCC ACGAGGTGGG TCGCCAGGCG
ATCATGGCCG ATCCCGACTG GAAGGCCGGG GCCTACGCCC AGGGCAAGTC GCGCCCTGAA
AAGGGCCTGG CCGTCGCCCG GATGGCCGCC CATATCACCT ATCTGTCCGA GCCGGCCCTG
CAACGGAAGT TCGGCCGCGA GCTGCAGCGC GACGGCCTGT CCTGGGGTTT TGACGCCGAC
TTCCAGGTGG AGAGCTATCT GCGCCACCAG GGCGCGACCT TCGTCGACCG CTTCGACGCC
AATTCCTATC TCTACATCAC CCGGGCCATG GACTATTTCG ACCTGGCCGC CGCGCACGGC
GGGGTGTTGG CCCAGGCCTT CGCCGGCGCG CGCGACGTGC GCTTCTGCGT GCTGTCGTTC
ACCAGCGACT GGCTCTATCC GACCGCCGAG AACCGCCACA TCGTGCGGGC GCTCACCGCC
GCCGGTTGCC GCGCGGCCTT CGTCGAGATC GAGAGCGACA AGGGCCACGA CGCCTTCCTG
CTGGACGAGC CGGTCATGGA CGCGGCGCTG CACGGATTTC TGAGCTCGGT GGAACGGGAG
CGGGGGCTCT AG
 
Protein sequence
MAELDFAGPV LAEGGTWRFP PDRPLPLDSG AKLENLEIAY RTWGTLNADG TNAVLICHAL 
TGDQHVAGSH PTTGKPGWWA RLVGPGRPLD PARHFIICSN VVGGCMGSTG PASINPATGK
VYGLTFPVIT IADMVRAQAM LVEALGVQTL LAVVGGSMGG MQVQQWAADY PGKLFSAVIV
ASASRHSAQN IAFHEVGRQA IMADPDWKAG AYAQGKSRPE KGLAVARMAA HITYLSEPAL
QRKFGRELQR DGLSWGFDAD FQVESYLRHQ GATFVDRFDA NSYLYITRAM DYFDLAAAHG
GVLAQAFAGA RDVRFCVLSF TSDWLYPTAE NRHIVRALTA AGCRAAFVEI ESDKGHDAFL
LDEPVMDAAL HGFLSSVERE RGL