Gene Caul_5144 details

Gene Information

Locus tag: Caul_5144
Symbol:
ID: 5897382
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010335
Strand:
Start bp: 64004
End bp: 65206
Gene Length: 1203 bp
Protein Length: 400 aa
Translation table: 11
GC content: 68%
IMG OID: 641555247
Product: acetyl-CoA acetyltransferase
Protein accession: YP_001676578
Protein GI: 167621793
COG category: [I] Lipid transport and metabolism
COG ID: [COG0183] Acetyl-CoA acetyltransferase
TIGRFAM ID: [TIGR01930] acetyl-CoA acetyltransferases
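
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates (the record does not state its convention) and an unspliced CDS ending in a single stop codon; the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded, assuming an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(64004, 65206)
print(length, protein_length(length))  # 1203 400
```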


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.227388
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.833116
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGCCC AACCCGATCC GATCGTCATC GTCGCGGCTG TTCGTACGCC GCTGGGTCGT 
TTGAAGGGCG TGCTTTCGAG CTTGCCCGCC CATCAACTGG GAGCGCACGC CATCGCCTCT
GCGCTGGACC GGGCGGGTCT GGCGGTGGAT CGGGTTGATG AGGTCGTCAT GGGCTGCGTC
CTGCCCGCCG GCCAGGGCCA GGCGCCGACA CGCCAGGCCG CGCGCGGCGC GGGTTTGCCG
GACGCCGTGG CGGCCGCCAC CGTCAACAAG GTCTGTGGAT CGGGCATGAA GGCCACGATG
CTGGCCCACG ACATGATCCT GGCGGGCTCG GCCGACTTCG TGGTGGCCGG CGGCATGGAG
TCGATGTCCA ACGCCCCCTA CCTGCTCGAA AAGGCTCACG CCGGAGAGTA TCGCCCCGGG
TACGACCGCC TCTTCGACCA CATGATGCTC GACGGCCTCG AGGACGCCTA TGAGCGAGGC
CGTCCCATGG GGGATTTTGG TGAAGCGGCC GCCGAAGCCT ACGGGTTTAG TCGCCAAGAC
CAGGACGACT ACGCGCTGCG TACGCTCACA CGAGCCCGGG CCGCGATCGA AAGCGGTGAT
TTCGACGCCG AGATCGCGCC CCTGGAGTTT GACGGGCGGG ACGGCGCGGT GCGGGTCGAG
CACGACGAGC ATCCGCTCAA GGTCTCTCCG GAAAAGATCC CCAAGCTCAA GCCCGCCTTC
CGTCCGGACG GCACCATCAC GGCCGCCAGC GCCTCTGCCA ACGCCGATGG CGCGGCCGCC
TTGCTGTTGA CCCGTCGTTC GATCGCCAAG CGCGAAGGCC TTCCTATTCT GGCCGAGATC
AGGGGCCACG CGACCCACAG CCAGGATCCG GCCTGGTACA CGACGGCCCC TGCGCCAGCG
ATCCGCAAGC TGCTCGACAA GGTGGGATGG ACGCCACAGG ACGTGGATCT GTACGAGGTC
AACGAGGCCT TCGCGGTCGT GGCCATGATC GCCGGGCGCG AGCTGGACAT CCCGACCGAG
AAGCTCAACG TCAATGGCGG CGCCTGCGCT CTTGGCCACC CGATCGGCGC CACGGGAGCG
CGGTTGATCG TGACCCTCCT GCACGCCCTG GCGCGGCGTG GCCAAACGCG CGGCGTTGCG
GCGCTGTGCA TCGGCGGCGG CGAGGCCACT GCCGTGGCCA TCGAACTCCC CAAGGCCGCC
TAG
 
Protein sequence
MSAQPDPIVI VAAVRTPLGR LKGVLSSLPA HQLGAHAIAS ALDRAGLAVD RVDEVVMGCV 
LPAGQGQAPT RQAARGAGLP DAVAAATVNK VCGSGMKATM LAHDMILAGS ADFVVAGGME
SMSNAPYLLE KAHAGEYRPG YDRLFDHMML DGLEDAYERG RPMGDFGEAA AEAYGFSRQD
QDDYALRTLT RARAAIESGD FDAEIAPLEF DGRDGAVRVE HDEHPLKVSP EKIPKLKPAF
RPDGTITAAS ASANADGAAA LLLTRRSIAK REGLPILAEI RGHATHSQDP AWYTTAPAPA
IRKLLDKVGW TPQDVDLYEV NEAFAVVAMI AGRELDIPTE KLNVNGGACA LGHPIGATGA
RLIVTLLHAL ARRGQTRGVA ALCIGGGEAT AVAIELPKAA
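
Both sequences are printed in space-separated blocks of ten characters. A minimal sketch for joining such blocks into a contiguous string and checking the nucleotide-to-residue length relationship follows; `join_blocks` and `lengths_agree` are hypothetical helper names, and only the opening portion of each sequence is used as sample input.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split())


def lengths_agree(dna: str, protein: str, has_stop: bool = True) -> bool:
    """True if the DNA length equals 3 nt per residue, plus a stop codon if present."""
    codons = len(protein) + (1 if has_stop else 0)
    return len(dna) == 3 * codons


# First line of the gene sequence (a partial sequence, so it has no stop codon).
dna_line = "ATGAGCGCCC AACCCGATCC GATCGTCATC GTCGCGGCTG TTCGTACGCC GCTGGGTCGT"
# First two blocks of the protein sequence: the 20 residues those 60 nt encode.
protein_line = "MSAQPDPIVI VAAVRTPLGR"

dna = join_blocks(dna_line)
protein = join_blocks(protein_line)
print(len(dna), len(protein))                        # 60 20
print(lengths_agree(dna, protein, has_stop=False))   # True
```

Applied to the full record, the same check would use `has_stop=True`, since the gene sequence ends in the stop codon TAG.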