Gene Mvan_5067 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_5067 
Symbol 
ID4644703 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp5435688 
End bp5436899 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content68% 
IMG OID639808537 
Productacetyl-CoA acetyltransferase 
Protein accessionYP_955844 
Protein GI120406015 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID[TIGR01930] acetyl-CoA acetyltransferases 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.172446 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGAAG AAGCCTTCAT CTACGAAGCG ATCCGCACCC CCCGCGGCAA GCAGAGAAAC 
GGCGCCCTGA ACGAGATCAA GCCGGTGAAC CTTGTCGTCG GCCTGATCGA CGAGATGCGC
GTCCGCTTCC CTGACCTCGA CGAGAACCTG ATCAGCGACC TCATCCTGGG TTGCGTCTCG
CCCGTCGGCG ACCAGGGCGG AGACATCGCC CGCACCGCCG GCCTGGTGGC CGGCCTGCCG
GACACCACCG GTGGCTTCCA GCTCAACCGG TTCTGCGCCT CCGGCCTGGA AGCGGTCAAC
CTGGGCGCCC AGAAGGTGCG GTCGGGCTGG GACGACCTGG TGCTCGCCGG CGGCGTGGAG
TCGATGAGCC GCGTCCCGAT GGGGTCCGAC GGCGGCGCCT GGGCCGGCGA TCCCGAGACC
AACTACCGGA TCGGGTTCGT CCCACAGGGC ATCGGCGCGG ACCTGATCGC CACCATCGAG
GGCTTCTCCC GCGAGGACGT CGACGCCTAC GCGGCGCGGT CGCAGGAGCG TGCCGCGGCA
GCCTGGGCGG GCGGCTACTT CGCCAAGTCC GTGGTGCCGG TCAAGGATCA GAACGGCCTG
GTCGTGCTGG ACCATGACGA GCACATGCGT CCCGGTTCCA CCGTGGAGAG CCTGGGCAAG
CTCAAGACCG CGTTCGACGG TATCGGCGCG ATGGGCGGCT TCGACGATGT GGCGCTGCAG
AAGTACCACT TCGTCGAGAA GATCAACCAC GTCCACACCG GCGGGAACAG CTCGGGCATC
GTCGACGGCG CCGCCCTGCT GCTCATCGGC AGCGAGGCCG CGGGCAAGTC GCAGGGGCTG
ACCCCGCGGG CGCGCATCGT CGCCACCGCG ACCAGCGGCG CCGATCCGGT CATCATGCTG
ACCGGTCCGA CCCCGGCCAC CCAGAAGGTG CTCGACCGGG CCGGGCTGAC CGTCGACGAC
ATCGACCTGT TCGAACTGAA CGAGGCCTTC GCCTCGGTGG TGCTCAAGTT CCAGAAGGAT
CTGAACATCC CGGACGAGAA GCTCAACGTC AACGGTGGCG CCATCGCGAT GGGTCACCCG
CTGGGCGCCA CCGGCGCCAT GATCACCGGA ACCATGGTCG ACGAGCTCGA GCGTCGTGGC
GCGAAGCGTG CGCTGATGAC GCTGTGTGTC GGCGGCGGCA TGGGCGTGGC CACCATCATC
GAGCGAGTCT GA
 
Protein sequence
MSEEAFIYEA IRTPRGKQRN GALNEIKPVN LVVGLIDEMR VRFPDLDENL ISDLILGCVS 
PVGDQGGDIA RTAGLVAGLP DTTGGFQLNR FCASGLEAVN LGAQKVRSGW DDLVLAGGVE
SMSRVPMGSD GGAWAGDPET NYRIGFVPQG IGADLIATIE GFSREDVDAY AARSQERAAA
AWAGGYFAKS VVPVKDQNGL VVLDHDEHMR PGSTVESLGK LKTAFDGIGA MGGFDDVALQ
KYHFVEKINH VHTGGNSSGI VDGAALLLIG SEAAGKSQGL TPRARIVATA TSGADPVIML
TGPTPATQKV LDRAGLTVDD IDLFELNEAF ASVVLKFQKD LNIPDEKLNV NGGAIAMGHP
LGATGAMITG TMVDELERRG AKRALMTLCV GGGMGVATII ERV