Gene Mflv_2040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMflv_2040 
Symbol 
ID4973362 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium gilvum PYR-GCK 
KingdomBacteria 
Replicon accessionNC_009338 
Strand
Start bp2118059 
End bp2119276 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content69% 
IMG OID640456249 
Productacetyl-CoA acetyltransferase 
Protein accessionYP_001133306 
Protein GI145222628 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID[TIGR01930] acetyl-CoA acetyltransferases 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.239863 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.333816 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGAAG CCGTCATCGT CGCCACCGCA CGTTCACCCA TCGGCAGGGC CAACAAGGGC 
TCGCTGGTGG CGATGCGTCC CGACGATCTC GCCGCCCAGA TGGTCCGCGC GGCGCTGGAC
AAGGTGCCGT CGCTGGACCC GCGCGACATC GACGACCTGA TGATGGGCTG CGCGCAGCCC
GCCGGTGAGG CCGGCTACAA CATCGCGCGG GCCGTCGCCG TCGAGCTCGG CTACGACTTC
CTGCCCGGCA CGACCGTGAA CCGGTACTGC TCGTCGTCGC TGCAGACCAC GCGGATGGCG
TTCCACGCGA TCAAGGCCGG CGAGGGCCAC GCGTTCATCT CGGCCGGCGT CGAGACGGTG
TCGCGGTTCG GCAAGGGTGC GGCCGACGGC GCGCCGGACT CGAAGAACCC GATCTTCGCC
GACGCGCAGG AGCGGTCGGC CAAGGCCGCC GAGGGTGCCG AGGAATGGCA CGATCCGCGC
GAGGACGGGC TGCTGCCCGA CGTGTACATC GCGATGGGTC AGACCGCCGA GAACGTCGCG
GCGTTCACCG GAATCAGCCG TGAGGACCAG GACCACTGGG GCGTGCGTTC GCAGAACCGC
GCGGAAGAGG CGATCAACAG CGGCTTCTTC GACCGCGAGA TCGTGCCGGT GACGCTGCCG
GACGGAACCG TGGTGTCCAA GGACGACGGG CCGCGCGCGG GTACCAGCTA CGACAAGATC
AGCCAGCTCA AGCCGGTGTT CCGGCCCAAC GGCACGATCA CCGCCGGCAA TGCGTGCCCG
CTCAACGACG GCGCCGCCGC GGTCGTCATC ATGAGCGACA CCAAGGCCAA GGAACTCGGC
CTGACCCCGC TGGCGCGCAT CGTCTCCACG GGTGTGTCGG GTCTGTCGCC CGAGATCATG
GGCCTGGGCC CCATCGAAGC CGTCAAGAAG GCGCTCGCCA ACGCGAAGAT GAGCATCTCC
GACATCGACC TCTACGAGAT CAACGAGGCG TTCGCGGTGC AGGTGCTCGG CTCGGCCCGT
GAGCTCGGCA TGGACGAGGA CAAGCTCAAC GTGTCCGGCG GCGCGATCGC GCTGGGTCAC
CCGTTCGGCA TGACCGGCGC CCGCATCACC GCGACGCTGC TCAACAACCT CGCCACCCAT
GACAAGACGT TCGGCATCGA GTCGATGTGC GTCGGCGGCG GGCAGGGCAT GGCGATGGTC
GTGGAGCGGC TCTCCTAG
 
Protein sequence
MPEAVIVATA RSPIGRANKG SLVAMRPDDL AAQMVRAALD KVPSLDPRDI DDLMMGCAQP 
AGEAGYNIAR AVAVELGYDF LPGTTVNRYC SSSLQTTRMA FHAIKAGEGH AFISAGVETV
SRFGKGAADG APDSKNPIFA DAQERSAKAA EGAEEWHDPR EDGLLPDVYI AMGQTAENVA
AFTGISREDQ DHWGVRSQNR AEEAINSGFF DREIVPVTLP DGTVVSKDDG PRAGTSYDKI
SQLKPVFRPN GTITAGNACP LNDGAAAVVI MSDTKAKELG LTPLARIVST GVSGLSPEIM
GLGPIEAVKK ALANAKMSIS DIDLYEINEA FAVQVLGSAR ELGMDEDKLN VSGGAIALGH
PFGMTGARIT ATLLNNLATH DKTFGIESMC VGGGQGMAMV VERLS