Gene Mjls_4069 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMjls_4069 
Symbol 
ID4879777 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. JLS 
KingdomBacteria 
Replicon accessionNC_009077 
Strand
Start bp4300681 
End bp4301799 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content72% 
IMG OID640141380 
Productputative thiolase 
Protein accessionYP_001072334 
Protein GI126436643 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.655327 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.671191 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCCGA TGTCCTTGCG CCCCGGCAAG ACACCCGTGG AGCTGGCCGC TCAGGCCAGC 
GGCGCCGCGC TGGCCGACGC CGGCATTGCG CGCAGCGACG TGGACGGCCT GCTGGTCGGC
TCCTCGCAGG GCGTGCGGCC GGATCGGCTC GGTGTCGGTT TCGCCGCCCA GGCGGGCTTC
GCCGACCTAC GCCTGCTCGA ACACGTCGAG ATCAAGGGCG CCACCACGAT TGCCATGATC
CAGCGCGCGC GCCACGCGAT CGCCACCGGC GAGGCCTCGA CCGTGCTGTG CGTATTCGCC
GATGCCCCGC TGGTGGCCGG ACGGGGTGCC GGATCGACCT ACGCGCAGAG CGGCGGCAAC
AACGGAACGC GCGGCCTGGA GCGGGCCTCC GGCCTGCTCG GCTCGGTGCC GACCTACGCG
CTGCTGGCCC AGCGGTGGCT GCACGTCACC GGAACCGGTG CCGAGGCGCT GCGTTCGGTG
GCCACCACGG CGCGACGCTG GGCGCAGGAC AATCCCCATG CGGTCAACCG TGAACCGCTC
GACGACGACG GCTACCAGCG AAGTCCGATG ATCGCCGAAC CGCTGCGGCT GCTGGACTGC
GCAAGACCGG TCAACGGCGC TGTCGCGGTG GTGCTCACCG GTCGAGCTTC GGTCGGCACC
ACGCGCGTTC GCGTGCGCGG TGCCGGGAGG GACCATCCGG TGCGTCGCCG CCGGGCAGGC
GCCGAGTCGT GGTTCGGTGG CGGCGGCCGG GCGGTGGAGG ACGCGCTCGA CCAGGCCGGC
ATGTCCCGAT CGGACCTCGA TGTTGCTGAG CTCTACGACC CGTTCTCGAT CGTCACCCTG
GTGCTGCTCG ACGAATACCG TCTCACCGGC GGCGTACCCG CAGGCGCCTT CGTCCGCGAC
GGCCACACCG GCCCGGGCGG CACGCTGCCC ACCAACACCG GTGGTGGTCA GCTCTCCGGC
TTCTACCTGC AGGGCATGAC GCCGCTCGCC GAGGCCGTGA TCCAGTTACG CGGCGCCGGT
GGGCAGCGCC AAGTCCCCGA TGCCGCCGTG GCCCTGGTCG GCGGCATCGG TGGCCGGCTG
GACCACCACG CCGCACTGGT TCTGGAGCGG GCGGCATGA
 
Protein sequence
MTPMSLRPGK TPVELAAQAS GAALADAGIA RSDVDGLLVG SSQGVRPDRL GVGFAAQAGF 
ADLRLLEHVE IKGATTIAMI QRARHAIATG EASTVLCVFA DAPLVAGRGA GSTYAQSGGN
NGTRGLERAS GLLGSVPTYA LLAQRWLHVT GTGAEALRSV ATTARRWAQD NPHAVNREPL
DDDGYQRSPM IAEPLRLLDC ARPVNGAVAV VLTGRASVGT TRVRVRGAGR DHPVRRRRAG
AESWFGGGGR AVEDALDQAG MSRSDLDVAE LYDPFSIVTL VLLDEYRLTG GVPAGAFVRD
GHTGPGGTLP TNTGGGQLSG FYLQGMTPLA EAVIQLRGAG GQRQVPDAAV ALVGGIGGRL
DHHAALVLER AA