Gene Acel_2138 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_2138 
Symbol 
ID4485606 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp2418117 
End bp2419196 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content62% 
IMG OID639730940 
Productmyo-inositol-1-phosphate synthase 
Protein accessionYP_873896 
Protein GI117929345 
COG category[I] Lipid transport and metabolism 
COG ID[COG1260] Myo-inositol-1-phosphate synthase 
TIGRFAM ID[TIGR03450] inositol 1-phosphate synthase, Actinobacterial type 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTTCAA TTCGAGTAGC CATCGTTGGC GTCGGCAACT GCGCCGCCTC GTTGGTTCAG 
GGCGTCGAGT ATTACCGGAA CGCAAGCCCT GACGAGCGTG TGCCGGGGTT GATGCACGTG
CAGTTCGGCC CTTACCACGT ACGGGACGTC GAATTCGTCG CCGCCTTCGA CGTTGACGCG
AAGAAGGTCG GCCTCGATCT CGCGGACGCC ATCGGCGCCA GCGAGAACAA CACCATAAAG
ATTTGCGACG TGCCCCGGAC CGGCGTCATC GTGCAGCGCG GTCACACCTA CGACGGGCTC
GGCGAATACT ACCGGGAGCG CATTCAAGAG TCCGATGAGC CGCCGGTGGA CGTGGTCAAG
GTCCTGCGTG ACACCCAGGC GGATGTCCTC GTCTCCTACC TTCCGGTGGG TTCTGAGGTT
GCGGACCGCT TCTACGCGCA GTGCGCTCTC GACGCCGGTG TGGCGTTCGT CAACGCCCTG
CCGGTTTTCA TCGCCAGCGA TCCGGTCTGG GCGCAGAAAT TCACCGACGC CGGCGTACCC
ATCATCGGTG ACGACATCAA GTCGCAGGTC GGCGCCACCA TCACCCATCG CGTCCTCGCC
AAGCTCTTCG AGGACCGCGG CGTCGAACTG CTCCGCACGT ACCAGCTGAA CTTCGGCGGG
AACATGGATT TCATGAACAT GCTGGAGCGG CAGCGGCTCC AATCGAAGAA AATTTCCAAG
ACGCAGTCGG TCACCTCGCA AATCCCTCAC GAGATGGAGC GTGCGGCCGT GCACATCGGC
CCGAGCGACT ACGTGCCGTG GCTGGATGAC CGCAAATGGG CGTATGTCCG CTTGGAGGGA
CGAGCCTTCG GCGACGTACC GCTCAATTTG GAATACAAGC TCGAGGTGTG GGATTCACCG
AATTCCGCCG GTGTCATCAT CGACGCCATT CGAGCCGCGA AAATCGGCAA GGACCGCGGC
ATCGGCGGAC CGCTGCTCTC CGCGGCCAGT TACTTCATGA AGAGCCCCCC GGTGCAGTAC
AGCGATGACG AGGCCCGCCG GCTCGTGGAG GACTTCATCG CCGGAAAAGT CGAACGCTGA
 
Protein sequence
MGSIRVAIVG VGNCAASLVQ GVEYYRNASP DERVPGLMHV QFGPYHVRDV EFVAAFDVDA 
KKVGLDLADA IGASENNTIK ICDVPRTGVI VQRGHTYDGL GEYYRERIQE SDEPPVDVVK
VLRDTQADVL VSYLPVGSEV ADRFYAQCAL DAGVAFVNAL PVFIASDPVW AQKFTDAGVP
IIGDDIKSQV GATITHRVLA KLFEDRGVEL LRTYQLNFGG NMDFMNMLER QRLQSKKISK
TQSVTSQIPH EMERAAVHIG PSDYVPWLDD RKWAYVRLEG RAFGDVPLNL EYKLEVWDSP
NSAGVIIDAI RAAKIGKDRG IGGPLLSAAS YFMKSPPVQY SDDEARRLVE DFIAGKVER