Gene Acel_0712 details


Gene Information

Locus tag: Acel_0712
Symbol:
ID: 4485132
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidothermus cellulolyticus 11B
Kingdom: Bacteria
Replicon accession: NC_008578
Strand:
Start bp: 779201
End bp: 780865
Gene Length: 1665 bp
Protein Length: 554 aa
Translation table: 11
GC content: 68%
IMG OID: 639729480
Product: putative alpha-isopropylmalate/homocitrate synthase family transferase
Protein accession: YP_872471
Protein GI: 117927920
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0119] Isopropylmalate/homocitrate/citramalate synthases
TIGRFAM ID: [TIGR00977] 2-isopropylmalate synthase/homocitrate synthase family protein
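
The start, end, gene length, and protein length fields in this record can be checked against each other. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS length counts the terminal stop codon; neither convention is stated in the record itself. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 779201
END_BP = 780865
GENE_LENGTH_BP = 1665
PROTEIN_LENGTH_AA = 554


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one, assuming a terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold: 780865 - 779201 + 1 = 1665 bp, and 1665 / 3 - 1 = 554 aa.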


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAGCG AACCCACCGG GCGGCACTCG GCGGCTGCGC CGGCCGGTTT TCAGGTCTAC 
GACACCACCC TGCGTGACGG CGCCCAGGGT GAAGGAATGG CCCTCACGGT GGCGGACAAG
CTCGCCATTG CCCGGCACCT CGACGACCTC GGTGTCGGCT TCATTGAAGG CGGCTGGCCG
GGCGCCCTGC CGAAAGACAC CGAATTCTTC CGCCGGGCGC GGACCGAATT GACCCTCCGC
AACGCGGTGT TGGTCGCGTT CGGCGCGACC CGTAAGCCCG AGAGCAGGGT GGAAGCAGAT
CCGCAGGTTC TCGCCCTCCT CGAAGCGGAG ACCCCGGCCG TCTGCCTGGT GGCCAAGAGC
CACGTGCGTC ACGTCAGCGA GGCGCTCCGG ACCTCGTTGG ACGAGAACCT CGCGATGATC
CGCGATACTG TCGCGTTCTT CCGCCGGGAA GGACGACGGG TTTTCCTCGA CGCCGAGCAC
TTCTTCGACG GGTACGCCGC TGATCCGCAG TACGCCGTCG AGTGTGTCCG GGTCGCTGCG
GAGTCCGGAG CCGAGGTCGT CGTCCTGTGT GACACCAACG GGGGAATGCT CCCGCCGCGG
ATCGCTGACG TCGTGCACGA GGTTGCCGAG CGGACCGGCG TGCCGCTGGG CATCCACTGC
CACGATGACA CCGGCTGCGC GGTCGCGAAC ACGCTCGCGG CGGTCGACGC CGGTGTCGTG
CAGGTGCAGG GTGTCGTCAA CGGGTACGGC GAGCGGTGCG GCAACGCCAA CCTGATCACT
GTGGTGGCCA ACCTCGAGCT GAAGATGGGC CGCCGGGTGC TGCCGCCCGG CCGACTGGCC
GAGCTCGGCC GGGTGAGCCA CGCGATCGCG GAGGTCGCGA ACCAGCCGCC GCGGGCCAAC
CAGCCGTACG TGGGGCTCTC GGCGTTCGCG CACAAGGCCG GCCTCCACGC CTCGGCGATC
AAGGTGTCAC CGGATCTCTA CCAGCACATC GACCCGGCCT TGGTCGGCAA CGACATGCGG
ATGCTCGTCT CGGAGATGGC CGGCCGGGCC AGTGTCGAGT TGAAGAGCCG CCAGCTCGGC
TTTGACCTGT CCGGTCAGCG TGATGCGGTG AGCCGGATCG TCGAGCGGGT GAAGAATCTC
GAAGCACGCG GGTTCATGTT CGAAGCCGCC GATGCCTCCT TCGAATTGCT GCTCCGTGAA
GAGCTTGACG GCGTCCCGAC CCGGTTCTTC GACCTGGAGT CCTGGCGGGT CATCGTTGAG
CGACGCGCGG ACGGCGAGGT GGTCTCGGAG GCCACCGTGA AGGTGGTGGT CAAGGGGGAG
CGGATCGTCG CGACCGCGGA AGGCAACGGC CCGGTGAACG CGCTGGACCG GGCGTTGCGG
CAGGCGCTGG AGCGGCTGTA CCCGCAGCTT GCCGAGCTCG AACTCGTCGA CTACAAGGTC
CGCATCCTGG ACGGCTCGCA CGGCACCGGC GCGGTGACCC GGGTGCTGAT TGAGACCAGT
GACGGCGAGA CCGAGTGGAC GACGATCGGC GTCGACGGCA ATGTGATCTC CGCTTCCTGG
CAGGCCTTGG ACGACGCGTA CATGTACGGC TTGCTGCGTC AGCACGCCGG CTCGGGCGAC
CCGGCTGCCC AGGGGGTGTC AACCGCCGGT CCGCGGACTC GATAG
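
The GC content field reported for this gene is the share of G and C bases in the gene sequence. A minimal sketch of that calculation follows, assuming the sequence is given in the space-separated blocks of ten bases used above; `gc_percent` is a hypothetical helper name, not part of any database tooling.

```python
def gc_percent(seq):
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # First two blocks of the gene sequence above, used as a small example.
    print(gc_percent("ATGAGCAGCG AACCCACCGG"))  # 13 of 20 bases are G or C: 65.0
```

Passing the full gene sequence to the same function gives the value to compare with the reported GC content.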
 
Protein sequence
MSSEPTGRHS AAAPAGFQVY DTTLRDGAQG EGMALTVADK LAIARHLDDL GVGFIEGGWP 
GALPKDTEFF RRARTELTLR NAVLVAFGAT RKPESRVEAD PQVLALLEAE TPAVCLVAKS
HVRHVSEALR TSLDENLAMI RDTVAFFRRE GRRVFLDAEH FFDGYAADPQ YAVECVRVAA
ESGAEVVVLC DTNGGMLPPR IADVVHEVAE RTGVPLGIHC HDDTGCAVAN TLAAVDAGVV
QVQGVVNGYG ERCGNANLIT VVANLELKMG RRVLPPGRLA ELGRVSHAIA EVANQPPRAN
QPYVGLSAFA HKAGLHASAI KVSPDLYQHI DPALVGNDMR MLVSEMAGRA SVELKSRQLG
FDLSGQRDAV SRIVERVKNL EARGFMFEAA DASFELLLRE ELDGVPTRFF DLESWRVIVE
RRADGEVVSE ATVKVVVKGE RIVATAEGNG PVNALDRALR QALERLYPQL AELELVDYKV
RILDGSHGTG AVTRVLIETS DGETEWTTIG VDGNVISASW QALDDAYMYG LLRQHAGSGD
PAAQGVSTAG PRTR
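
The protein sequence corresponds to the gene sequence translated codon by codon. The sketch below is one way to reproduce that, not the database's own method. It builds the codon table from the compact NCBI layout (bases in T, C, A, G order); translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which codons may act as start codons, which is not an issue here because the gene starts with ATG. The `translate` function is illustrative and stops at the first stop codon.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order: first base varies slowest, third base fastest.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq):
    """Translate a DNA sequence in frame 1 until the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence above.
    print(translate("ATGAGCAGCG AACCCACCGG GCGGCACTCG"))  # MSSEPTGRHS
    # Last 15 bases, ending in the TAG stop codon.
    print(translate("CCGCGGACTC GATAG"))  # PRTR
```

The two outputs match the first and last residues of the protein sequence listed above.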