Gene Ccel_1622 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_1622 
Symbol 
ID7310375 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp1957908 
End bp1959164 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content35% 
IMG OID643608551 
Producthomoaconitate hydratase family protein 
Protein accessionYP_002505954 
Protein GI220929045 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0065] 3-isopropylmalate dehydratase large subunit 
TIGRFAM ID[TIGR01343] homoaconitate hydratase family protein
[TIGR02086] 3-isopropylmalate dehydratase, large subunit 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCAGA CATTGGTAGA AAAAATATTT TCAAACAAAA TAGGAAAAGC TGTTTACGCA 
AATGGCACAG TTTTTTCTCC TATAGACCTT GCAATGGGTA CTGATGCAAC TATACCGATT
ACTATTGAGA CATTTAATGA GTTTGGTCTT GAAAATATTG TTAATCCAGA CAAAATTGTT
CTTGTAAATG ATCACTTTGT TCCGGCAAAG GATATTGCGA CTGCTAAATT TGCACTTATG
ATGAAAGAGT TTGCTAAGAA ACAGAATATA AAGAACTTTT TTGAAATTGG GAGAAGCGGT
ATTTGTCATG TACTTTTGCC CGAAAAAGGT TTTATCAAAC CTTATGATAT TGTTGTCGGT
GTAGATTCTC ATACGTGTAC ATATGGTGGA CTAAGTGCAT TTTCAACTGG TGTTGGTTCT
ACAGATATGG CGTGTGTATG GGCAACAGGA AAACTATGGT TTAGGGTTCC TGAAACTACA
AAGATAATAT TTTCTGGTGA ACTACCTAAA GGAGTTTACG CAAAAGACTT AGCCTTGTAT
TTAATTGGTC AATTAGGCAT TAATGGAGCC AACTATGATA TGCTGGAATT TGATGGCGAA
TTAATAAAAA ATCTTGATAT CTCAAGCAGA TTAACACTTT GCAATATGGT TATAGAAATG
GGTGCCAAAG CCGGAATTAT AAATGCAGAT GCTGTTACAC AAAAGTATTT CAGAGAGAAA
AATCTTAAGA CGGAAGATAT AGATTTTAAT TCTGATAATG GTGCCAGATA TAAGAAAATT
ATCGAAATTG ATGTTTCAAA AATTGAACCA GTCATTGCAT GTCCATATAG TCCTGGTAAT
ATAAAAACTG CAAAAGAGCT GGTGGATTTA AAAATAGACC AAGTAGTAAT CGGTTCTTGT
ACAAATGGAA GAATAGAAGA TTTCAGAGTA GCTCATAAAT ATTTGAAAGA TAATGAGGTT
CACCATGAGG TTAAATTAAT TGTCATACCT GGTTCACAAG AAGTGTTGAA ACAAATGGAG
GAAGAAAGTA TATTGATAGA TTTTATTAAA TGCGGAGCAC TAATATCACC TCCCACCTGC
GGACCTTGTA TGGGAGGACA TATGGGGGTA CTTGCAAATA ATGAAATAGG ATTGTTTACA
ACTAATAGAA ACTTTCTCGG TAGAAATGGT GACTCCTCAG CACAAGTCTA TTTATGTAAT
CCTGCGATAG CTGCATACTC AGCAACCAAA GGTAGCATAC AGATACCAGA ATTCTAA
 
Protein sequence
MKQTLVEKIF SNKIGKAVYA NGTVFSPIDL AMGTDATIPI TIETFNEFGL ENIVNPDKIV 
LVNDHFVPAK DIATAKFALM MKEFAKKQNI KNFFEIGRSG ICHVLLPEKG FIKPYDIVVG
VDSHTCTYGG LSAFSTGVGS TDMACVWATG KLWFRVPETT KIIFSGELPK GVYAKDLALY
LIGQLGINGA NYDMLEFDGE LIKNLDISSR LTLCNMVIEM GAKAGIINAD AVTQKYFREK
NLKTEDIDFN SDNGARYKKI IEIDVSKIEP VIACPYSPGN IKTAKELVDL KIDQVVIGSC
TNGRIEDFRV AHKYLKDNEV HHEVKLIVIP GSQEVLKQME EESILIDFIK CGALISPPTC
GPCMGGHMGV LANNEIGLFT TNRNFLGRNG DSSAQVYLCN PAIAAYSATK GSIQIPEF