Gene Ccel_3433 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_3433 
Symbol 
ID7311995 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp3995749 
End bp3997311 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content39% 
IMG OID643610342 
Productputative alpha-isopropylmalate/homocitrate synthase family transferase 
Protein accessionYP_002507701 
Protein GI220930792 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0119] Isopropylmalate/homocitrate/citramalate synthases 
TIGRFAM ID[TIGR00977] 2-isopropylmalate synthase/homocitrate synthase family protein 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAAATA ACAAAATAAT TATGTATGAT TCAACTTTGA GAGATGGGGC TCAGGCACTG 
GGAATATCAT TTACAGTTGA GGATAAACTC AAAATTGTTT CAAAGCTTGA TGAACTGGGA
ATTGATTACA TCGAGGCGGG AAACCCCGGT TCCAATCCGA AAGACCTAGA GTTTTTTGAA
AAAGTATTAA AATTAAAGTT GAGGACTTCT AAAATAATTG CCTTTGGTTC AACCAGAAGG
GCAAATATAA AAGTAGAAGA TGATGCGAAC ATACAGTCTC TTCTAAGTGC AGGTACCGAA
GCAATAGCTG TGTTCGGAAA GGCTTGGGAT TTCCAGGTTA CGGATATATT AAAAACTACT
CTGGATGAAA ATTTTGATAT GATATACGAT ACCGTCAAAT TTTTAAAGGA ACAAGGAAAA
ACTGTTGTTT ACGATGCAGA ACATTTTTTT GACGGTTACG ATGCAAATCC CGAATATGCT
ATTGAGACAT TAAAGAGGGC ATATAATGCA GGAGCGGATA CCATATGCCT CTGTGATACA
AAGGGAGCAG GTTTGCCGTC CTATATAGCG AGAGTAACAA AGAAAGTCAG AGAAGAGGTG
GATTGTGCAA TCGGAATCCA CTGTCATAAT GATAATGGGA TGGCTGTGGC GGGGTCTATT
TCCGCAGTTG AGGCTGGAGC AGTACAAATA CAGGGAACTA TAAACGGATT TGGCGAAAGA
TGCGGAAATG CTAACCTTTG TACAATTATA CCCAACCTTC AGCTTAGGAT GGGATATCAG
TGCATACCAA GTGATAATAT GGCCTATCTG ACTCCTACAG CAAGGTTTGT AAGTGAGGTT
GCAAACATAA TACATGACGA AAGAGCTCCT TTTGTTGGAA GCTGTGCATT TGCACACAAG
GCAGGAATGC ATACTGATGC GGTAAATAAG AATTCTTTTG CATATGAAAT GGTAAATCCG
GAGGTTGTTG GAAATCAAAG AATTATTCTT ATGTCAGAAG TAGCAGGCAG AAGTGCTGTT
ATGAACATTA TAAACAAGGT AGACAGTACT ATAACCAAGG ATTCCCCAGA AACCAAAAAG
ATAATTGAAA GGTTGAAGGA ACTTGAGTAT GAGGGCTACC AGTATGAAGG TGCGGAAAGC
TCATTTGAGC TTGTTATCCG AAAGATGCTT GGCAAGTACA AATCATACTT TGAATTAAAG
GAATTTAAAG TTATAGTAAA TGAACCGACT ATAAATAGTG TAAATTCATC TGCAATGATA
AAAATTGTTG TAGGTGAGCA GACAGAAATC ACGGCGGCGG AAGGAGAAGG GCCTGTAAAT
GCCCTTGATA AGGCCTTAAG AAAAGCTCTG GAGAGGTTTT ATCCCCAAAT AGCTGAAATG
AAGCTTACCG ATTACAAGGT TAGGGTTCTT GATTCAAATT TTGCAACCGC ATCAAAGGTT
AGGGTTTTGA TAGAAAGCAC AGACGGACAG GAAGTTTGGA CAACGGTAGG GGTATCTACC
GACATTATTG AGGCTAGTTG GCGTGCCTTA GTTGATTCTG TAGAGCATAA ACTTATGAAA
TAG
 
Protein sequence
MSNNKIIMYD STLRDGAQAL GISFTVEDKL KIVSKLDELG IDYIEAGNPG SNPKDLEFFE 
KVLKLKLRTS KIIAFGSTRR ANIKVEDDAN IQSLLSAGTE AIAVFGKAWD FQVTDILKTT
LDENFDMIYD TVKFLKEQGK TVVYDAEHFF DGYDANPEYA IETLKRAYNA GADTICLCDT
KGAGLPSYIA RVTKKVREEV DCAIGIHCHN DNGMAVAGSI SAVEAGAVQI QGTINGFGER
CGNANLCTII PNLQLRMGYQ CIPSDNMAYL TPTARFVSEV ANIIHDERAP FVGSCAFAHK
AGMHTDAVNK NSFAYEMVNP EVVGNQRIIL MSEVAGRSAV MNIINKVDST ITKDSPETKK
IIERLKELEY EGYQYEGAES SFELVIRKML GKYKSYFELK EFKVIVNEPT INSVNSSAMI
KIVVGEQTEI TAAEGEGPVN ALDKALRKAL ERFYPQIAEM KLTDYKVRVL DSNFATASKV
RVLIESTDGQ EVWTTVGVST DIIEASWRAL VDSVEHKLMK