Gene Ccel_0138 details


Gene Information

Locus tag: Ccel_0138
Symbol:
ID: 7309049
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium cellulolyticum H10
Kingdom: Bacteria
Replicon accession: NC_011898
Strand:
Start bp: 153543
End bp: 154718
Gene Length: 1176 bp
Protein Length: 391 aa
Translation table: 11
GC content: 41%
IMG OID: 643607067
Product: malic protein NAD-binding
Protein accession: YP_002504506
Protein GI: 220927597
COG category: [C] Energy production and conversion
COG ID: [COG0281] Malic enzyme
TIGRFAM ID:
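
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The helper names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 153543
END_BP = 154718
GENE_LENGTH_BP = 1176
PROTEIN_LENGTH_AA = 391


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both comparisons hold for this record under the stated assumptions.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 154718 - 153543 + 1 gives 1176 bp, and 1176 / 3 - 1 gives 391 aa, which agrees with the reported lengths.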


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000809828
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGTAATA TTTATGAGGA TTCACTAAAA GCCCACGAAG AGTGGCAGGG AAAGATAGAA 
GTTGTATGTA AGGCTCCTTT AAAGGACAAA AGAGATCTTT CTTTAGCTTA TTCACCTGGA
GTAGCTCAAC CATGTCTCGA AATTCAAAAA GACGTTGAGA ACTCTTACAA ATACACCAGA
AGACACAATC TTGTAGCAGT TGTCACTGAC GGTACTGCTG TACTTGGTTT AGGAGATATC
GGACCCGAAG CAGGTATGCC TGTTATGGAA GGTAAATGCT GTTTGTTCAA GACTTTTGGT
GATGTTGATG CATTCCCTCT CTGTATAAAG TCAAAAGATG TTGACACAAT TGTTAACACA
ATTAAATTGC TTTCAGGAAG CTTTGGCGGC GTAAATCTTG AAGATATAGC TGCTCCAAGA
TGTTTTGAAA TCGAAGAAAG ATTAAAGAAG GAAACTGATA TTCCTATATT CCACGATGAC
CAGCATGGAA CAGCTATTGT TACAGCTGCA GGTCTTATCA ATGCTTTAAA GGTTGTTGGT
AAGAAGATGG AGGATATCTC AATAGTTGTA AACGGTGCTG GTGCTGCTGC TATAGCTATT
ACAAAACTTC TTTTCTCCAT GGGGCTCAGA AAGGTTGTTC TTTGTGACAC AAAGGGTGCT
ATCTACGAGG GTAGAGACAA CCTTAATCCG ATAAAGGCTG AAATGGCTAA AATCACTAAT
CTTGAAGGCA AAAAAGGTTT ATTGAAGGAT GTTATTGTTG GTGCAGACGT ATTTATCGGA
GTTTCAGCGG CTAACCTAGT AACAAAAGAA ATGGTTAAGT CAATGGCTAA AGACCCGATT
ATCTTTGCTC AAGCAAACCC AACTCCTGAA ATTCTGCCTG AAGATGCTCT CGAAGCTGGT
GCTGCTGTTG TTGGGACAGG CCGTTCAGAC TATCCAAATC AGGTTAACAA TGTTCTTGCA
TTCCCTGGTA TATTCAGAGG AACTTTCGAT GTAGGAGCAC GTGAAATAAA TGATGAAATG
AAGATAGCTG CTGCATACGC AATCGCAGGA CTTGTTAGCG ATGAAGAAAG AAATGCAGAG
TATGTAATTC CGGCTCCATT CGATCCTAGA GTAGCAAAAG CAGTTGCAGA AGGTGTTGCT
GAAGCAGCTA GAAAATCAGG TGTAGCTAGA AGGTAA
 
Protein sequence
MGNIYEDSLK AHEEWQGKIE VVCKAPLKDK RDLSLAYSPG VAQPCLEIQK DVENSYKYTR 
RHNLVAVVTD GTAVLGLGDI GPEAGMPVME GKCCLFKTFG DVDAFPLCIK SKDVDTIVNT
IKLLSGSFGG VNLEDIAAPR CFEIEERLKK ETDIPIFHDD QHGTAIVTAA GLINALKVVG
KKMEDISIVV NGAGAAAIAI TKLLFSMGLR KVVLCDTKGA IYEGRDNLNP IKAEMAKITN
LEGKKGLLKD VIVGADVFIG VSAANLVTKE MVKSMAKDPI IFAQANPTPE ILPEDALEAG
AAVVGTGRSD YPNQVNNVLA FPGIFRGTFD VGAREINDEM KIAAAYAIAG LVSDEERNAE
YVIPAPFDPR VAKAVAEGVA EAARKSGVAR R
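
The protein sequence above corresponds to a codon-by-codon translation of the gene sequence. As a rough sketch, the code below builds a codon table from the compact amino acid string used for NCBI translation table 11 (bases ordered T, C, A, G) and translates a nucleotide string in frame 0. It is not the database's own pipeline. The names `translate`, `CODON_TABLE`, and `to_stop` are hypothetical, and alternative start codons are not handled.

```python
from itertools import product

# Amino acids for the 64 codons in T, C, A, G order (table 11 assignments).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str, to_stop: bool = True) -> str:
    """Translate a DNA string in frame 0, ignoring whitespace.

    With to_stop=True, translation ends at the first stop codon, which is
    not included in the result. Unknown codons are rendered as 'X'.
    """
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above.
first_line = (
    "ATGGGTAATA TTTATGAGGA TTCACTAAAA GCCCACGAAG AGTGGCAGGG AAAGATAGAA"
)
print(translate(first_line))  # MGNIYEDSLKAHEEWQGKIE
```

The printed residues match the first two ten-residue blocks of the protein sequence. Translating the complete gene sequence the same way should yield all 391 residues.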