Gene Cthe_3131 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_3131 
Symbol 
ID4809694 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3700138 
End bp3701835 
Gene Length1698 bp 
Protein Length565 aa 
Translation table11 
GC content38% 
IMG OID640108564 
Productvon Willebrand factor, type A 
Protein accessionYP_001039519 
Protein GI125975609 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1240] Mg-chelatase subunit ChlD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGGAAA GCAAAAAGAA TTTAAAAGTT ATAGTAATTT TATTGGGAAT AGCTTTAGTT 
GTGTTCGGGC TTATTTATGG TGGCATTTCT TTGACGCAAA ACTTTGGAAA AAGTCAAAAA
GTAATTTCAA TGGAAAGTGC CGAAAAAAAA CTGGATAAAC TTTATAAAGA TATAACTGTA
AATATTATTG AGCCAAAAAA AGGGCAGGTG GACATTGATC CTCCGGACCT TAAAGAGTCT
TTGCCTGATA TTTCCAAATA TCCGCCTCAG GTGGAAGAAA CTACAGGCAC TTTTATAGAA
ATCTTTTCTT CCACTGAAAA GTCAGGAGAG AAAAAAGACG GATGGCTTCT TGATGTGGCA
AGGGAATTTA ACAGAGCCAA CATACAGGTA AACGGTAAAC CTGTTTCTGT CAGAATCAGA
GGTATTGCCT CGGGTTTGGC GACAGACTAC ATAATTTCCG GTAAATATCT TCCTGATGCC
TTTACACCTT CCAACGAGCT TTGGGGGGAA ATGATCAGGG CAAGCGGGGT GGATATATCC
CTTGTTGAAA AAAGACTTGC CGGTAATGTT GCAGGAGTGC TTCTTTCAAA AGCAAAGTAT
GAGGAATTGC TTCAGAAATA TGGTTCAATA AATTTAAAAA ATATTACTGA GGCGGTTGCG
GCAAACGAAA TAGCAATGGG TTATACCAAT CCTTTTGCAA GTTCAACGGG AATGAACTTT
CTTGTTTCAA CATTGAGTAC TTTTGACAGT AAAAATATAT TGAGCGAAAA AGCTATAGAA
GGTTTTGAAA AATTTCAGAC CAATATTCCT TTTGTGGCTT ATACTACTTT ACAGATGAGG
GAGTCTGCAA AATCCGGCGT TCTCGACGGC TTTATACTGG AGTACCAGAC CTATGAAAAT
ACTCCTGAAC TGAAAAAGGA CTATGTTTTC ACTCCTTTTG GTGTAAGACA TGACAGTCCG
ATGTATGCCA TCGGAAATCT AACTCAGGAG AAAAAAGAAA TACTCAATAA ATTCGTTGAG
TTTTGCAAAA GCAGCAAATC ACAGGAGCTT GCAACAGAAT ACGGTTTCAA CAGGCTTGAC
GATTATTTGC CTGAAATATC GAATTTTGAC GGAGAGGCTA TAATGAAAGC CCAGAAGCTT
TGGAAAGAAA AGAAGGATGT TAACAATGAC ATTGTAGCCG TTTTTGTTGC CGATGTGTCG
GGAAGTATGG CAGGTGAACC GCTCAACAGA TTGAAGCAAT CTCTTATAAA TGGTTCTAAA
TATATAAGTT CAGATGTTTC CATCGGGTTG GTGTCTTATT CCACGGATGT GAATATAAAT
CTTCCGATTG CCAAATTTGA CTTAAACCAA AGGTCTTTGT TTGTAGGTGC GGTTGAAAGC
CTGGCTGCGG GCGGCAATAC AGCAACGTTT GACGCGATAA TTGTGGCAAC GAAAATGCTT
AAGGAAGAAA AAGCAAAGAA TCCTAATGCC AAATTGATGC TGTTTGTGTT AAGTGACGGT
GTGACAAATT ACGGCCACTC GCTAAACGAT ATTAAAGATA TGATGAAGAC TTTCGGAATT
CCAATTTATA CTATAGGATA TAACGCAAAT ATAAAGGCAT TGGAGACTTT ATCACAAATA
AACGAAGCGG CAAATATAAA TGCTGATACG GAAGATGTTG TATATCAGTT GGGAAGTTTG
TTCAACGCCC AGATGTAA
 
Protein sequence
MPESKKNLKV IVILLGIALV VFGLIYGGIS LTQNFGKSQK VISMESAEKK LDKLYKDITV 
NIIEPKKGQV DIDPPDLKES LPDISKYPPQ VEETTGTFIE IFSSTEKSGE KKDGWLLDVA
REFNRANIQV NGKPVSVRIR GIASGLATDY IISGKYLPDA FTPSNELWGE MIRASGVDIS
LVEKRLAGNV AGVLLSKAKY EELLQKYGSI NLKNITEAVA ANEIAMGYTN PFASSTGMNF
LVSTLSTFDS KNILSEKAIE GFEKFQTNIP FVAYTTLQMR ESAKSGVLDG FILEYQTYEN
TPELKKDYVF TPFGVRHDSP MYAIGNLTQE KKEILNKFVE FCKSSKSQEL ATEYGFNRLD
DYLPEISNFD GEAIMKAQKL WKEKKDVNND IVAVFVADVS GSMAGEPLNR LKQSLINGSK
YISSDVSIGL VSYSTDVNIN LPIAKFDLNQ RSLFVGAVES LAAGGNTATF DAIIVATKML
KEEKAKNPNA KLMLFVLSDG VTNYGHSLND IKDMMKTFGI PIYTIGYNAN IKALETLSQI
NEAANINADT EDVVYQLGSL FNAQM