Gene Cthe_3123 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_3123 
Symbol 
ID4809686 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3686834 
End bp3690736 
Gene Length3903 bp 
Protein Length1300 aa 
Translation table11 
GC content37% 
IMG OID640108556 
Productvon Willebrand factor, type A 
Protein accessionYP_001039511 
Protein GI125975601 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGATTTTTT ATTATGATGA GGAAAATGAC ACTATTAAGT TTTTAGAAAC GGAAGTGGAT 
GAGGAAACAA ATACGATAAA AACAACGGTT GATCATTTTA GTATTTATGG TGTTATAGAT
ATAGTAAGGT TTGCACAGAG TTGGGGTATA AAAGACATTT TGGATAAACT TTTAAACCCT
GGTGGAGAAA CCATACCGCC TGTTGCTGAA ATAGGACAGG CAGATATAGT TTTTGTTATT
GATACGACCG GTTCGATGGG CTCAGTAATT AATAATGTGA AAAACAATAT TACAAACTTT
GCCAACACAT TAATGGAAAA TAATGTAGAT GTGAGATTGG GGTTAATAGA TTACAAGGAT
TTAGAAGAAG ATGGCATGGA TTCTACTAAA AACCTTGGAT GGTTTGACAA TGTAAGTGAT
TTTATTGCAA GTGTAAACAA TATGAGGGCA ACCGGAGGAG GAGATGCTCC TGAATCTACC
GTGGATGCAT TGGAAGAAGC AAGAAGAATG GATTTCAGGC CCGGTGTAAA TAAATTTATT
ATGCTTTTTA CTGACGTGTC ATATAAAGAA TCTACAAGAT TTGAAGATGT TCAGTCAATG
AAGACAGTGA TAGAAAAGTT AAAAGAAGAT AAAATTGTCG TATCAGCAAT TGTACCTTCC
GGTTATGAGT CATTGTATAG GAATTTGTAT ACTGAAACAG GCGGTGTTTA CGCAAATATA
ACACAAGCTT TTTCTTCCGC GCTCCAATCG CTTATTAGCA ACATAGCCAG TGTAACCCAT
GGTGACGGTG TCTGGATAAG GTTATCCAAC GGAATGTTAA AAAAACTGGA AGGAGTTTCG
AGCAAAGAAG AATTGCTTGA GATGGCATCG GGAGGATACC TTTCTGTCGG CGATTTGTTT
TTTGAAGAAG TTACGGTAGA TTTTTTCGGA TTGAAAGTAA AAACTTTCAG GGAAGCTTTT
AACAAAAATG CCGTGGATGT CAGAATAACT CCCACAAGTG GATATTCAGG GGGTATGTAT
ACAGTTGATG CGGTAGCTAC AATTGTAACA AAACAGCAGG TGGATAAAGC GGTATTGTTA
TTCAGGCTGT TAAACGGAAA CTGGATGCAA TATAATGCCG GTCTTGATTC GAAATTTATT
ATGAGTAAGT ATAATTTTAA ATTGCCTGAA GAATATCCGG AACGCTTGAA TGATTTGTTT
ATTGGATACA GAACCAGTTT AAATATTCTG GATTCGGGAT TAAGGGAGTT TAAAGTTGTT
TTGTTTGACA AAGAAGGCAG GATAATTGTT GAGTCAGCAG TAAATAAAGT AAAGGTGGAA
AGTTCAATCA AGTGGACAGG GGATTTATCA CCGCTGGAAA CAGTAATCGG CACCAAAAAC
TTTTCCATAG CTCAAAATGC CAATCTTCTG GCCGGTGAAA AAATTGCTTT GTTTTTACGT
AAAGATGGCG AACAGTTTCC GGTAATGGTT GATTCGTCCG AACAAAAATA CAAACGCGAG
CAAAATAACT ATAATTTCAG TTTCGTATCA AATGATTGGG ATGCTGTAAA TAAGAGGCCT
AAATATACTA ATGGTGTATA TTGGATATCT GTACATATAA CGGATGGGAG TGGCAAATCT
CTTATTAAGT CTGATGAGAG AAAGATAAAA ATAAATAACA TTTGGATTTC TGAATACTAC
AATAGCGGAC AAAACAAGGA TATTTATGAT ATTAAAAAGA GATTGAATGA ATTGGGATAT
TTTGGCCAGA ATAATTTGCA GTTGACTGTC AATACAGTAT TTGATTATAA TACTTCTTTT
GCTGTCAGTG AATTTAAGAG AGTTAATCGT ATAAATGACA ACGGAATATT TAAAGGTGTT
GTATGTGACC AAACATGGAA TGTACTGTTT TCAGATTCTG CATTAAGGAA TGATAAGTCG
CCTTCTGCAG GTTCCGGTGA ATGGAATTTC CAGAGCCTGA AAGAAAGACT GAGATTTTTG
TCCGCGGCAG AATTGGAAAA GGAAATTTCA GATGCCAGCA AGGAAAAAAG GTCCTTGTCT
CCAAGTGCTG AGCTGACTAA AAGATTGAAT CTCATTAATT CAAGAATAGC GAATGACTGG
GTATCCAATG GCGTAACCAA AACAAGATTT GCCAAAAGGT ATCTTGACCA GTGTGCAAAA
AACGGCATAA CACCACCTGC CGAGTATAAC ACTGAGGCTA AGGTTATTAA AGCATTGACA
TGGAATGATG AGAAAATTGA GAAGCTGTAT AGAAATTCAT TGCAATTCCA AAATCTGGAT
ATTGATCCAA GGCTGCTTCT GGCGATAATA ATTCAGGAGG GTACAGGAAG TTTTAATACA
AATCCGGAGG TAAAGGACAG TAATGGCGGA CATTATATCC AGCCGGATTT TGAAAAAGAT
TTGAAAGCTG CATTGGACAA TCAATTTTTG AGAAAAGCAA ATGCATATAG GTACTATGGG
GAGCAGTTCT CTGAATTTGT AAGTGGTTTA AAGGTTTCAC AGCCTAATTT AACCAGCGGA
AAAGGTAATT TGTATCAATT CCTAAATTAT GCTACAATGG CGGCAACGGT TGATCTCAAT
ACTAATGAAA TAATTGAGCT TAAACCCCAT GGAGTATATG CAACTCATGA TGGCTGGTGG
AAAAATGTGG AAGCTGTTTT CAATTCATTG GTGGACAACA ATGGAGAGAA ACCTGCTGAA
AAGTATTCGA ATTTGTTTAA GAGCTCAAAA AAGGTACCTT TAAAAAGTGG CTATTCAAAA
CCGACAGTTG TGTTTAAATT GGAGTGGATA AATGAAGTAA TATACAATAG CAAGACCGAG
AAATATGACC CGGGTTATAC AATTAAAGCG ACACTGGCTT CACAGACTAC ACCGGAGGAA
CCAAAGCCGA CGGAGAATGT ATTCCCTCTG AAATATGGTG ATACCAGTGC CGTAAAAGGT
AGCTACATCA AAGAAGTTCA TGATTTACTT AGCAAAAACC TTGGAAGCGG TCTGGAATAT
CTTAAAATTG CAGATGGTGA TGCGGGTTAT GGAAAAAACT ATGGACCAAA AACTACAGCT
TTGGTGAAAT TATATCAGAG TCAAAACGGT GCCGCAGTTA ACGGGCAAAT AGATGAAGCT
ACTCTGAATA AGCTGAGGAA AGGTGAATGG AAAGTTGTAG CACCCAGTGA AGGAAAATAT
GTAATAAAGG GTTCTTATCA GGATTCTTAC TTTGAACTTG TTGTTGATAA AGCTCCTGAT
ATTTCTGATA CTGGAGACTT TAGCTTAAAG CTGAAAAGAC TGGTGGAAGA ATATACCGAT
TTTAATATTA ACGGGGTTAA AGTTAAATTG CCTTACTATC AGACTGTGGG CACACGTTAT
GGAGGTAAAT CAACACCGGA ACAGATAAGA AATTTCATAC TTGGAAAAAC AACAGATCCG
TCCAAATTCC AGAGTGTTGC AGATGACCCT GAAAACAGGC ATAAAGTTGG AGTTGACTGT
TCGGGACTGG TAGCTTATGT GCTTAATGAA GCAACAGAAG GAGCGATTCA CAAAACCCAC
GGGCAAACCG GTTATGCAAA TGGAATTAGT GCGGCAGCCC TCACAAACAC TAAACTTGGA
CAGAAAATAA CCAGGGCGAA AGATATTGTT CCGGGAGCTA TAATGAATAC TGATGACGGC
GGTCATGTTA TCGTAATATA TGAAGTTGTA AAAACAAACG GAAAAGTTAC TCAAATAAAA
TATGCACATT CTAATGGCAA ACACGGGCCT CATAAAGGAT ACATTGACAT AGGAGATGAA
AATCAGGACT TGGACGGAAG TGCACAAACG TGGCATGATA TTTCATATAC GGACCAAAAA
GCTAAAGAGC TTTATACCTA TACCATTTTG CGTAATGAAG TCATAGAATA TTTGAAAAAA
TAA
 
Protein sequence
MIFYYDEEND TIKFLETEVD EETNTIKTTV DHFSIYGVID IVRFAQSWGI KDILDKLLNP 
GGETIPPVAE IGQADIVFVI DTTGSMGSVI NNVKNNITNF ANTLMENNVD VRLGLIDYKD
LEEDGMDSTK NLGWFDNVSD FIASVNNMRA TGGGDAPEST VDALEEARRM DFRPGVNKFI
MLFTDVSYKE STRFEDVQSM KTVIEKLKED KIVVSAIVPS GYESLYRNLY TETGGVYANI
TQAFSSALQS LISNIASVTH GDGVWIRLSN GMLKKLEGVS SKEELLEMAS GGYLSVGDLF
FEEVTVDFFG LKVKTFREAF NKNAVDVRIT PTSGYSGGMY TVDAVATIVT KQQVDKAVLL
FRLLNGNWMQ YNAGLDSKFI MSKYNFKLPE EYPERLNDLF IGYRTSLNIL DSGLREFKVV
LFDKEGRIIV ESAVNKVKVE SSIKWTGDLS PLETVIGTKN FSIAQNANLL AGEKIALFLR
KDGEQFPVMV DSSEQKYKRE QNNYNFSFVS NDWDAVNKRP KYTNGVYWIS VHITDGSGKS
LIKSDERKIK INNIWISEYY NSGQNKDIYD IKKRLNELGY FGQNNLQLTV NTVFDYNTSF
AVSEFKRVNR INDNGIFKGV VCDQTWNVLF SDSALRNDKS PSAGSGEWNF QSLKERLRFL
SAAELEKEIS DASKEKRSLS PSAELTKRLN LINSRIANDW VSNGVTKTRF AKRYLDQCAK
NGITPPAEYN TEAKVIKALT WNDEKIEKLY RNSLQFQNLD IDPRLLLAII IQEGTGSFNT
NPEVKDSNGG HYIQPDFEKD LKAALDNQFL RKANAYRYYG EQFSEFVSGL KVSQPNLTSG
KGNLYQFLNY ATMAATVDLN TNEIIELKPH GVYATHDGWW KNVEAVFNSL VDNNGEKPAE
KYSNLFKSSK KVPLKSGYSK PTVVFKLEWI NEVIYNSKTE KYDPGYTIKA TLASQTTPEE
PKPTENVFPL KYGDTSAVKG SYIKEVHDLL SKNLGSGLEY LKIADGDAGY GKNYGPKTTA
LVKLYQSQNG AAVNGQIDEA TLNKLRKGEW KVVAPSEGKY VIKGSYQDSY FELVVDKAPD
ISDTGDFSLK LKRLVEEYTD FNINGVKVKL PYYQTVGTRY GGKSTPEQIR NFILGKTTDP
SKFQSVADDP ENRHKVGVDC SGLVAYVLNE ATEGAIHKTH GQTGYANGIS AAALTNTKLG
QKITRAKDIV PGAIMNTDDG GHVIVIYEVV KTNGKVTQIK YAHSNGKHGP HKGYIDIGDE
NQDLDGSAQT WHDISYTDQK AKELYTYTIL RNEVIEYLKK