Gene Cthe_2531 details


Gene Information

Locus tag: Cthe_2531
Symbol:
ID: 4809287
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 3000447
End bp: 3001517
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 44%
IMG OID: 640107947
Product: sulfate ABC transporter, periplasmic sulfate-binding protein
Protein accession: YP_001038926
Protein GI: 125975016
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1613] ABC-type sulfate transport system, periplasmic component
TIGRFAM ID: [TIGR00971] sulfate/thiosulfate-binding protein
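
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the Start bp and End bp values are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3000447
end_bp = 3001517

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)      # 1071, matching "Gene Length"
print(protein_length_aa)   # 356, matching "Protein Length"
```

Under these assumptions the reported gene length and protein length agree with the coordinates.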


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000000783626
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAGAAAA ATAAGGGAAG AAAAAATATT CTCAAGTCAA GCCTGGCAGT ACTGCTGATT 
ATAGCCGGCC TTGTCTCCTT TGCAGGATGC GCAAAAGAAG CAGGCAATAA TCAAAGCGAT
TCCGGTCAAA CAGAACCTGT TACGATTCTT AATGTATCCT ATGACCCTAC CCGTGAGCTT
TATCAGGATT ACAATGAAGC ATTCAGAAAA TACTGGAAAG AAAAAACCGG AAAGGACATT
GTCATCCAGC AGTCCCATGG AGGCTCAGGC AGCCAGGCAA GAGCTGTAAT TGACGGTCTT
GAGGCTGATG TCGTAACCCT GGCTTTGGCG TATGATATTG ATTCAATAAA CAAAAACAAA
GAGATTTTGA GCAAAGACTG GCAGAAACGC TTGCCGTACA ATTCAACTCC ATATACTTCG
ACCATTGTAT TTCTGGTAAG AAAGGGTAAC CCCAAAAATA TTAAGGACTG GGATGACCTT
GCCAGACCGG GGGTGGAAGT TATCACTCCA AACCCCAAGA CTTCAGGAGG TGCACGCTGG
AATTATCTTG CGGCATGGGG ATATGCCCTG AAAAAATACG GCAATGACCC GGAAAAAGCC
AAGGAATTTG TTAAAGCAAT ATATGCCAAC GTACCCGTTC TTGATTCCGG AGCAAGGGGC
TCTACCACGA CCTTTGTGGA GCGGGGATTG GGAGATGTGC TTATAGCCTG GGAGAATGAA
GCATTTCTTT CCTTGAATGA GCTGGGCAAA GACAAATTCG AAATTGTTGT ACCGTCTGTG
AGCATACTGG CTGAGCCTCC TGTTGCTGTT GTTGATTCGG TGGTTGACAA GAAAGGAACC
CGTGAGGTGG CGGAAGCTTA TCTTGAATAC CTGTACAGTG ACGAGGGGCA GGAAATAGCG
GCTAAAAACT ATTACAGGCC CAGAAAAGAA GAAATCAAGC AAAAATATGC TTCGCAATTT
GCCGAAGTTG AACTTTTTAC CATTGATGAA GTCTTTGGCG GATGGGATAA AGCGCAAAAG
GAACATTTTG ATGACGGCGG TATCTTTGAC CAAATATATG AGAAGAAATA A
 
Protein sequence
MEKNKGRKNI LKSSLAVLLI IAGLVSFAGC AKEAGNNQSD SGQTEPVTIL NVSYDPTREL 
YQDYNEAFRK YWKEKTGKDI VIQQSHGGSG SQARAVIDGL EADVVTLALA YDIDSINKNK
EILSKDWQKR LPYNSTPYTS TIVFLVRKGN PKNIKDWDDL ARPGVEVITP NPKTSGGARW
NYLAAWGYAL KKYGNDPEKA KEFVKAIYAN VPVLDSGARG STTTFVERGL GDVLIAWENE
AFLSLNELGK DKFEIVVPSV SILAEPPVAV VDSVVDKKGT REVAEAYLEY LYSDEGQEIA
AKNYYRPRKE EIKQKYASQF AEVELFTIDE VFGGWDKAQK EHFDDGGIFD QIYEKK
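
The gene and protein sequences above can be compared by translating the nucleotide sequence codon by codon. The following is a minimal sketch, not the method used to produce this record. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical. It relies on the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code for internal codons (the two tables differ in permitted start codons), and it assumes the gene sequence is shown in the coding orientation, as its ATG start and TAA end suggest.

```python
# Hypothetical helpers for illustration; not part of the source record.
BASES = "TCAG"
# Amino acids in NCBI order for codons enumerated over BASES (T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a non-empty nucleotide sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq):
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, copied as displayed above.
first_block = "ATGGAGAAAA ATAAGGGAAG AAAAAATATT"
print(translate(first_block))  # MEKNKGRKNI, the first 10 residues of the protein
```

Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison with the protein sequence and the GC content field listed in the record.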