Gene Cthe_1754 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1754 
Symbol 
ID4810184 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2074396 
End bp2075352 
Gene Length957 bp 
Protein Length318 aa 
Translation table11 
GC content42% 
IMG OID640107167 
Productperiplasmic binding protein 
Protein accessionYP_001038168 
Protein GI125974258 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0614] ABC-type Fe3+-hydroxamate transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000128267 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGAATA AAAGGAGTAT GATTTTAAAA AGAAAGATAC TGCCGCTGTT AACGGCGTTA 
ATTTTGATTT TTGCTTTTTC TTCATGCAAT AAAAACAATA AAAACGGGCC AACTGATGGG
GCCGGTGATA ATAATAACGG ATACAGTGTA ACTTTGAAGG ATTCCTATGA CAGGGAAGTC
AACCTGGACA AAGAACCTGA AAGGATAGTC TCAGTTGCCC CCAACATAAC GGAGATAATT
TTTGCGTTGG GCAAGCAGGA CAAGCTGGTG GGACGGACGG ATTTTTGCGA TTATCCGGAA
GAGGCGAAGA ACATCGAGTC AATAGGAAAT ATAGACCAGC CGAATGTGGA AAAGATAGTT
GAACTTCAGC CGGATGTGGT TATAGCATCT TCCATCTTTA CGAAAGAGAT GCTGCAAAAG
CTTGAGGAGG CCAATATCAA GGTGGCTATC TTTCAGGCCG AGAAGGACTT TGAAGGTGTC
TACAACATGA TCGAAAAGAT TGGTCTTTTG CTGAACGCCC GGGAAGAGGC AAAGAATGTT
GTGACGGAAA TGAAGGAAAA AATAGAGTTT GTAAAGAGCA AGGTCGACGG CCTTGAAAAG
CCCAGTGTTT ATTATGTTCT TGGCTATGGC GAGTTTGGGG ATTATACCGC AGGAAGGGAC
ACATTTATCA GCCGCATGAT TGGGATGGCT GGAGGAAAGA ACGCGGCGGA TGATGTGGAA
GGCTGGAAAT ACAACATAGA AAGCCTCCTT GAAAAGGATC CTGACATACT TATATGCTCA
AAATATTATG ATACAAAAGA AGGAATAAAA AATACCGACG GATACAAGGA ACTTTCCGCG
GTAAAAAACG GAAAGCTTTT TGAGATAGAC AACAATATGC TGGACAGGCA GGGGCCGAGA
ATTGCCGACG GGGTTTTGGA ACTTGCTAAA ATAATTCATC CTGAAGTTTT TAAATGA
 
Protein sequence
MRNKRSMILK RKILPLLTAL ILIFAFSSCN KNNKNGPTDG AGDNNNGYSV TLKDSYDREV 
NLDKEPERIV SVAPNITEII FALGKQDKLV GRTDFCDYPE EAKNIESIGN IDQPNVEKIV
ELQPDVVIAS SIFTKEMLQK LEEANIKVAI FQAEKDFEGV YNMIEKIGLL LNAREEAKNV
VTEMKEKIEF VKSKVDGLEK PSVYYVLGYG EFGDYTAGRD TFISRMIGMA GGKNAADDVE
GWKYNIESLL EKDPDILICS KYYDTKEGIK NTDGYKELSA VKNGKLFEID NNMLDRQGPR
IADGVLELAK IIHPEVFK