Gene Cthe_0426 details


Gene Information

Locus tag: Cthe_0426
Symbol:
ID: 4808429
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 535029
End bp: 536699
Gene Length: 1671 bp
Protein Length: 556 aa
Translation table: 11
GC content: 40%
IMG OID: 640105840
Product: putative PAS/PAC sensor protein
Protein accession: YP_001036857
Protein GI: 125972947
COG category: [R] General function prediction only
COG ID: [COG4624] Iron only hydrogenase large subunit, C-terminal domain
TIGRFAM ID:
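
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length(535029, 536699)
print(length, protein_length(length))  # 1671 556
```

Both values agree with the Gene Length and Protein Length fields of this record.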


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCGAAT GTTTGCAAAC TAAAAAATCA AATTGCAAGA ACTGCTATAA GTGTATAAGA 
CATTGTCCGG TTAAGTCATT GAAATTCACT GACGGTCAGG CTCATATAGT AAGGGATGAA
TGCGTTTTGT GCGGCGAATG CTATGTCGTT TGCCCGCAAA ACGCAAAGCA GATACGATCC
GATGTGGAAA AAGCAAAACA ACTGGTTTTA AAATATGATG TTTATGCCAG TATAGCTCCG
TCCTTTGTCG CATGGTTTCA TAACAAAAGC ATACATGATA TGGAACAGGC GTTAATAAAG
CTTGGGTTTA AAGGTGCGGA TGAAACGGCG AAAGGTGCTT ACATTGTCAA GAAACAATAT
GAAAAAATGA TAGAAGAGAA AAAAAGCAAA ATTATTATAT CTTCGTGCTG CCATACGGTA
AACACCTTGA TCCAAAGGCA TTATACCGGC GCAATTCAGT ATCTGGCAGA TGTAGTCTCG
CCGATGCTTG CCCATGCGCA AATGCTGAAA AAAGAACATA AAGGTGCAAA GGTTGTATTT
ATCGGACCGT GCATTTCCAA AAAAGACGAA GCAGAAAAAT ATAAAGGTTA CGTGGAACTT
GTGCTTACTT TCGATGAACT GGATGAATGG CTAAAATCAG AAAATATTAC AATTGAAAGC
AATAGGGGCA GCTCTAAAGA GGGAAGGACC AGAAGTTTTC CTGTTTCCGG AGGCATTATC
AGTAGTATGG ATAAAGATTT AGGATATCAT TACATGGTAG TGGACGGTAT GGAAAACTGC
ATAAATGCTT TGGAGAATAT AGAAAGAGGA GAGATTGACA ACTGTTTTAT TGAGATGTCG
GCATGCAGAG GCAGTTGTAT AAACGGTCCG CCTGCAAGGC GCAAAAGCAA TAATATTGTC
GGAGCTATAT TAGCCGTAAA TAAAAACACC GGAGCAAAAG ATTTTTCTGT CCCGATGCCC
GAACCTGAAA AACTAAAAAA AGAATTCCGG TTTGAAGGTG TGCACAAGAT AATGCCCGGG
GGTACTGCCA TCGAGGAGAT TTTAAAGAAA ATGGGCAAGA CCTCCATAGA GCATGAGCTA
AACTGCGGAA GCTGTGGTTA TGACACCTGC CGCGACAAAG CGGTAGCAGT ATTAAATGGA
AAAGCAGACC TTACAATGTG TCTTCCATAC CTGAAAGAAA AAGCGGAAAG CTTTTCAGAT
GCAATTATTA AAAATACACC CAACGGTGTT ATAGTATTAA ATGAAGACCT TGAGATTCAG
CAGATAAACA ATTCCGCAAA AAGAATTCTG AACTTAAGCC CGTCTACAGA TCTTTTGGGA
AGTCCTGTTA GCAGAATACT TGATCCTATA GATTATATTC TTGCGTTGAG AGAGGGCAAA
AACTGTTACT ATAAAAGGAA GTATTTTGCG GAATATAAAA AATATGTCGA CGAAACCATA
ATTTATGATA AGGAATATCA CGTCATTATC ATAATTATGA GGGATGTGAC TGAGGAGGAG
AAAATCAAGG CGCTGAAAAA CAAGCAAAGC GAGGCGGCAA TTGAAATAGC GGATAAAGTT
GTGGAAAAGC AGATGCGCGT AGTGCAGGAA ATAGCACTGC TTTTGGGAGA AACGGCCGCT
GAGACTAAAA TTGCTTTGAC CAAGCTGAAG GAGACTATGG AGGATGAATG A
 
Protein sequence
MTECLQTKKS NCKNCYKCIR HCPVKSLKFT DGQAHIVRDE CVLCGECYVV CPQNAKQIRS 
DVEKAKQLVL KYDVYASIAP SFVAWFHNKS IHDMEQALIK LGFKGADETA KGAYIVKKQY
EKMIEEKKSK IIISSCCHTV NTLIQRHYTG AIQYLADVVS PMLAHAQMLK KEHKGAKVVF
IGPCISKKDE AEKYKGYVEL VLTFDELDEW LKSENITIES NRGSSKEGRT RSFPVSGGII
SSMDKDLGYH YMVVDGMENC INALENIERG EIDNCFIEMS ACRGSCINGP PARRKSNNIV
GAILAVNKNT GAKDFSVPMP EPEKLKKEFR FEGVHKIMPG GTAIEEILKK MGKTSIEHEL
NCGSCGYDTC RDKAVAVLNG KADLTMCLPY LKEKAESFSD AIIKNTPNGV IVLNEDLEIQ
QINNSAKRIL NLSPSTDLLG SPVSRILDPI DYILALREGK NCYYKRKYFA EYKKYVDETI
IYDKEYHVII IIMRDVTEEE KIKALKNKQS EAAIEIADKV VEKQMRVVQE IALLLGETAA
ETKIALTKLK ETMEDE
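
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own pipeline: it uses the standard codon assignments, which translation table 11 shares with the standard table for internal codons (the two differ only in which codons may act as start codons). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon order follows the usual T, C, A, G layout, third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 nt and last 15 nt of the gene sequence listed above.
print(translate("ATGACCGAAT GTTTGCAAAC TAAAAAATCA"))  # MTECLQTKKS
print(translate("ATGGAGGATGAATGA"))                   # MEDE
```

The two outputs match the first ten and last four residues of the protein sequence; passing the full gene sequence to `translate` would allow the same comparison over all 556 residues.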