Gene Information |
Locus tag | Cthe_0426 |
Symbol | |
ID | 4808429 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 535029 |
End bp | 536699 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640105840 |
Product | putative PAS/PAC sensor protein |
Protein accession | YP_001036857 |
Protein GI | 125972947 |
COG category | [R] General function prediction only |
COG ID | [COG4624] Iron only hydrogenase large subunit, C-terminal domain |
TIGRFAM ID | |
|
|
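The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. Neither convention is stated in the record, though both agree with the listed values. The variable names are hypothetical.

```python
# Values copied from the Gene Information table.
start_bp = 535029
end_bp = 536699
gene_length_bp = 1671
protein_length_aa = 556

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted in the protein length.
codons, remainder = divmod(gene_length_bp, 3)
residues = codons - 1

print(span, residues, remainder)  # 1671 556 0
```

Under those assumptions the three fields (coordinates, gene length, protein length) are mutually consistent.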
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGAAT GTTTGCAAAC TAAAAAATCA AATTGCAAGA ACTGCTATAA GTGTATAAGA CATTGTCCGG TTAAGTCATT GAAATTCACT GACGGTCAGG CTCATATAGT AAGGGATGAA TGCGTTTTGT GCGGCGAATG CTATGTCGTT TGCCCGCAAA ACGCAAAGCA GATACGATCC GATGTGGAAA AAGCAAAACA ACTGGTTTTA AAATATGATG TTTATGCCAG TATAGCTCCG TCCTTTGTCG CATGGTTTCA TAACAAAAGC ATACATGATA TGGAACAGGC GTTAATAAAG CTTGGGTTTA AAGGTGCGGA TGAAACGGCG AAAGGTGCTT ACATTGTCAA GAAACAATAT GAAAAAATGA TAGAAGAGAA AAAAAGCAAA ATTATTATAT CTTCGTGCTG CCATACGGTA AACACCTTGA TCCAAAGGCA TTATACCGGC GCAATTCAGT ATCTGGCAGA TGTAGTCTCG CCGATGCTTG CCCATGCGCA AATGCTGAAA AAAGAACATA AAGGTGCAAA GGTTGTATTT ATCGGACCGT GCATTTCCAA AAAAGACGAA GCAGAAAAAT ATAAAGGTTA CGTGGAACTT GTGCTTACTT TCGATGAACT GGATGAATGG CTAAAATCAG AAAATATTAC AATTGAAAGC AATAGGGGCA GCTCTAAAGA GGGAAGGACC AGAAGTTTTC CTGTTTCCGG AGGCATTATC AGTAGTATGG ATAAAGATTT AGGATATCAT TACATGGTAG TGGACGGTAT GGAAAACTGC ATAAATGCTT TGGAGAATAT AGAAAGAGGA GAGATTGACA ACTGTTTTAT TGAGATGTCG GCATGCAGAG GCAGTTGTAT AAACGGTCCG CCTGCAAGGC GCAAAAGCAA TAATATTGTC GGAGCTATAT TAGCCGTAAA TAAAAACACC GGAGCAAAAG ATTTTTCTGT CCCGATGCCC GAACCTGAAA AACTAAAAAA AGAATTCCGG TTTGAAGGTG TGCACAAGAT AATGCCCGGG GGTACTGCCA TCGAGGAGAT TTTAAAGAAA ATGGGCAAGA CCTCCATAGA GCATGAGCTA AACTGCGGAA GCTGTGGTTA TGACACCTGC CGCGACAAAG CGGTAGCAGT ATTAAATGGA AAAGCAGACC TTACAATGTG TCTTCCATAC CTGAAAGAAA AAGCGGAAAG CTTTTCAGAT GCAATTATTA AAAATACACC CAACGGTGTT ATAGTATTAA ATGAAGACCT TGAGATTCAG CAGATAAACA ATTCCGCAAA AAGAATTCTG AACTTAAGCC CGTCTACAGA TCTTTTGGGA AGTCCTGTTA GCAGAATACT TGATCCTATA GATTATATTC TTGCGTTGAG AGAGGGCAAA AACTGTTACT ATAAAAGGAA GTATTTTGCG GAATATAAAA AATATGTCGA CGAAACCATA ATTTATGATA AGGAATATCA CGTCATTATC ATAATTATGA GGGATGTGAC TGAGGAGGAG AAAATCAAGG CGCTGAAAAA CAAGCAAAGC GAGGCGGCAA TTGAAATAGC GGATAAAGTT GTGGAAAAGC AGATGCGCGT AGTGCAGGAA ATAGCACTGC TTTTGGGAGA AACGGCCGCT GAGACTAAAA TTGCTTTGAC CAAGCTGAAG GAGACTATGG AGGATGAATG A
|
Protein sequence | MTECLQTKKS NCKNCYKCIR HCPVKSLKFT DGQAHIVRDE CVLCGECYVV CPQNAKQIRS DVEKAKQLVL KYDVYASIAP SFVAWFHNKS IHDMEQALIK LGFKGADETA KGAYIVKKQY EKMIEEKKSK IIISSCCHTV NTLIQRHYTG AIQYLADVVS PMLAHAQMLK KEHKGAKVVF IGPCISKKDE AEKYKGYVEL VLTFDELDEW LKSENITIES NRGSSKEGRT RSFPVSGGII SSMDKDLGYH YMVVDGMENC INALENIERG EIDNCFIEMS ACRGSCINGP PARRKSNNIV GAILAVNKNT GAKDFSVPMP EPEKLKKEFR FEGVHKIMPG GTAIEEILKK MGKTSIEHEL NCGSCGYDTC RDKAVAVLNG KADLTMCLPY LKEKAESFSD AIIKNTPNGV IVLNEDLEIQ QINNSAKRIL NLSPSTDLLG SPVSRILDPI DYILALREGK NCYYKRKYFA EYKKYVDETI IYDKEYHVII IIMRDVTEEE KIKALKNKQS EAAIEIADKV VEKQMRVVQE IALLLGETAA ETKIALTKLK ETMEDE
|
| |
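The sequences above are printed in space-separated blocks of ten, so they need cleaning before any computation. The following is a minimal sketch of how one might strip the grouping, compute a GC fraction, and translate the coding sequence. It is not the database's own tooling. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is built from the standard NCBI amino-acid string in TCAG base order; translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which start codons it permits, which this sketch does not model.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); shared by table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for ten-character grouping and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence in the record.
prefix = "ATGACCGAAT GTTTGCAAAC TAAAAAATCA"
print(translate(prefix))  # MTECLQTKKS
```

The translated prefix matches the first block of the listed protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the same comparison against the GC content and protein sequence fields.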