Gene Information |
Locus tag | Cthe_2111 |
Symbol | |
ID | 4810971 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 2507888 |
End bp | 2509945 |
Gene Length | 2058 bp |
Protein Length | 685 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640107518 |
Product | hypothetical protein |
Protein accession | YP_001038511 |
Protein GI | 125974601 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
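The coordinate and length fields above are mutually consistent, which can be checked with a short calculation. The sketch below is a minimal illustration, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the CDS ends in a single stop codon. The function names `feature_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding one terminal stop codon."""
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1


# Values copied from the record for Cthe_2111.
gene_len = feature_length(2507888, 2509945)
print(gene_len)                  # 2058, matching "Gene Length"
print(protein_length(gene_len))  # 685, matching "Protein Length"
```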
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCTTTG AGTTGGTTTC CCGCAACAGT AAAAGAAGCA GAAAAGAAAA CGGATTGTTT TTTGCATCCC TTTTGATATC CATAGTTGCA TTCTACAATA TTTTAGCGTT GTCGAAGCAG GATATTATGA TTTTTCTCGC AAAGCTGGAA AGCGATGCGG TAAACAAGCT TCTTGCCATG ATACCGTTGT TTTACGGAAT GACGTTATTT ATTCTGTTCT TCCTGGTCTA TTTTGCCAGC AAATTTCAAT TGGAGCGGCG AAAGCATGAA TTTGGTATTT ATTTAATGCT GGGAATGCGC CGTTCAAAAT TATTCTTTAT GTTACTGGCG GAAGATTTCC GCAGCAGTAT AACGGCACTT TTTATGGGAT TGCCCATTGC AATTTTACTA TCGGAGCTTA TCAGCTTAGT AACGGCGCGC CTTGCGGGGC TTGGTATTAT AGGACATCAA ATTTCAATTT CTTTTCAGGC AATCTTATGG ACAGCTGTTG GATTTTACCT TATTAAATTC ATTGCCTTTC TTATTCTCAG CGGTAAAATT GCAAGTCAGG AGATAGGCTC ATTGCTTGTA GAAACACCGG AAGGCACAAA AAAAGACCTT CCTTCTGTTG CTTATGCCCT GGCATTTTTA GCCGGTGTTA TCTTCTTGGC TGCTGCATAC GGCATGGCAA TCAGCGGTCT GGCCTGGGAT GAAGCCGGTA AAATGGGGCT TACTCTTATC CTTGGCTTTG CAGGTACATT GTTACTGTTT TTCGGTCTTC GCTCAGTTAT GGGTTTTTTG GCAAAACGCA GCGGTAACGG TCAAAAGCTG CAGGCCTTTA CTTTCCGCCA GCTTCAGGAA AATGTCATTC ACCGTTCCAC GACATTGGCT GTCAGCTCCC TGTTGATTTT AGCGGCACTG TGTTGTTTTA GCGCAGGTGT GGCCATTGCA CAGTTTTACA GTGGGTTGGA GCAGCATGTC TTAGATTATA CATTCAGCAG TGATTCACAA GATATCGCTG TTGTCAAAGA AACTTTGGAA TCCCATGGAT TGGATTCATT ATTCTCTGAG CTTTTTGAGA TGAGAGTAGG ATATATCCGG TCGGCAAAGG ATACGGATAA TGCGTTTGAG ATGAATTCCG TACTGGAATA TTTAGCCCGG CTGCCGGGAA GCGATGACAG GGATGTTCTT CTCAATAATC TTGGATATCA GACATATCCC CATGTGATTT CCCTGTCAGG TTACAACCGC CTGCTGTCTG TTGCAGGAAA ACTCACTATC GAGCTGGCGG AAGGCGAAGC CGCCGTTTAT ATGGACAATG AGTTTGTCTC GCCGGCCAAA TTGGAAATGC TCAATCAAAT TTTGGCACTA AATCCCGAAG TACAGATTGC CGGGGAAAAT TACCGCCTCA CAGGTACCGT CCAAAGTACC GACCTTGTGA CAGACAGTTC CATTACATTA TCCTTTGCAT TGATTGTGCC GGATGCTGAT TTTGAACGTT TTACCGAAAA TGATTACGAT ATTTATTTAA ATGCTGTCTT AAATCCGGAG CAAGTTGCCG GCAAAAGTCT TTTGAACGCT ATATCCGAGA CCAATGAAAA ACTGGATGCC GCAGGTTTAT TATACGAAAG CTATTTGCAA AATATGGGAC GTCAACTGTT TTATATTGCT GCTGCAAGCT ATATAACAAT CTATTTGGCC GTTGTTTTTC TAATAATTGC TAATACTATC ATTGGTGTTC AATTTCTTAC ATGGCAGCAA AAAACCAGCA GGCGGTATAA GACCCTGGTT CGGTTGGGAG CCTCGTATGA AACACTATGC 
CATTCGGCCG GAAAACAAAT TAATTGGTAT TTCGGCATCC CCGCAGCTGT TGCGGCAATT AGCAGTGTTT TCGGTGTTCG GGCCCTGTTT AGCGGCTTAT TATCCTCAAG GGTTAAAAGT AATATTTCTG CAATGATGAT AATTTCTGTA GCTATGGTGC TATTGCTTTG TGTGGTAGAA TATATTTATA TGGCTGTGGT TAAGAAATCC AGCAATCGTT ATATCCTGAC ACTCATGGTA CCGGAACGCG AAGAATGA
|
Protein sequence | MFFELVSRNS KRSRKENGLF FASLLISIVA FYNILALSKQ DIMIFLAKLE SDAVNKLLAM IPLFYGMTLF ILFFLVYFAS KFQLERRKHE FGIYLMLGMR RSKLFFMLLA EDFRSSITAL FMGLPIAILL SELISLVTAR LAGLGIIGHQ ISISFQAILW TAVGFYLIKF IAFLILSGKI ASQEIGSLLV ETPEGTKKDL PSVAYALAFL AGVIFLAAAY GMAISGLAWD EAGKMGLTLI LGFAGTLLLF FGLRSVMGFL AKRSGNGQKL QAFTFRQLQE NVIHRSTTLA VSSLLILAAL CCFSAGVAIA QFYSGLEQHV LDYTFSSDSQ DIAVVKETLE SHGLDSLFSE LFEMRVGYIR SAKDTDNAFE MNSVLEYLAR LPGSDDRDVL LNNLGYQTYP HVISLSGYNR LLSVAGKLTI ELAEGEAAVY MDNEFVSPAK LEMLNQILAL NPEVQIAGEN YRLTGTVQST DLVTDSSITL SFALIVPDAD FERFTENDYD IYLNAVLNPE QVAGKSLLNA ISETNEKLDA AGLLYESYLQ NMGRQLFYIA AASYITIYLA VVFLIIANTI IGVQFLTWQQ KTSRRYKTLV RLGASYETLC HSAGKQINWY FGIPAAVAAI SSVFGVRALF SGLLSSRVKS NISAMMIISV AMVLLLCVVE YIYMAVVKKS SNRYILTLMV PEREE
|
| |
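The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The sketch below is a minimal illustration in plain Python, using the first 30 and last 18 bases of the gene sequence as printed in this record. It assumes the standard codon assignments, which translation table 11 shares with table 1 for amino acids (the two differ in permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base spacing used above
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence in this record.
head = "ATGTTCTTTG AGTTGGTTTC CCGCAACAGT"
# Final 18 bases of the gene sequence, which are in frame (2058 - 18 = 2040).
tail = "CCGGAACGCG AAGAATGA"

print(translate(head))  # MFFELVSRNS, the first ten residues of the protein
print(translate(tail))  # PEREE, the last five residues of the protein
```

Passing the full gene sequence to `translate` in the same way should reproduce the 685-residue protein sequence, provided the two sequence lines are first joined into one string.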