Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1577 |
Symbol | |
ID | 4809568 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 1906032 |
End bp | 1907576 |
Gene Length | 1545 bp |
Protein Length | 514 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640106995 |
Product | hypothetical protein |
Protein accession | YP_001037996 |
Protein GI | 125974086 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAGAGACA TTTCCATGGA GCGCGTGAAT GAAACGCGCA AGAAAATACT GGATATCATC AAAAGCCCGA TACTAACTCA CGAGCAAAAG GTTGCTTCAC TGACCAATAC TGCCGATTCT CTGCTGGAAG TATTGGATTT GCCGCAGGGG CTGGACGAGC TGATGAATGT CCCGGCAGAC AGAAAATGCA TCTGCGACCT GAATGAGGGA CACGCTCCGC TTCGTCCCCG ATATATCATT CCCGACTATG CCAAATTCAT GAAGGAAGGC AGCCGGTTTC TGCAGCTTGC GCCACCGACC GACTTATATG AAGCGCTTAA TTCTCTGCTG ATTTTCTACA AGCATGTTCC AAGCGTAACC AACTTCCCGG TATATCTGGG ACAGTTGGAT ACGCTGCTGG AACCTTTTAT GCAAGACGTT GACGACACTA CGGCCAAAAA ACTTCTGCGA GGCTTTTTGG TGCACGTGGA CCGTACTATT CTTGACTCCT TCTCCCATGC AAATATCGGT CCCAAGCCCA CACGGACCGG CCGCTTGCTG CTTGAGGTGG AGGCAGAGCT TACGCAGGCC GTTCCCAATC TGACCTTAAA ATACGAAGAA GGCGTAACCG ATGACGACTT CGCCATTCAG GCTGTCAAGA CAGCCCTGCG TTCGGCCAAG CCCAGCTTTG CCAACCACCG CATGTTTACG CAGGAACTTG GTGAGAATTA TGTCATTGCC AGCTGCTATA ACGGATTGCC GCTGGGCGGC GGATCATACA CCCTTTGCCG CCTGATTTTG GGCAACATTG CAAAACGCGC GGGCGGAATT GAAGATTTCA GGACCAACCA GCTGCCCTAT GTTATGGACA TCATGGCACG TTACATGGAC TCCCGTATCC GCTTTGAAGT CGAAGAAAGC GGCTTCTTTG AAAATAACTT CCTGGCCAGG GAAGGCTTTA TCAGCCGCAA ACGCTTCACT GCCATGTTTG GCCTGGTGGG TCTGGCGGAA TGTGTAAATA TCCTGCTGGA AAAAGAGGGC CGTCCCGGCC GCTTTGGCCA CGATGAGTAC GCTACCGAGC TTGGTGAATC CATCATCAAG CAGATATATG AATTCAGTGA GAAACACCAT AACCCCTACT GCGAAATAAC CGGCGGTCAT TTTCTGCTTC ATGCGCAAGT AGGAATTTCA GAAGATCAAA ATATCTCGCC GGGTACCCGT ATCCCCATAG GAGAGGAACC CGACGAAATG ATTGATCACC TCATGGTTGT CAATCATTTC CATAAATATT TTCCATCCGG AACGGGCGAC ATTTTCCCTA TTGATGTTAC TGTTCACCAA AATCCCGAAT ACGTGCTGGA CATCGTCAAA GGCTCCTTCC GCAAGGAATT GCGCTATCTG TCCTTCTATG AGAAGAACAG CGATGTTATC CGTATCACGG GCTACCTGGT CAAACGGTCG GAAATCGAAA AGCTGAAAAG CGGTCAGAAT GTGCTGCAGG ACACCACCGC TTTGGGAATG GGCGCCGTCA TGAATGGAAA GATTTTGGAT AGAAAGGTGC GCTGA
|
Protein sequence | MRDISMERVN ETRKKILDII KSPILTHEQK VASLTNTADS LLEVLDLPQG LDELMNVPAD RKCICDLNEG HAPLRPRYII PDYAKFMKEG SRFLQLAPPT DLYEALNSLL IFYKHVPSVT NFPVYLGQLD TLLEPFMQDV DDTTAKKLLR GFLVHVDRTI LDSFSHANIG PKPTRTGRLL LEVEAELTQA VPNLTLKYEE GVTDDDFAIQ AVKTALRSAK PSFANHRMFT QELGENYVIA SCYNGLPLGG GSYTLCRLIL GNIAKRAGGI EDFRTNQLPY VMDIMARYMD SRIRFEVEES GFFENNFLAR EGFISRKRFT AMFGLVGLAE CVNILLEKEG RPGRFGHDEY ATELGESIIK QIYEFSEKHH NPYCEITGGH FLLHAQVGIS EDQNISPGTR IPIGEEPDEM IDHLMVVNHF HKYFPSGTGD IFPIDVTVHQ NPEYVLDIVK GSFRKELRYL SFYEKNSDVI RITGYLVKRS EIEKLKSGQN VLQDTTALGM GAVMNGKILD RKVR
|
| |