Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1115 |
Symbol | |
ID | 4811413 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1329264 |
End bp | 1330919 |
Gene Length | 1656 bp |
Protein Length | 551 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640106537 |
Product | Tn7-like transposition protein C |
Protein accession | YP_001037540 |
Protein GI | 125973630 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAAAG TAATAATACC TAACGGTGCT AATGCTGTAG TTGCAGAATA CAAGGAACAG TTGATACCTG AATATAGCGG GAATCCATTT ATTGAAGCAC TACCACCGGT TTACTCTAAG GAAGAGGTAG TGGAGAAACT GTCTCTGTAT CCCCGCTATA ATCCAGAGGA AAGACGATTG GAAGACCACT ACCGTATTCA TATGGTGCAG CGGTTGTTCC AGTGCTTTCA GCCGTTGAGT ATTCATCTTG ACCTAGAAAG TAGAATAAGC AGGGTTATAA GGCAGGGATA CCTGGCACGT AACCCATTTA AACCTTCCTA TGCTGAAAGC CTACAAGACG GATATAAGGC TATACAAAGC ATGAAATGGG AGTTAAGCAG CAACGCATCC TTCAGGACTA CTGCATCGGG ATTTACTATT ATAGGTGTAA GTGGAATGGG TAAGACAACT GCTATCAACC GTGTATTATC TCTTTATCCC CAGATAATTG TACATTCAGA ATATAATAAT ACCAATTTTA GTATGTACCA ACTGGTTTGG CTTAAGTTGG ACTGCCCTTT TGATGGTTCT TTGAAAGGCT TGTGTATTGA GTTCTTCCAT AAGGTAGATG ACCTGTTGGG TACAGACTAT CATAAGAAAT TCGGGGTAGG CAGGAATACG GTAGACACTA TGCTCTCCGT TATGTCTCAG ATAGCCAGAA ATACAGCGTT AGGAGTATTA GTAATTGACG AGATTCAACA TCTGAGCAGT GCAAAAAGCG GAGGGGATGA AAAGATGCTT AACTTTTTTG TCACCCTTGT AAATACTATC GGAGTGCCTA CTGTACTTAT TGGTACAACA AAAGCATTAT CAGTTTTGCA ATCTGAATTC CGTCAGGCAA GGCGTGGAAG CGGACAAGGG GATATGATTT GGGAGAGGTT GAGCAAAGAT AAAAGTTGGG AACTGCTTAT CAATGCATTT TGGGACTATC AATGGACCAA AAAGGAAGTA CCGCTAACAC CTGAATTAAG TGATGTTCTC TATGAAGAGT CACAAGGCAT TATAGACATT GCGGTAAAAC TTTATGCGAT GTCACAAATA CGGGCTATTC TTTCAGGGAA AGAGGTTATC ACAGCAAATC TGATTAGGCA GGTTGCAAAA GATAATTTAA AATTGGTTCG TCCTATGCTG GAAGCATTAA AATCAGGAAA TATTAAAGAA ATCGCAAAAT ATGAAGATAT TTGCACTGTA GACATTGATT TTATGGGATT TGTGGACAAA AGCAAACAGT CAGTAGATTG GGACATGAGG ATGAAAATGC TTCAAAAGCA GCAAAAGAAA AAAGAAGAGG AAGTCAATCT TTCAAAGAAG GAACAGGCAA TTCTTAAATT GCTGGACTTA AATATTGATG CCAAAAAGGC TCAAAAAGCA GTTGAGAAGG TTCTTGATAA GGAAGAAGGG CTTGAAGTTT CTGAGATTGT AATAAAGGCT GTACAGATGA TAGCAAACAA TGGTAAATTA AAACAAAAGG AAAAGAGTAA AGCAAAGAAT ATGGATGAAA ATGATATAAG GTATATTGTG GAAGAAGGCA GGAAAAATAA AAAATCAGCC TATGAATCAT TAAATGAAAA AGGGCTTATT AAGCAGGTAG AAAAAGACTT TTTCAAGGCG GTGTAG
|
Protein sequence | MNKVIIPNGA NAVVAEYKEQ LIPEYSGNPF IEALPPVYSK EEVVEKLSLY PRYNPEERRL EDHYRIHMVQ RLFQCFQPLS IHLDLESRIS RVIRQGYLAR NPFKPSYAES LQDGYKAIQS MKWELSSNAS FRTTASGFTI IGVSGMGKTT AINRVLSLYP QIIVHSEYNN TNFSMYQLVW LKLDCPFDGS LKGLCIEFFH KVDDLLGTDY HKKFGVGRNT VDTMLSVMSQ IARNTALGVL VIDEIQHLSS AKSGGDEKML NFFVTLVNTI GVPTVLIGTT KALSVLQSEF RQARRGSGQG DMIWERLSKD KSWELLINAF WDYQWTKKEV PLTPELSDVL YEESQGIIDI AVKLYAMSQI RAILSGKEVI TANLIRQVAK DNLKLVRPML EALKSGNIKE IAKYEDICTV DIDFMGFVDK SKQSVDWDMR MKMLQKQQKK KEEEVNLSKK EQAILKLLDL NIDAKKAQKA VEKVLDKEEG LEVSEIVIKA VQMIANNGKL KQKEKSKAKN MDENDIRYIV EEGRKNKKSA YESLNEKGLI KQVEKDFFKA V
|
| |