Gene Information |
Locus tag | Cthe_1872 |
Symbol | |
ID | 4809203 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2222288 |
End bp | 2223943 |
Gene Length | 1656 bp |
Protein Length | 551 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640107291 |
Product | Tn7-like transposition protein C |
Protein accession | YP_001038286 |
Protein GI | 125974376 |
COG category | |
COG ID | |
TIGRFAM ID | |
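The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS length counts a terminal stop codon that is not translated; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above (Start bp, End bp).
length = gene_length_bp(2222288, 2223943)
print(length, protein_length_aa(length))  # prints: 1656 551
```

Under these assumptions the result agrees with the listed Gene Length (1656 bp) and Protein Length (551 aa).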
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGCAAAAG TGGTAATACC CAACGGTGCG GAGGGAGAAG AGGCAGTATA TAGGGAGCAG GTAATAAGAG AATATTCAGG AAATCCTTTT ATAGAGGCGC TTCCGCCTAT TTATGCTCCT GAAGAAGTGG TCGAAAAGTT GGCCGTTTAT CCGCCATACA GTGAAGAGGA GCGAAATCTC GACGCACACT ATAGGATACA TCTTACCCAA AAACTTTTTC ATTGCTTCCA GCCTCTACCC CAGCATTTGG ATTTGGAAAG CAGGATTTCA AGAGTTATAC GGCAGGGGTA CTTGCCGAGG AATCCAGTGA GCAAAGAATA TGTAATAAGT TTAAAGGATG GGTATAGGGC TGTACAGAAT TGTGATATCA ATGCCAACCA GCAGTTTAGA AGCACAGCCT GTGGTTTTAC TATTATTGGC GTATCAGGCA TGGGAAAATC AGTATCAATT AATAGGGTTT TGTCTATGTA TCCGCAGGTA ATTGTGCATA GCAGGTATAA GGGGCAGGGA TTCAGTTTTT ATCAATTAGT ATGGTTAAAG TTGGACACCC CTTTCGACGG AAGCGTGAAA GGACTCTGTA TTGATTTTTT TAACAAAGTA GACCAATTAA TGGGGACTGA TTTTTACAAG AAAGTTGCTA ATTCAAGAAA GTCTGTTGAT TATATGCTGA CGCTTATGTG TCAGGTAGTA AGAAATACAG GGCTGGGCTT GCTTATTATC GATGAAGTCC AGCATCTGTG CGGGGCAAGA GGCCTTGGCG ATGAAAGGAT GCTTAACTTT TTTGTAACTC TTGTCAATAC CGCAGGAGTA CCTATTATAT TAATAGCAAC TCCCAAGGCT ATGTCAATTT TACAGAGTGA ATTCCGACAG GCAAGAAGGG GTTCAGGCCA GGGGGATATG GTGTGGGAAA GGTTACAGTA TGATGAAAAC TTTGAAATAC TGCTGGATTC CCTTTGGCAG TATCAGTGGA CGAGGAAAGA AAGCATATTA ACAAAGAGCA TGCGAGAATT GTTGTATGAA GAATCTCAGG GAATAATTGA TATAATCTGT AAAATCATTG TCATGGCACA AACTATTGCA ATTTCCACTG GAAAAGAGCA GGTTGATGAA AAATTAATAC AACAAGTAGC AAGAGAACAT CTCCAATTGG TTAGACCGAT GATTTTAGCG TTGAAATCGG GGAACATTAG GGAGATTGCT AAATATTCTG ATATTTGCAC GGCTGAGATT GACTATGGCA AGTTAGTGAA TCAGGAGAAA CTGCCAGTAG AAATGCGGAT GAGAATAAGG GCAATAAAGG AGCAAAAGGA AAAAAGGGAA AAGGAAAATA AGACTTCAAA AATGGAACAG GCTCTTTACA AGTTAGCGGA ACTTGGTATT GATGTTGCAA AAGCCAGAAA AGCCGTTGAA GCAGTGTTTG AAGCAGAAGG CCATAATATA GATGAAGGAC AACTTGTTAT AAAAGCGATT CAGGTTCTTT CAGGAAACAC AACGGAAAAG AAGTCTAAAA AAGTGAAGAC TACTCCGAAA TCAGAAAATG ACTTGCGCTA TATTGTGGAG GAAGGTAGAA ATCAGGGCAA ATCGGCATAT GATGCATTAA AGGACAAGGG ATACATCAAA GGAGAGAAGA ATGATACTTT GTTCAAGGCG GTGTAA
|
Protein sequence | MAKVVIPNGA EGEEAVYREQ VIREYSGNPF IEALPPIYAP EEVVEKLAVY PPYSEEERNL DAHYRIHLTQ KLFHCFQPLP QHLDLESRIS RVIRQGYLPR NPVSKEYVIS LKDGYRAVQN CDINANQQFR STACGFTIIG VSGMGKSVSI NRVLSMYPQV IVHSRYKGQG FSFYQLVWLK LDTPFDGSVK GLCIDFFNKV DQLMGTDFYK KVANSRKSVD YMLTLMCQVV RNTGLGLLII DEVQHLCGAR GLGDERMLNF FVTLVNTAGV PIILIATPKA MSILQSEFRQ ARRGSGQGDM VWERLQYDEN FEILLDSLWQ YQWTRKESIL TKSMRELLYE ESQGIIDIIC KIIVMAQTIA ISTGKEQVDE KLIQQVAREH LQLVRPMILA LKSGNIREIA KYSDICTAEI DYGKLVNQEK LPVEMRMRIR AIKEQKEKRE KENKTSKMEQ ALYKLAELGI DVAKARKAVE AVFEAEGHNI DEGQLVIKAI QVLSGNTTEK KSKKVKTTPK SENDLRYIVE EGRNQGKSAY DALKDKGYIK GEKNDTLFKA V