Gene Information |
Locus tag | Cthe_1871 |
Symbol | |
ID | 4809202 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2220405 |
End bp | 2222282 |
Gene Length | 1878 bp |
Protein Length | 625 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640107290 |
Product | Tn7-like transposition protein D |
Protein accession | YP_001038285 |
Protein GI | 125974375 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
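The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates, and that the reported protein length excludes the stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 2220405
end_bp = 2222282
gene_length_bp = 1878
protein_length_aa = 625

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS is unspliced (as the record states) and ends in a single
# stop codon that is not counted in the protein length.
codons, remainder = divmod(gene_length_bp, 3)

print(span == gene_length_bp)            # True
print(remainder == 0)                    # True
print(codons - 1 == protein_length_aa)   # True
```

Under these assumptions the three fields agree: 1878 bp corresponds to 626 codons, that is 625 amino acids plus one stop codon.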
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGCATT TTTTCCCAAC CCCATATCCT GATGAAATCC TTTATAGTGT ATTGGCACGC TATAGTGTCC GCTGTGGGAT TACAAGTTAT CAAACTATCA TGGAGAGCAT ATTTGGAAAG TGTAGTTCCA GGGCTGTAAT GGAAATGCCT TTTAATTTGA ATTCGCTGGT ATCTAACCTA CCTGTGAATT GTCCTTATAC TGCTGATGAT TTGATTTATA ATCATACTCT GTACCCATTC TTTACTGCAT TTCTTCCAAA AGAACGGGCA GAAGAAGTAA AACAATTAAT GCTGTCTGAA GGTGGAAGCA AAATTTATGG TAAAGCAGGT ATTATCGGCA GTAGGATTCC ACTAAACCAA TATTTAAGAT TTTGTCCCAA ATGCTTCGAA GAAGAGCAGA AATTGTATGG TGAAGGATAT TGGCACAGGT TGCATCAGAT ACCCTTTGTA ATGGTGTGTC CTATACATAA AGCGATTCTT CATAATAGTA CTGTTTTAGT GCGGGGTCAT AATCCACAAG CCTATGTACC TGCTGACGCT GATAACTGCA TTAATAATGA ATTACTTTAT TTCAAACCTG AAACGATAGA GAAGTTTGTA CTCATTGCCC AAGATGCAAA AGTATTGCTT GAAAACCAGT ATCCCAATAA GCAGTTTAAA TGGTTTGCTA AACAGTATTT AGAAAGGTTT AAGGAATTAG GGTATGCCAA TATTAATGGT AAAGTGTATT GGGAAGAAGT AATAAAGGAC TTCATAGATT TCTACGGATT GGAGTTCCTA AATGCTGTTT GTTCTAATAT AGAAGACAAG GAACAGGGAA GATGGCTGAA AGAAATTACA CATAGTGATG CAAAGTCGGT GTATCCAATT CGGCACCTGA TGCTGGCAAG ATTTTTAGGA ATAGGAGTAG AAGAACTTTT TATAAAAGAG TTAAATTACA GACCTTTCGG AGAAGGGCCT TGGGTATGTC TTAATCCTGC TGCTGACCAC TTTCTTGAAC CTGTAATAAA AGACGTAGAA ATAAAGTATA GGCGTAGAAA TAAGAACACT AATGGTTTTT TCAGATGTCC CTGCGGATAT GAATATATGG AAACAGTTAC AAGAGAACAA GGAGAAAGGG GTAAGGGGCG GCGCAGATTT GTCAGGGTTG TAGAGTATGG ACATGTATGG AAGGCTAAGG CTCATGAATT GCATGAAAGT GGAGTAAGTA TTCAGGAAAT TGCAGTAAGA CTTAATGCAG ATATTAGTAC TGTAAGAAAG TATATTTCTG AAAAAGGACA GGAAAAAAGC AAAAAGTTAG TAGAACGTAA TAGTGTTGCT TCTGCGGAGT TTGAAGTAAA AAGACAGTAT CACAGGGAAA AGTGGCTTCA GATAGTCAGG GAAAATCCAG ACAAGGGCAG ACTTGAATTA CGGAGGCTGG GTAAATATAC CTGTACCTGG TTATGCAGGA ATGACAGGGA GTGGTGGGAG AAAAACACCC CTGCAAAGAA ATACGTTCAG GCTTACAGTA ATGTTGATTG GGAAACTAGG GACAAGGAAA TCTTACAGCA TGTTAAGCAA ACTGTTCAGG AGATTTTAGA AAGTGATGAA AAACCTCAAC GAATTAGCCT GCGGTTGATT AAAACTAAGT CAGGACTTAA AAGTTTTGAC CTTCAATTGG ATAAATTACC GTTGACTAAA TCATTCATCA ATTCTGTTAT AGAGAACCCT ATGGATTTGC ACAAAAGGCG TATACAATGG GCTATTGAAA AACTTAATAA AGAAGGAAAA GCATTAACTA TTTCTAATAT TACTGTCATG 
ACAGGTGTTG GTAATAAATA TAGGAAACTA GTTATAGAGG AGATTAAAAA GGCGTTAGAA GAGTTAGGTG AGAGGTAG
|
Protein sequence | MMHFFPTPYP DEILYSVLAR YSVRCGITSY QTIMESIFGK CSSRAVMEMP FNLNSLVSNL PVNCPYTADD LIYNHTLYPF FTAFLPKERA EEVKQLMLSE GGSKIYGKAG IIGSRIPLNQ YLRFCPKCFE EEQKLYGEGY WHRLHQIPFV MVCPIHKAIL HNSTVLVRGH NPQAYVPADA DNCINNELLY FKPETIEKFV LIAQDAKVLL ENQYPNKQFK WFAKQYLERF KELGYANING KVYWEEVIKD FIDFYGLEFL NAVCSNIEDK EQGRWLKEIT HSDAKSVYPI RHLMLARFLG IGVEELFIKE LNYRPFGEGP WVCLNPAADH FLEPVIKDVE IKYRRRNKNT NGFFRCPCGY EYMETVTREQ GERGKGRRRF VRVVEYGHVW KAKAHELHES GVSIQEIAVR LNADISTVRK YISEKGQEKS KKLVERNSVA SAEFEVKRQY HREKWLQIVR ENPDKGRLEL RRLGKYTCTW LCRNDREWWE KNTPAKKYVQ AYSNVDWETR DKEILQHVKQ TVQEILESDE KPQRISLRLI KTKSGLKSFD LQLDKLPLTK SFINSVIENP MDLHKRRIQW AIEKLNKEGK ALTISNITVM TGVGNKYRKL VIEEIKKALE ELGER
|
| |
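The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The sketch below is an illustration only. It assumes the gene sequence is shown in coding orientation (5' to 3', starting at the start codon), and it uses the standard codon-to-amino-acid assignments, which NCBI translation table 11 shares. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
# Translation table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; whitespace is ignored, '*' marks a stop."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 and last 18 nucleotides copied from the Gene sequence field above.
head = "ATGATGCATT TTTTCCCAAC CCCATATCCT"
tail = "GAGTTAGGTG AGAGGTAG"

print(translate(head))  # MMHFFPTPYP
print(translate(tail))  # ELGER*
```

The translated fragments match the first ten and last five residues of the listed protein sequence, with the final codon TAG acting as the stop.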