Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1114 |
Symbol | |
ID | 4811412 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1327372 |
End bp | 1329258 |
Gene Length | 1887 bp |
Protein Length | 628 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640106536 |
Product | Tn7-like transposition protein D |
Protein accession | YP_001037539 |
Protein GI | 125973629 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGACGT TTTTTCCTGT GCCATATGAG GATGAAGTAT TATACAGCGT CCTTGCAAGA TACCATGTGA GGAGTGGAAA TATAAGTTAT AAGGCGACAA TGAGAGACCT TTTCGGGTCA ACTTCCGTAA CAGCGGTGAT GGACTTGCCT TCAAATATAC ATAACCTCGT CAATAATATG CCCCTTAATT CCAGGTATAC CGAAGAATAT CTTATTAAAA ATCATACACT TTTTCCGTTC TATTCTGCTT TCTTGCCTCC TGAGCGTGCA GAACAAGTTT TTCAATCAAT GAAAGGGGAA AATGGAGGAA GCATATATAC CCGGACTGGA ATTATGGCCA GTTCTATAGT ATTAAACCAG TATTTTAAGT TTTGTCCTAC ATGTACTGAA GAAGATAAGT TGCAGTACGG AGAGTTATAC TGGCATAGAG TTCATCAAAT TTCTGGAGTA CTGGTATGCC CAAAACATTA TGTTCCTTTA TATAATAGCC TGGTACCGGT CAGGGGATAT AATAAATACC AATATAAGGC TGCTAGCGAA GAGAACTGTG TAAAGCCAGA TATTAATGTG ATATACGCTG ATGATGTTTT TGAAAAACTG GTTAGGCTTG CAGAGGATGC TCAGGTTTTA CTTAACAGTG ATTTTGAAAA GAGAAATATA GAATGGTATA AAGAACAGTA TCTTGCAAAG ATGATGGAAA TGGGATTTGC AACTGTGAAT GGGAAAGTGT ACCGAAAGAG GCTTATAAAA GAGTTTATTA ATTATTATGG TGAAGAATTT TTAGATATGG TGCAGTCCAG TGTTGATGTA GATAACGATT CAAATTGGCT AATGGATATG ATAAGAAAGA AAAACAAGAC TGCTCATCCA ATAAGACATT TGCTTTTGTC ACAGTTTCTT GGTATTTCAC TTCAAGATTT ATTTAATAAA AAGATGGAAT ATAAGCCTTT TGGAGATGGA CCATGGCCTT GCTTGAATGC AGCATCAGAC CACTACCTAA AGCCGGTTGT TTCTGATTTA AAAGTTGCAT ACAGTACGGA TTCAAAATGT CCTGTTGGGA CTTTTTCTTG TACTTGCGGT TTTGTTTATA CAAGAAGTGG GCCAGATGAG TCTGAAGATG CAAGATATAG GTTCGGAAGG ATAAAAAAGT TTGGACAAGT ATGGGAGGAG AAACTCAAAG AATTAGTAGA CCGAAAATTG AGTTTAAGAG AAACAGTAAG GTTATTAGGG GTAGACCCTA TTACAGTTAA GAAGTATGCT AAGAAACTTG GATTAACGAC TTACTGGGAA AAGCGGAGTG AGGCTGATGT TGTATATGAT TATGAAAAGA GCAGTTATTC TTCAATGAAA TTGGATGATA AGGATTACTA CAGAAAAAGG TGGAATGAAT TAAGAAAGCA ATATCCAGAG ATGGGAAAAA CGCAATTACG ACAGGTTGAT AAGGCTTTAT TTGCCTGGCT TTATAGAAAT GACAGGGAAT GGCTAAACCA GAACTCACCT GATAAAAAAG TGTCTAATAC TGTAAACAGA AGAGTTGATT GGAATCAAAG AGATAATGAG ATATTGTCTC AGATAAAAGA AATAGTTGAT AAGATGCTGA ATTCGGACGA AAAGCCTGAA AGGATTACTA TTAGTCTAAT TGGTAGTAAA TTAGGTATAA GAGGCTTGCT TGAAAAGCAT TTAGATAAAC TTCCAAAGAC AAAAGCATAC CTGGATTCTG TTAAGGAGAC CAATCATGAC TTCAGGTTAA GAAGGTTTCG CTGGGCAGTT AAGGAATTAG AGAAAGAAGG AGAAGAACTG CAACTGTGGA AGATTATGAG GAAGGCTGGG ATAAGGAATA TATATCAAAT TTCAATCCAA TTTGAAGGAA GTGGTAATTT TAAATAG
|
Protein sequence | MMTFFPVPYE DEVLYSVLAR YHVRSGNISY KATMRDLFGS TSVTAVMDLP SNIHNLVNNM PLNSRYTEEY LIKNHTLFPF YSAFLPPERA EQVFQSMKGE NGGSIYTRTG IMASSIVLNQ YFKFCPTCTE EDKLQYGELY WHRVHQISGV LVCPKHYVPL YNSLVPVRGY NKYQYKAASE ENCVKPDINV IYADDVFEKL VRLAEDAQVL LNSDFEKRNI EWYKEQYLAK MMEMGFATVN GKVYRKRLIK EFINYYGEEF LDMVQSSVDV DNDSNWLMDM IRKKNKTAHP IRHLLLSQFL GISLQDLFNK KMEYKPFGDG PWPCLNAASD HYLKPVVSDL KVAYSTDSKC PVGTFSCTCG FVYTRSGPDE SEDARYRFGR IKKFGQVWEE KLKELVDRKL SLRETVRLLG VDPITVKKYA KKLGLTTYWE KRSEADVVYD YEKSSYSSMK LDDKDYYRKR WNELRKQYPE MGKTQLRQVD KALFAWLYRN DREWLNQNSP DKKVSNTVNR RVDWNQRDNE ILSQIKEIVD KMLNSDEKPE RITISLIGSK LGIRGLLEKH LDKLPKTKAY LDSVKETNHD FRLRRFRWAV KELEKEGEEL QLWKIMRKAG IRNIYQISIQ FEGSGNFK
|
| |