Gene Information |
Locus tag | Cthe_1334 |
Symbol | |
ID | 4809474 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 1624461 |
End bp | 1626023 |
Gene Length | 1563 bp |
Protein Length | 520 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640106758 |
Product | FHA domain-containing protein |
Protein accession | YP_001037759 |
Protein GI | 125973849 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
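The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates (the record does not state its convention, but the listed values agree under this one) and a CDS that ends in a single stop codon. The function names are hypothetical and introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded, assuming an unspliced CDS ending in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the terminal stop codon contributes one residue.
    return gene_len_bp // 3 - 1


length_bp = gene_length(1624461, 1626023)
print(length_bp)                  # 1563
print(protein_length(length_bp))  # 520
```

Both results match the Gene Length (1563 bp) and Protein Length (520 aa) fields of the record.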
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTCAATG ATTACCTGAA AAAATTTGAA TTTAATTATG AAAGCAGTGC CACAGGCAGC TTTCTTGTTG TGAGCACGGA TGCCGGTGAA AATGTGCTTT TATACCAGGT TGAAATGCTC GCCAGCAATC CAAACAAAAA TATCTTGCCC CTGGACATAA GACAAAAGGA CGCAAAGTAC AATTTTTACT ACAATATTAC TTCCAAACTG GCTTTATCCC AATATCTCAA ACGCAACAAG CTAAAAAGGA ATGATTTTAT AAACATATTC AAAGATATAA CAAAAACAAT TTTAAGTGCC AAAGATTACT TACTGTCAGA CAGGTCATTT ATTTTAAATG AGGATTATAT TTTTATAGAT CCAAACACGA TGGATGTTTC CCTGGTTTAT CTTCCGCTGA ACATAGAGAA CAATGTAAAT GAGCAGTTAA AAAATTTCAC CATGAACTTT ATAATGTTCA GCGCAAATAT TGAGGAAAAC AACAGCGACA ATTTCCTTCA AAGAATTTTA AATCTTTTGA AATCAGATAC CTTTAATATT CTCGAATTTA ACAAATTGCT GAATGATTTA GAGGCGGAAT CCGGCATGCC AAAACCGGTC GTACAAAATC CTGCTGTTTC AGAGCAAAAT GTGCCTTTGC AGGCTTCACC GACTCCGGCA CCGCCGAAAC CAAATATTCC AGGACAGGAT ATTCCCAAAA CTTCCGTGCC AAAGCCACAG ACGCCAAAAC CGGGCCCGAT AAGACCAAAT GTTCCAAAAA CAGCTCCACA GGGACAGGTT GCTCAAAGAC CTTCGAAACC TGTGAATACA AATGAAAAAA CCGTAGTTAA AATGAAATAT AAAACCAGTG TGATAATAAT CGGTGCCGTA TTGCAGGCAG TTATTGTCAT CGGATCCGTT GCCCTTTTAG CGTCAGGAGC CATGGATTCT TTAGGAAACG AACCAATTGT AAACATATTG GGCATCGGCA TTCTGGCAGG TGCCATATCC TACGCCTTAT GGAAAACGAT TCTGAATGAA AAGAACAAGG TTGAAACAGT CACCAAAGTA GTGGAAAAAC CTGATCCAAG ACCGCAAATA AACATGGAAA AAAGAAATTT CACCACTGTT CCGAATATGC CAAACGTTCC TCCAAGAAAC AATACACCTT CGCAGCCGGA ATATAATTTT AACAACGCAG GAAACGCCCG AAATCCTTTA AATGAAACAA CAGTAATTTT TCCGACCCAT GTGGAGGAAA CCGTTTATCT GGGCACCTCA AATTCATACC CTTATCTGCA GGGAACAGTA AACGGTGTGA CTGAACAAAT AATAATCAAC AAACCCAGCT TTATAATAGG AAGACTGAAA AGCCAGGTGG ACTACATAAG CCAAAACAAT GCGGTCGGAA AGGTACACGC GGAAATAATA TCAAGAGACG GACGTTATTT TGTGAAAGAT TTGAATTCCA AAAACGGAAC ATTTGTCAAT GGTGTGAGAA TTGCCGCAAA CACCGAATAT GAAATAAAAA ACAATGACAA AATCACTTTT GCCAACAGTG AATATGTTTT TATAATTCCT TAA
|
Protein sequence | MVNDYLKKFE FNYESSATGS FLVVSTDAGE NVLLYQVEML ASNPNKNILP LDIRQKDAKY NFYYNITSKL ALSQYLKRNK LKRNDFINIF KDITKTILSA KDYLLSDRSF ILNEDYIFID PNTMDVSLVY LPLNIENNVN EQLKNFTMNF IMFSANIEEN NSDNFLQRIL NLLKSDTFNI LEFNKLLNDL EAESGMPKPV VQNPAVSEQN VPLQASPTPA PPKPNIPGQD IPKTSVPKPQ TPKPGPIRPN VPKTAPQGQV AQRPSKPVNT NEKTVVKMKY KTSVIIIGAV LQAVIVIGSV ALLASGAMDS LGNEPIVNIL GIGILAGAIS YALWKTILNE KNKVETVTKV VEKPDPRPQI NMEKRNFTTV PNMPNVPPRN NTPSQPEYNF NNAGNARNPL NETTVIFPTH VEETVYLGTS NSYPYLQGTV NGVTEQIIIN KPSFIIGRLK SQVDYISQNN AVGKVHAEII SRDGRYFVKD LNSKNGTFVN GVRIAANTEY EIKNNDKITF ANSEYVFIIP
|
| |
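The gene and protein sequences above are printed in space-separated blocks of 10, so they need cleaning before use. The following is a minimal sketch, using only the Python standard library, of how one might strip the spacing, translate the gene sequence, and compute GC content for comparison with the record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons; alternative start codons are not handled here, which should not matter for this gene because it begins with ATG. The names `clean`, `translate`, and `gc_fraction` are hypothetical helpers, not part of any database API.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
# Table 11 shares these assignments with the standard genetic code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace that splits the record's sequences into blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence from the record.
print(translate("ATGGTCAATG ATTACCTGAA AAAATTTGAA"))  # MVNDYLKKFE
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence is one way to confirm that the two fields agree; `gc_fraction` on the full gene sequence can be compared with the listed GC content.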