Gene Cthe_1334 details


Gene Information

Locus tag: Cthe_1334
Symbol:
ID: 4809474
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1624461
End bp: 1626023
Gene Length: 1563 bp
Protein Length: 520 aa
Translation table: 11
GC content: 38%
IMG OID: 640106758
Product: FHA domain-containing protein
Protein accession: YP_001037759
Protein GI: 125973849
COG category:
COG ID:
TIGRFAM ID:
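
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(1624461, 1626023)
print(length)                     # 1563
print(protein_length_aa(length))  # 520
```

Both values match the Gene Length and Protein Length fields of the record.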


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTCAATG ATTACCTGAA AAAATTTGAA TTTAATTATG AAAGCAGTGC CACAGGCAGC 
TTTCTTGTTG TGAGCACGGA TGCCGGTGAA AATGTGCTTT TATACCAGGT TGAAATGCTC
GCCAGCAATC CAAACAAAAA TATCTTGCCC CTGGACATAA GACAAAAGGA CGCAAAGTAC
AATTTTTACT ACAATATTAC TTCCAAACTG GCTTTATCCC AATATCTCAA ACGCAACAAG
CTAAAAAGGA ATGATTTTAT AAACATATTC AAAGATATAA CAAAAACAAT TTTAAGTGCC
AAAGATTACT TACTGTCAGA CAGGTCATTT ATTTTAAATG AGGATTATAT TTTTATAGAT
CCAAACACGA TGGATGTTTC CCTGGTTTAT CTTCCGCTGA ACATAGAGAA CAATGTAAAT
GAGCAGTTAA AAAATTTCAC CATGAACTTT ATAATGTTCA GCGCAAATAT TGAGGAAAAC
AACAGCGACA ATTTCCTTCA AAGAATTTTA AATCTTTTGA AATCAGATAC CTTTAATATT
CTCGAATTTA ACAAATTGCT GAATGATTTA GAGGCGGAAT CCGGCATGCC AAAACCGGTC
GTACAAAATC CTGCTGTTTC AGAGCAAAAT GTGCCTTTGC AGGCTTCACC GACTCCGGCA
CCGCCGAAAC CAAATATTCC AGGACAGGAT ATTCCCAAAA CTTCCGTGCC AAAGCCACAG
ACGCCAAAAC CGGGCCCGAT AAGACCAAAT GTTCCAAAAA CAGCTCCACA GGGACAGGTT
GCTCAAAGAC CTTCGAAACC TGTGAATACA AATGAAAAAA CCGTAGTTAA AATGAAATAT
AAAACCAGTG TGATAATAAT CGGTGCCGTA TTGCAGGCAG TTATTGTCAT CGGATCCGTT
GCCCTTTTAG CGTCAGGAGC CATGGATTCT TTAGGAAACG AACCAATTGT AAACATATTG
GGCATCGGCA TTCTGGCAGG TGCCATATCC TACGCCTTAT GGAAAACGAT TCTGAATGAA
AAGAACAAGG TTGAAACAGT CACCAAAGTA GTGGAAAAAC CTGATCCAAG ACCGCAAATA
AACATGGAAA AAAGAAATTT CACCACTGTT CCGAATATGC CAAACGTTCC TCCAAGAAAC
AATACACCTT CGCAGCCGGA ATATAATTTT AACAACGCAG GAAACGCCCG AAATCCTTTA
AATGAAACAA CAGTAATTTT TCCGACCCAT GTGGAGGAAA CCGTTTATCT GGGCACCTCA
AATTCATACC CTTATCTGCA GGGAACAGTA AACGGTGTGA CTGAACAAAT AATAATCAAC
AAACCCAGCT TTATAATAGG AAGACTGAAA AGCCAGGTGG ACTACATAAG CCAAAACAAT
GCGGTCGGAA AGGTACACGC GGAAATAATA TCAAGAGACG GACGTTATTT TGTGAAAGAT
TTGAATTCCA AAAACGGAAC ATTTGTCAAT GGTGTGAGAA TTGCCGCAAA CACCGAATAT
GAAATAAAAA ACAATGACAA AATCACTTTT GCCAACAGTG AATATGTTTT TATAATTCCT
TAA
 
Protein sequence
MVNDYLKKFE FNYESSATGS FLVVSTDAGE NVLLYQVEML ASNPNKNILP LDIRQKDAKY 
NFYYNITSKL ALSQYLKRNK LKRNDFINIF KDITKTILSA KDYLLSDRSF ILNEDYIFID
PNTMDVSLVY LPLNIENNVN EQLKNFTMNF IMFSANIEEN NSDNFLQRIL NLLKSDTFNI
LEFNKLLNDL EAESGMPKPV VQNPAVSEQN VPLQASPTPA PPKPNIPGQD IPKTSVPKPQ
TPKPGPIRPN VPKTAPQGQV AQRPSKPVNT NEKTVVKMKY KTSVIIIGAV LQAVIVIGSV
ALLASGAMDS LGNEPIVNIL GIGILAGAIS YALWKTILNE KNKVETVTKV VEKPDPRPQI
NMEKRNFTTV PNMPNVPPRN NTPSQPEYNF NNAGNARNPL NETTVIFPTH VEETVYLGTS
NSYPYLQGTV NGVTEQIIIN KPSFIIGRLK SQVDYISQNN AVGKVHAEII SRDGRYFVKD
LNSKNGTFVN GVRIAANTEY EIKNNDKITF ANSEYVFIIP
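
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a block could be cleaned up, translated, and summarized is given below. It assumes the amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs only in its permitted start codons); the helper names `clean`, `translate`, and `gc_percent` are hypothetical and do not come from the record or from any library.

```python
from itertools import product

# Codon table built in the conventional TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq_block):
    """Remove the spaces and line breaks used for display."""
    return "".join(seq_block.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence listed above.
print(translate("ATGGTCAATG ATTACCTGAA AAAATTTGAA"))  # MVNDYLKKFE
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.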