Gene Cthe_0821 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0821 
Symbol 
ID4810439 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp997660 
End bp999336 
Gene Length1677 bp 
Protein Length558 aa 
Translation table11 
GC content44% 
IMG OID640106238 
Productcoagulation factor 5/8 type-like protein 
Protein accessionYP_001037249 
Protein GI125973339 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000172205 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGA GGGTGTGCAT TCTTTTGGCT GTTGTCATTT TAATGACTGT CAGCCTGCCG 
TTTGTAACAC TGGGGGCGGA TGACATTTAT CCGGGACTTA GAGTTCAGGG GAGGTTTTTG
TACGACAAAT ACGGTGAAAA AATAATACTG TATGGTGTGA ACGAAATGTC GATCTGGGGC
GACATTGACG GTGATGTGGC ACTTCCTGAA ATCAGGAAAA CCGGAGCTAA TGCGGTAAGA
CTTGTATGGT CAGTTTCCGG GCCGGCAAGG AAACTTGATA TTTTGCTTTA CAATTGCCGT
ATAAACAATA TGATTCCTAT TATAGAGCTT CATGACGCAA CCGGTGAATG GCAGAAGCTT
TCCATGCTCG TTGACTATTG GACAAGACCC GATGTGCTTG AAGTATTGAA AAAACATCAG
GAATATCTCA TTATCAATAT TGGTAATGAA GTAGGAGCCC AGGTTTCGGA ATCTGAATTT
AAAACAGGGT ATGAGGCTGC CGTAAAGAGA ATGAGGGAAG CTGGAATTCA TGTTCCTCTT
ATGATTGACG GCAGCGACTG GGGAAAGAAT ATTGATATTC TTCAGGCAAC CGGACCTTAT
CTTATAAACG CCGACCCTGA CAAAAACCTC TTGTTCTCGG TACATATGTG GTGGCCGTAC
ATGTGGGGTT ACAATGAACA GAAAGTAATC GATGAAATAA AAGAATCTGT TGAAATGGGC
TTGCCCCTTG TGGTGGGAGA ATTCGGACAC CAATGGGAAG AACATGAAAT GGGTAAAATC
CCTTACAAAA CCATAATGGA GCAATGTTAC AAGAATGAAA TAGGTTACCT GGCATGGTCA
TGGGGACCGG GCAACCAGCC GCAGACTTTC CTGGATATGA CCACCGACGG AACTTTTGAT
ACATTGAGAG GATGGGGATT GGAAGTTTGC GTAACGGATC CTTACAGCAT AATGAATATT
GCGGTTCGTC CGGCTTCAAT GCTTGAAGAG CCTACAACGG AGCCTCCGGT AATAAACATT
CCCGCCGGAA GTATTGCGCA GAACAAGCCT GTATATGCAT CTTCGACGGA ACCAGGGCTG
GGTAACACAC CGGAGAAGGC TGTTGACGGA AATATAGCAA CCAGATGGTC ATCGGATTAC
AGCGATAATC AGTACATATA TGTTGACCTT CTGGATGAAT ATGAAATTGA AAGAGTGTAT
ATTGAGTGGG AGGCTGCATA CGCCCGTCAG TACAAAATCC AGGTGTCCAA TGACGCCGTC
ACATGGACCG ATGTATACAC CGAGTACAAC GGAGACGGAG ATATAGACGA TATTTACCTT
GAAGCAAGGG GAAGATATGT AAGAATCTAT TGTATGCAAA GGGCAACTCA GTACGGAAAC
TCCATATTTG AGTTGGGGGT TTATCCAAAG GGTGGAATTG CGGAACCGAC TCCTCCGGGG
GTTGAAATTC TGGTGGGAGA TATAAACAGA GACGGCAAAA TTAATTCGAC GGACTTGGGT
ATGTTGAACA GACATATATT GAAACTTGTA ATTCTTGATG ATAATTTAAA GCTTGCGGCA
GCTGACATTG ACGGAAACGG AAATATAAAT TCCACTGATT ATTCATGGCT GAAAAAATAT
ATATTAAAAG TAATTTCTGA ATTCCCGGGA GGGGATACGA GAATAGTGAC ACCATGA
 
Protein sequence
MKKRVCILLA VVILMTVSLP FVTLGADDIY PGLRVQGRFL YDKYGEKIIL YGVNEMSIWG 
DIDGDVALPE IRKTGANAVR LVWSVSGPAR KLDILLYNCR INNMIPIIEL HDATGEWQKL
SMLVDYWTRP DVLEVLKKHQ EYLIINIGNE VGAQVSESEF KTGYEAAVKR MREAGIHVPL
MIDGSDWGKN IDILQATGPY LINADPDKNL LFSVHMWWPY MWGYNEQKVI DEIKESVEMG
LPLVVGEFGH QWEEHEMGKI PYKTIMEQCY KNEIGYLAWS WGPGNQPQTF LDMTTDGTFD
TLRGWGLEVC VTDPYSIMNI AVRPASMLEE PTTEPPVINI PAGSIAQNKP VYASSTEPGL
GNTPEKAVDG NIATRWSSDY SDNQYIYVDL LDEYEIERVY IEWEAAYARQ YKIQVSNDAV
TWTDVYTEYN GDGDIDDIYL EARGRYVRIY CMQRATQYGN SIFELGVYPK GGIAEPTPPG
VEILVGDINR DGKINSTDLG MLNRHILKLV ILDDNLKLAA ADIDGNGNIN STDYSWLKKY
ILKVISEFPG GDTRIVTP