Gene Cthe_1172 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1172 
Symbol 
ID4810124 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1399482 
End bp1400630 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content42% 
IMG OID640106594 
Producthypothetical protein 
Protein accessionYP_001037597 
Protein GI125973687 
COG category[S] Function unknown 
COG ID[COG2006] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAAAAG TGGCTCTAAT CAGATGTGAA AGTTATGACT ATGATGCCGT CAAATCAGCC 
GTAAAAAGGG GGCTTGACCT TATTGGAGGC CCTCACCGGT TTGCCGCTCC CAATGAAAAA
ATACTCTTAA AACCCAATCT TCTTTCGGCA GACCCGCCGG AAAGATGCAG CACAACGCAC
CCTTCCGTAT TTAAAGCCGT GGCGGAAATA TTCATGGAGG CAGGAATAAC CAATCTTTCC
TACGGCGACT CCCCCGGCAT TCACAAGCCC ATAACCGCGG CGAGAAAAAA CGGAATTGAA
AAAGCTGCAA ATGAGCTTGG AATCAAACTT GCCGATTTCC TGGAAGGAAA GGAAGTGTTC
TTTGAAAACG CAATCCAAAA TAAAAAGTTT ATAATAGCAA ACGGTGTTCT TGAAAGCGAC
GGCATTATCA GTCTGCCCAA GCTAAAAACC CATGGATTTG CCAGAATGAC GGGTTGTGTG
AAAAACCAGT TTGGATGCAT ACCCGGACCT CTGAAAGGAG AATTTCACGT TCGGATTCCC
AGTATAATCG ATTTTTCAAA AATGCTGGTG GATCTTAACG TTTATTTAAA ACCTCGTCTT
TTTGTCATGG ACGGTATCAT AGCAATGGAA GGCAACGGAC CCAGAGGCGG GACTCCCAGA
AAAATAAATG CGATACTTCT TTCCGAAGAT CCAATTGCCC TGGATGCCAC TGTATGCAGA
ATGATAAATT TAAACCCCGA GTTTGTGCCT ACCATAGTAT TTGGAAAAGA AGCCGGCCTT
GGAACTTATG ACGAAAATGA AATTGAAATT CTCGGAGATG ATATTCAAAG CTTCATAACT
TATGACTTTG ATGTAAGAAG AGAACCTGTA AAGCCCTTCA AGCCCGGCGG AGCCATCCAG
TTTTTCAGAA ATTTCATCGT TCCAAAGCCC TACATTTTAA AAAACAAATG TATTAAATGC
GGAGTATGTG TAAATGCGTG TCCGGTAAAA CCAAAAGCAG TAGATTGGCA CAACGGAAAT
AAAAAAGAAC CTCCTACATA TATTTACAAA AGATGTATAA GATGTTACTG CTGTCAGGAA
CTTTGTCCGG AAAGCGCAAT CCACCTCAAG GTTCCTTTTA TTCGCAAATT TTTTTATAAT
CCGAAATAA
 
Protein sequence
MSKVALIRCE SYDYDAVKSA VKRGLDLIGG PHRFAAPNEK ILLKPNLLSA DPPERCSTTH 
PSVFKAVAEI FMEAGITNLS YGDSPGIHKP ITAARKNGIE KAANELGIKL ADFLEGKEVF
FENAIQNKKF IIANGVLESD GIISLPKLKT HGFARMTGCV KNQFGCIPGP LKGEFHVRIP
SIIDFSKMLV DLNVYLKPRL FVMDGIIAME GNGPRGGTPR KINAILLSED PIALDATVCR
MINLNPEFVP TIVFGKEAGL GTYDENEIEI LGDDIQSFIT YDFDVRREPV KPFKPGGAIQ
FFRNFIVPKP YILKNKCIKC GVCVNACPVK PKAVDWHNGN KKEPPTYIYK RCIRCYCCQE
LCPESAIHLK VPFIRKFFYN PK