Gene Cthe_1832 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1832 
Symbol 
ID4809816 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2172278 
End bp2173807 
Gene Length1530 bp 
Protein Length509 aa 
Translation table11 
GC content33% 
IMG OID640107246 
Producthypothetical protein 
Protein accessionYP_001038246 
Protein GI125974336 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAA TATTATGTTG TATCATATCG TTATTTGTTT TAGCATCGTA TTTGAGTACA 
TATACATATG CAATAAGCTA TAATAGTGCA AAAGAAGCAA TAGATGATGC TAATAATTTT
TTATTAGAAA AAATGGGCTA TGAGAATTAC TATTCATTAG AAGTAAATGG TATGAATATA
AATGATAAAC TTGCACAATA TGGTTTGGAT GTGTTTTCAA ACAGGCCAGT ATTTGTATAC
GGTGATAATG TAGAAGCTAG TAAAAAGACA ACGACAGCAG GGCGAGATAT GGTTAAAAAA
GTTAATGGTA AGGACGAATA CCGTGCTTTA GGGTATGCAG TGGATGGTTC TGTTTTTCCA
AATCCCTCTT TTCCATATGA TAATGAGGGT CATGCTGCAA AAGATAAGAT GTGGGTTAAA
GAACCATGGA ATGGTTCAAA AGTAAAATAT CTATATAGCG AGAATGGAAA TATTGTAAAG
AGAACGTTAA CGGATAATGC TTTTCAGTAT ATAGAAAAAT GGATCAAGTT TACCAGTTTT
AAACCTCATG AAGTTGAAGC TTGTACAGGT AAGAAAAACT ATTTTGTACA AAATGCAGTT
GATGTACCGG AAGGATTAAA GGAAAATTTT GAAGACTTTT TATATATAAT ACAACCTCCA
ACAGAACATG CGTGGGGACT CGGTATAGCA TTTTACTACT GGAATGGATT TAATAATCTC
AACTATAGAT CTTTTCTCAT TAGGCCGTTT GATATGAATG ATGATTTGGA TGTTAGTTTC
CATGTAATAC CAGATAGTTC AACCGAAGGC AACGAAGTAT TGGTTGGTGT AAAAGTTAAA
TCTCACTTCG ATACAGACTT AGAAGGAGTT AAATTTAGGT GGAGTATTAC TACAAAAAAC
AGCGATGGTC AAGATGTTCC GTTGGATGCT GATGCTTATG AACTTGAATT TGGAGGTTCG
TCAACCAGTC AGAGCGGGAC TATAAATATA TCAGCAGAAG ACAAGGAAGC ATGTTTATAT
GCTGGGTTTA GAATGCCCAA TACTGATGTA TATATAGAAT TTGCAATTAA TGAAGACGGA
GAAAATCCTT TAGAAAATGA TTTGAAAAAT AATATTGTTT CTACAGTGGT TAAAGCCGAA
AAGCCTATAA ATTCTACTCT TAGGAAATTT GATTTACCAT ACTATGCATT ATCGAGAGAA
ATAAGCTATC CATTAGCTGA TTCAGATATT GTATTTAATC TAAACAATAT TAATGGTGAT
TGGCTGGATG GAAGTGCCAG AATAGATAAA TTGAATGTTA ATGTAAATGC AGGATTTTTG
CATAATTATC AAGTTGGAAG TTCGAGGATT GAAGATAACG AAAATACAAT AACTGTAAGT
TTGCCAAGCG TGAAAGCAAA GGTCGAAAGG AAAGATTTTG GAGATAATCC TGGAGAGAAG
AAGTGGTTGG TCAGCAACAA CACTGTTGAT GTAATAAAAA GAATTCTAGA TACATCTTAT
TATCTTTCTG TATCTAAAAA ATATAGATAA
 
Protein sequence
MKKILCCIIS LFVLASYLST YTYAISYNSA KEAIDDANNF LLEKMGYENY YSLEVNGMNI 
NDKLAQYGLD VFSNRPVFVY GDNVEASKKT TTAGRDMVKK VNGKDEYRAL GYAVDGSVFP
NPSFPYDNEG HAAKDKMWVK EPWNGSKVKY LYSENGNIVK RTLTDNAFQY IEKWIKFTSF
KPHEVEACTG KKNYFVQNAV DVPEGLKENF EDFLYIIQPP TEHAWGLGIA FYYWNGFNNL
NYRSFLIRPF DMNDDLDVSF HVIPDSSTEG NEVLVGVKVK SHFDTDLEGV KFRWSITTKN
SDGQDVPLDA DAYELEFGGS STSQSGTINI SAEDKEACLY AGFRMPNTDV YIEFAINEDG
ENPLENDLKN NIVSTVVKAE KPINSTLRKF DLPYYALSRE ISYPLADSDI VFNLNNINGD
WLDGSARIDK LNVNVNAGFL HNYQVGSSRI EDNENTITVS LPSVKAKVER KDFGDNPGEK
KWLVSNNTVD VIKRILDTSY YLSVSKKYR