Gene Cthe_3114 details


Gene Information

Locus tag: Cthe_3114
Symbol:
ID: 4809677
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 3674971
End bp: 3676146
Gene Length: 1176 bp
Protein Length: 391 aa
Translation table: 11
GC content: 41%
IMG OID: 640108547
Product: glycosyl transferase, group 1
Protein accession: YP_001039502
Protein GI: 125975592
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
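
The coordinate and length fields in this record are related by simple arithmetic, which the short sketch below checks. It assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the protein length excludes the stop codon; neither convention is stated in the record. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Cthe_3114.
length_bp = gene_length(3674971, 3676146)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the computed values agree with the Gene Length (1176 bp) and Protein Length (391 aa) fields.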


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTTCGA ACAGAATTCG GAATGTAGCG TTTCTGAGTA CTTATCCGCC AAGGGAGTGC 
GGACTTGCAA CTTTTACCGA TGATTTGGTA AGGGAGCTGG ACAAGGTTGA ACTTATAAAC
AACCCAAAAG TTATTGCTGT AAGTGATAAT GATTACAGTT ACGGCAGCAG GGTGATTATG
GAGCTTAAGC AGCACGAAAG GGAAAGCTAT ACAAAGATTG CTGAAGAGAT TAACAACTCG
GATATTGAAC TTCTTGTTAT AGAGCATGAG TACGGTATAT TCGGAGGAGA AGACGGGGAA
TATATTCTGG ATCTTGCGGA AAAAATTCAA ATTCCCTTTA TTCTCACGGT GCATACCGTA
CTTCCCAGTC CCAAGGAAAA ACAAAAGAAA ATACTTGAAG TGCTGGGAGA AAAGAGCGCA
AGGGTAGTTA CCATGGCTAA AAATACGATA CCTATACTTG AAAAAGTATA TGGTATTGAC
CCGGCAAAGA TTGAAGTAAT ACACCATGGT GTACCGTATA AAATTCTTGA ACCCAGAGAA
AAGCTAAAGA AAAAATTCGG GCTTGAAAAC CGCACTGTAA TAAGTACTTT TGGGCTGATA
AGTCCGGGCA AAGGTTTGGA ATACGGAATT GAAGCTGTTG CAAAGCTGGC AAAGAAGTAC
AAAGATATTG TTTACCTGAT TCTTGGACAG ACACATCCTT GTGTAAAAAG GGAGTTTGGC
GAGGTTTACA GGGAAAAACT TGTGCAAATG GTTGAAGAAC TTGGTGTAAA AGAGCATGTA
TGGTTTGTAG ACAAATATCT TACCAGGGAT GAAATTATGA ACTATTTGCA GCTATCGGAT
ATCTACATGA CGCCGTATCT CGGAAAAGAC CAGGCGGTAA GCGGTACTTT GGCTTATGCG
GTAGGATACG GCAGAGTAAT TATATCTACT CCGTACAGCT ATGCCAAGGA AATGCTCGCA
GAGGGAAGAG GACTTTTGGC AGAGTTTGAG GATGCAGATT CTTTGGCAAA ACATATTGAA
TATGTTCTGG ACAATCCCGA GGCAAAGAAA GAGATGGAGA GGCGAACATT AAGTCTTGGA
AGAACCATGA TGTGGGAAAA TGTGGCAAGT TGCTATTCCA GGCTTTTTAT CGACACTCTT
GAAGAAACAA AGCTCTCGGG GAGTATGATA GGATGA
 
Protein sequence
MASNRIRNVA FLSTYPPREC GLATFTDDLV RELDKVELIN NPKVIAVSDN DYSYGSRVIM 
ELKQHERESY TKIAEEINNS DIELLVIEHE YGIFGGEDGE YILDLAEKIQ IPFILTVHTV
LPSPKEKQKK ILEVLGEKSA RVVTMAKNTI PILEKVYGID PAKIEVIHHG VPYKILEPRE
KLKKKFGLEN RTVISTFGLI SPGKGLEYGI EAVAKLAKKY KDIVYLILGQ THPCVKREFG
EVYREKLVQM VEELGVKEHV WFVDKYLTRD EIMNYLQLSD IYMTPYLGKD QAVSGTLAYA
VGYGRVIIST PYSYAKEMLA EGRGLLAEFE DADSLAKHIE YVLDNPEAKK EMERRTLSLG
RTMMWENVAS CYSRLFIDTL EETKLSGSMI G
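
The gene and protein sequences above can be cross-checked with a minimal sketch that strips the 10-base grouping, computes the GC fraction, and translates the coding sequence. The codon table is built by hand from the standard codon assignments, which translation table 11 shares (table 11 differs in its set of permitted start codons, which does not matter here because the gene begins with ATG). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical and not part of any database tool.

```python
from itertools import product

# Codon assignments in TCAG order; shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a grouped sequence and uppercase it."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last codons of the gene sequence listed above.
print(translate("ATGGCTTCGA ACAGAATTCG G"))
print(translate("AGTATGATA GGATGA"))
```

Pasting the full gene sequence into `translate` and comparing the result with the cleaned protein sequence is one way to confirm that the two listings are consistent; `gc_fraction` on the same input can be compared with the GC content field.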