Gene Cthe_2650 details


Gene Information

Locus tag: Cthe_2650
Symbol:
ID: 4808961
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 3131748
End bp: 3133571
Gene Length: 1824 bp
Protein Length: 607 aa
Translation table: 11
GC content: 39%
IMG OID: 640108063
Product: polysaccharide biosynthesis protein CapD
Protein accession: YP_001039042
Protein GI: 125975132
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1086] Predicted nucleoside-diphosphate sugar epimerases
TIGRFAM ID:
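
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information record.
start_bp = 3131748
end_bp = 3133571
gene_length_bp = 1824
protein_length_aa = 607

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
codons = span // 3
expected_protein_length = codons - 1

print(span, span % 3, expected_protein_length)  # 1824 0 607
```

Under these assumptions the span matches the listed gene length, divides evenly into codons, and gives the listed protein length.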


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGAATT TGAACAACAG AATATGTCTG GCCGCCGATT GCTTGTTGGT AAACGTGGCT 
TTGTTTGCCG CACTTATGAT AAAGTTTGAT GCAAACATCC CGGAAAATTT GCTTAAAATA
ATACCTTTTA ATTTTCTTAT TACAACTACG GTAAGCATAT TAGTATTTAA TATATTCGGC
ATATATTCAA TGCTATGGAA CTATGCAAGT GTTGAAGAGC TTTTGAAAAT ATTTTGGGCA
ACAATGACAT CGGTAATTGT CCAACTTCTT TTGGCTGCAT ATTTTAATGT GATGTTCCCT
GTTTCGGTTT ACATTACATG CTGGATGATT ACATTTTATT TTATAGGCGG AACAAGATTC
ATCGTCAGAA TAACCAGGAG TATAAAAGCA AAAAAGAGAA AAAACACCCA CAAAAAAAGA
ATAATGATTA TCGGCGCCGG TGATACCGCT TCACTGTTGA TTAAGGAAAC AAAAAACAAT
AAGCGCAGCA TATATGAGCC GGTAGTTGCA ATTGACGACG ACCCTAAAAA GCACAACACC
CAAATAAACG GTGTTCCCAT CATAGGAGGC AGGGATAAAA TCATAGAAGC CGCAAAGGAT
ATGTCTATAG AAGAAATAGT TTTAGCCATG CCGTCGGTTT CAAAAAAAGA AACGCTGGAA
ATTATAGACT TGTGCCAGAA GACAGGCTGC AAATTAAAAG TTTTGCCAAG CGTGTACGGC
ATTGTAAACG GAGAAGTAAG CATCAAGGAA ATAAGAGACG TCACTGTCGA GGACCTTTTG
CCAAGAGATG AAATTTCCCT TTCCATAGAA GAGATATCCG GATATCTTAA AGGTGAAACC
ATTCTGGTTA CCGGGGGAGG AGGTTCCATT GGCTCCGAGC TTTGCAGGCA AATAGCATCT
TATGAGCCTA AAACCCTCCT GGTGTTTGAT ATATACGAAA ATAATGCTTA CGAGCTTCAA
AATGAACTTG TGCAAAAATA TAAAGGCAAT CTGGATATCA AAGTGATAAT AGGATCAATA
CGGGATAAAA AAAGACTGGA TTATGTGTTT TCCCAATACA AGCCGGGCAT AGTGTTTCAT
GCAGCGGCTC ACAAGCATGT CCCGCTTATG GAGTTCAATC CCCAGGAAGC CGTCAAGAAC
AATGTATTCG GCACATTAAA TGTTGCCGAA TGCGCCCATC AATACAACTG TAAAAAATTT
GTGCTAATCT CGACCGACAA AGCGGTAAAC CCAACAAGCA TAATGGGAGC AACAAAAAGA
ATTGCCGAGC TTATAATACA ATATATGAAC AGCATAAGCA AAACAAAATT TTGCGCCGTA
AGATTCGGCA ACGTACTGGA CAGCAACGGA AGCGTCATCC CCCTTTTTAA AAAGCAGATT
GAACAGGGAG GCCCCATAAC AATAACCCAT CCCGAAGTAT CCCGGTATTT TATGACCATT
CCCGAGGCGG TAAGCCTTGT CATTCAATCC GGAGCCATGA TGGAAGGGGG AGAAATATTT
ATACTTGACA TGGGCAAACC GGTTAAAATT ACCGACCTTG CCAGAACGTT AATTTCTTTG
TCGGGGTTAA AGCCCGGTGT TGACATAGAT ATTGAATATA TTGGATTAAG ACCAGGCGAA
AAGCTCCATG AAGAGCTGCT TATTTCAGAG GAAGGAGTCA GCGTAACAAA GAATGACAAA
ATATTTATTG AAAAGAGCAG AATGATTGAC TTTGACAAGT ACATGCAGAG AATAAAAGAG
TTTGAGCTTG ATAATTTGGA TGACAGCGAA AAAGTGATAA ATTTTATCAA AGAATTAGTT
CCCACCTATA AGAAAAATTT GTAA
 
Protein sequence
MKNLNNRICL AADCLLVNVA LFAALMIKFD ANIPENLLKI IPFNFLITTT VSILVFNIFG 
IYSMLWNYAS VEELLKIFWA TMTSVIVQLL LAAYFNVMFP VSVYITCWMI TFYFIGGTRF
IVRITRSIKA KKRKNTHKKR IMIIGAGDTA SLLIKETKNN KRSIYEPVVA IDDDPKKHNT
QINGVPIIGG RDKIIEAAKD MSIEEIVLAM PSVSKKETLE IIDLCQKTGC KLKVLPSVYG
IVNGEVSIKE IRDVTVEDLL PRDEISLSIE EISGYLKGET ILVTGGGGSI GSELCRQIAS
YEPKTLLVFD IYENNAYELQ NELVQKYKGN LDIKVIIGSI RDKKRLDYVF SQYKPGIVFH
AAAHKHVPLM EFNPQEAVKN NVFGTLNVAE CAHQYNCKKF VLISTDKAVN PTSIMGATKR
IAELIIQYMN SISKTKFCAV RFGNVLDSNG SVIPLFKKQI EQGGPITITH PEVSRYFMTI
PEAVSLVIQS GAMMEGGEIF ILDMGKPVKI TDLARTLISL SGLKPGVDID IEYIGLRPGE
KLHEELLISE EGVSVTKNDK IFIEKSRMID FDKYMQRIKE FELDNLDDSE KVINFIKELV
PTYKKNL
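
The relationship between the gene sequence and the protein sequence can be spot-checked by translating codons. The following is a minimal sketch, not the tool used to produce this record: it assumes the usual codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the tables differ in permitted start codons). The `translate` helper is hypothetical, and it is applied only to the first and last blocks of the gene sequence.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; "*" marks a stop codon.
# Assumption: table 11 uses the same codon-to-amino-acid assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks and the final line of the gene sequence.
first_block = "ATGAAGAATT TGAACAACAG AATATGTCTG"
last_block = "CCCACCTATA AGAAAAATTT GTAA"

print(translate(first_block))  # MKNLNNRICL
print(translate(last_block))   # PTYKKNL
```

Both results agree with the start and end of the protein sequence listed above.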