Gene Information |
Locus tag | Cthe_2650 |
Symbol | |
ID | 4808961 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 3131748 |
End bp | 3133571 |
Gene Length | 1824 bp |
Protein Length | 607 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640108063 |
Product | polysaccharide biosynthesis protein CapD |
Protein accession | YP_001039042 |
Protein GI | 125975132 |
COG category | [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1086] Predicted nucleoside-diphosphate sugar epimerases |
TIGRFAM ID | |
|
|
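The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that Gene Length counts the terminal stop codon; the record's own numbers are consistent with both assumptions, but neither is stated on the page. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # the stop codon encodes no residue


# Values copied from the Gene Information table for Cthe_2650.
length = gene_length_bp(3131748, 3133571)
print(length)                     # 1824
print(protein_length_aa(length))  # 607
```

Because the gene lies on the minus strand, the Start bp and End bp values give its extent on the replicon, not the direction of transcription.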
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAATT TGAACAACAG AATATGTCTG GCCGCCGATT GCTTGTTGGT AAACGTGGCT TTGTTTGCCG CACTTATGAT AAAGTTTGAT GCAAACATCC CGGAAAATTT GCTTAAAATA ATACCTTTTA ATTTTCTTAT TACAACTACG GTAAGCATAT TAGTATTTAA TATATTCGGC ATATATTCAA TGCTATGGAA CTATGCAAGT GTTGAAGAGC TTTTGAAAAT ATTTTGGGCA ACAATGACAT CGGTAATTGT CCAACTTCTT TTGGCTGCAT ATTTTAATGT GATGTTCCCT GTTTCGGTTT ACATTACATG CTGGATGATT ACATTTTATT TTATAGGCGG AACAAGATTC ATCGTCAGAA TAACCAGGAG TATAAAAGCA AAAAAGAGAA AAAACACCCA CAAAAAAAGA ATAATGATTA TCGGCGCCGG TGATACCGCT TCACTGTTGA TTAAGGAAAC AAAAAACAAT AAGCGCAGCA TATATGAGCC GGTAGTTGCA ATTGACGACG ACCCTAAAAA GCACAACACC CAAATAAACG GTGTTCCCAT CATAGGAGGC AGGGATAAAA TCATAGAAGC CGCAAAGGAT ATGTCTATAG AAGAAATAGT TTTAGCCATG CCGTCGGTTT CAAAAAAAGA AACGCTGGAA ATTATAGACT TGTGCCAGAA GACAGGCTGC AAATTAAAAG TTTTGCCAAG CGTGTACGGC ATTGTAAACG GAGAAGTAAG CATCAAGGAA ATAAGAGACG TCACTGTCGA GGACCTTTTG CCAAGAGATG AAATTTCCCT TTCCATAGAA GAGATATCCG GATATCTTAA AGGTGAAACC ATTCTGGTTA CCGGGGGAGG AGGTTCCATT GGCTCCGAGC TTTGCAGGCA AATAGCATCT TATGAGCCTA AAACCCTCCT GGTGTTTGAT ATATACGAAA ATAATGCTTA CGAGCTTCAA AATGAACTTG TGCAAAAATA TAAAGGCAAT CTGGATATCA AAGTGATAAT AGGATCAATA CGGGATAAAA AAAGACTGGA TTATGTGTTT TCCCAATACA AGCCGGGCAT AGTGTTTCAT GCAGCGGCTC ACAAGCATGT CCCGCTTATG GAGTTCAATC CCCAGGAAGC CGTCAAGAAC AATGTATTCG GCACATTAAA TGTTGCCGAA TGCGCCCATC AATACAACTG TAAAAAATTT GTGCTAATCT CGACCGACAA AGCGGTAAAC CCAACAAGCA TAATGGGAGC AACAAAAAGA ATTGCCGAGC TTATAATACA ATATATGAAC AGCATAAGCA AAACAAAATT TTGCGCCGTA AGATTCGGCA ACGTACTGGA CAGCAACGGA AGCGTCATCC CCCTTTTTAA AAAGCAGATT GAACAGGGAG GCCCCATAAC AATAACCCAT CCCGAAGTAT CCCGGTATTT TATGACCATT CCCGAGGCGG TAAGCCTTGT CATTCAATCC GGAGCCATGA TGGAAGGGGG AGAAATATTT ATACTTGACA TGGGCAAACC GGTTAAAATT ACCGACCTTG CCAGAACGTT AATTTCTTTG TCGGGGTTAA AGCCCGGTGT TGACATAGAT ATTGAATATA TTGGATTAAG ACCAGGCGAA AAGCTCCATG AAGAGCTGCT TATTTCAGAG GAAGGAGTCA GCGTAACAAA GAATGACAAA ATATTTATTG AAAAGAGCAG AATGATTGAC TTTGACAAGT ACATGCAGAG AATAAAAGAG TTTGAGCTTG ATAATTTGGA TGACAGCGAA AAAGTGATAA ATTTTATCAA AGAATTAGTT 
CCCACCTATA AGAAAAATTT GTAA
|
Protein sequence | MKNLNNRICL AADCLLVNVA LFAALMIKFD ANIPENLLKI IPFNFLITTT VSILVFNIFG IYSMLWNYAS VEELLKIFWA TMTSVIVQLL LAAYFNVMFP VSVYITCWMI TFYFIGGTRF IVRITRSIKA KKRKNTHKKR IMIIGAGDTA SLLIKETKNN KRSIYEPVVA IDDDPKKHNT QINGVPIIGG RDKIIEAAKD MSIEEIVLAM PSVSKKETLE IIDLCQKTGC KLKVLPSVYG IVNGEVSIKE IRDVTVEDLL PRDEISLSIE EISGYLKGET ILVTGGGGSI GSELCRQIAS YEPKTLLVFD IYENNAYELQ NELVQKYKGN LDIKVIIGSI RDKKRLDYVF SQYKPGIVFH AAAHKHVPLM EFNPQEAVKN NVFGTLNVAE CAHQYNCKKF VLISTDKAVN PTSIMGATKR IAELIIQYMN SISKTKFCAV RFGNVLDSNG SVIPLFKKQI EQGGPITITH PEVSRYFMTI PEAVSLVIQS GAMMEGGEIF ILDMGKPVKI TDLARTLISL SGLKPGVDID IEYIGLRPGE KLHEELLISE EGVSVTKNDK IFIEKSRMID FDKYMQRIKE FELDNLDDSE KVINFIKELV PTYKKNL
|
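The Sequence fields are displayed in space-separated blocks of ten characters, so they need cleaning before any analysis. The following is a minimal sketch, not the database's own method: it strips the whitespace, computes the GC fraction, and translates the coding sequence. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in its permitted start codons. Alternative start codons are not handled here, which is adequate for this gene because it begins with ATG. All function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first base varies slowest, T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display, and uppercase."""
    return "".join(raw.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three display blocks of the gene sequence in this record.
prefix = clean_sequence("ATGAAGAATT TGAACAACAG AATATGTCTG")
print(translate(prefix))  # MKNLNNRICL
```

The translated prefix matches the first ten residues of the Protein sequence field. Applying `gc_fraction` to the full cleaned gene sequence gives a value that can be compared with the GC content field.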
| |