Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1349 |
Symbol | |
ID | 4809489 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 1641943 |
End bp | 1643346 |
Gene Length | 1404 bp |
Protein Length | 467 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640106773 |
Product | undecaprenyl-phosphate galactose phosphotransferase |
Protein accession | YP_001037774 |
Protein GI | 125973864 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2148] Sugar transferases involved in lipopolysaccharide synthesis |
TIGRFAM ID | [TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000163511 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATAGTT CAAACAAACA TGCTTTTGTA AGCATAGGAG AATTGTTTTT AGATATATTT TCCCTCATTT TATCGTTTTT TGTTTCATAC TACATAGCAT CACATCTTAG AGTGCTGCAG CATATAACCG CTTTTGTCTG GGTGTTGCTT TTATATATTC CCATGTGGCT TTCTTGCATG GGATTTTTGG GCATGTACAA CAAAACAACC TTTAATTATT ATGACAGAGT GTTAAGGAAT ATTTTGCTCT CTTCTCTCAT TGCATGCATG TTTGTTGCAT CCTTTATGTT TTTTATAAAA GAAACCATGT TCAGCAGAAC ACTTTATGCC GTTTTTACAC TTACAAGCAT AGCGTTTTTG ATTTTTGAAA GATTCATGTA CATATATTTT GTGAGCAAAC ACCGGAACAA AACAACAACA AACGTGATCT TCGTCGGAGA TCGCAATATA GCATTGAAAT TCATTTATTT CCTCCAGAAA ACAAACATTA CCATAAATGT TGTGGGTTAT GTTAACGTAC ACAAAAACGG CGGCAACGGA ACATTCAACA GTAAAAAAAC CTTGGGATAT ATTGAGGATT TGGAAGAAAT ACTTAAAAAC CATGTGGTCG ACGAAGTAAT TTTTGCTCTT CCGAAAGACT ATGTGGGAGA TGTTGAAAAA TATGTGTGTA TATGTGAGGA AATGGGAATA ACCGTAAGGG TTATTCTGGA TTTATACAAT CTCAAAGTTG CAAAAACTCA TTTCAGCTGC ATGGGTACTC TTCCTATGCT CACCTTCAAT TCGGTAAGCA TCAACCAATT TCAGCTTATG ATTAAAAGGT TAATGGATAT CGTCGGTGCT CTTATCGGGC TTGCCTTCAC GGCAGTTGCT TCGATATTCA TAGTACCGGC CATCAAGCTG ACATCTCCGG GACCGGTGCT GTTTAAGCAA GACAGAGTCG GAATGAACGG AAGAATATTT AAAATATATA AATTCAGAAC AATGTATGTT GATGCGGAAG AGCGAAAAGC GGAGCTTATG GCTCAAAACG AAATCAAAGG CGGTTTAATG TTTAAAATCA AATCAGACCC AAGAGTTACA CCTGTGGGCA GGATACTGAG AAAAACAAGC CTTGATGAGC TTCCCCAGTT CTTTAATGTA CTCAAGGGAG ATATGAGCCT TGTGGGGACA AGACCTCCAA CTGTGGATGA AGTCAAAAAA TATAAAACCT ATCACAGAAG AAGAATAAGC TTCAAGCCGG GTCTTACCGG AATGTGGCAG GTAAGCGGAA GAAGCAACAT TACAGATTTT GAAGAAGTTG TAAGACTTGA TACAAAATAT ATAGATGAAT GGTCAATCTG GCTTGATATA ATTATAATTT TAAAAACCAT CTGGGTAGTT TTGAGAAAAA AAGATGCCTA CTAA
|
Protein sequence | MHSSNKHAFV SIGELFLDIF SLILSFFVSY YIASHLRVLQ HITAFVWVLL LYIPMWLSCM GFLGMYNKTT FNYYDRVLRN ILLSSLIACM FVASFMFFIK ETMFSRTLYA VFTLTSIAFL IFERFMYIYF VSKHRNKTTT NVIFVGDRNI ALKFIYFLQK TNITINVVGY VNVHKNGGNG TFNSKKTLGY IEDLEEILKN HVVDEVIFAL PKDYVGDVEK YVCICEEMGI TVRVILDLYN LKVAKTHFSC MGTLPMLTFN SVSINQFQLM IKRLMDIVGA LIGLAFTAVA SIFIVPAIKL TSPGPVLFKQ DRVGMNGRIF KIYKFRTMYV DAEERKAELM AQNEIKGGLM FKIKSDPRVT PVGRILRKTS LDELPQFFNV LKGDMSLVGT RPPTVDEVKK YKTYHRRRIS FKPGLTGMWQ VSGRSNITDF EEVVRLDTKY IDEWSIWLDI IIILKTIWVV LRKKDAY
|
| |