Gene Cthe_1349 details

Gene Information

Locus tag: Cthe_1349
Symbol:
ID: 4809489
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1641943
End bp: 1643346
Gene Length: 1404 bp
Protein Length: 467 aa
Translation table: 11
GC content: 37%
IMG OID: 640106773
Product: undecaprenyl-phosphate galactose phosphotransferase
Protein accession: YP_001037774
Protein GI: 125973864
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2148] Sugar transferases involved in lipopolysaccharide synthesis
TIGRFAM ID: [TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase
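
The start, end, gene length and protein length fields can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record: the function name check_gene_lengths is hypothetical, and it assumes 1-based inclusive coordinates and a single stop codon that is counted in the gene length but not in the protein length.

```python
def check_gene_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the CDS ends with one stop codon that is not translated.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


gene_bp, protein_aa = check_gene_lengths(1641943, 1643346)
print(gene_bp, protein_aa)  # 1404 467
```

Under these assumptions the computed values agree with the Gene Length (1404 bp) and Protein Length (467 aa) fields.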


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000163511
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCATAGTT CAAACAAACA TGCTTTTGTA AGCATAGGAG AATTGTTTTT AGATATATTT 
TCCCTCATTT TATCGTTTTT TGTTTCATAC TACATAGCAT CACATCTTAG AGTGCTGCAG
CATATAACCG CTTTTGTCTG GGTGTTGCTT TTATATATTC CCATGTGGCT TTCTTGCATG
GGATTTTTGG GCATGTACAA CAAAACAACC TTTAATTATT ATGACAGAGT GTTAAGGAAT
ATTTTGCTCT CTTCTCTCAT TGCATGCATG TTTGTTGCAT CCTTTATGTT TTTTATAAAA
GAAACCATGT TCAGCAGAAC ACTTTATGCC GTTTTTACAC TTACAAGCAT AGCGTTTTTG
ATTTTTGAAA GATTCATGTA CATATATTTT GTGAGCAAAC ACCGGAACAA AACAACAACA
AACGTGATCT TCGTCGGAGA TCGCAATATA GCATTGAAAT TCATTTATTT CCTCCAGAAA
ACAAACATTA CCATAAATGT TGTGGGTTAT GTTAACGTAC ACAAAAACGG CGGCAACGGA
ACATTCAACA GTAAAAAAAC CTTGGGATAT ATTGAGGATT TGGAAGAAAT ACTTAAAAAC
CATGTGGTCG ACGAAGTAAT TTTTGCTCTT CCGAAAGACT ATGTGGGAGA TGTTGAAAAA
TATGTGTGTA TATGTGAGGA AATGGGAATA ACCGTAAGGG TTATTCTGGA TTTATACAAT
CTCAAAGTTG CAAAAACTCA TTTCAGCTGC ATGGGTACTC TTCCTATGCT CACCTTCAAT
TCGGTAAGCA TCAACCAATT TCAGCTTATG ATTAAAAGGT TAATGGATAT CGTCGGTGCT
CTTATCGGGC TTGCCTTCAC GGCAGTTGCT TCGATATTCA TAGTACCGGC CATCAAGCTG
ACATCTCCGG GACCGGTGCT GTTTAAGCAA GACAGAGTCG GAATGAACGG AAGAATATTT
AAAATATATA AATTCAGAAC AATGTATGTT GATGCGGAAG AGCGAAAAGC GGAGCTTATG
GCTCAAAACG AAATCAAAGG CGGTTTAATG TTTAAAATCA AATCAGACCC AAGAGTTACA
CCTGTGGGCA GGATACTGAG AAAAACAAGC CTTGATGAGC TTCCCCAGTT CTTTAATGTA
CTCAAGGGAG ATATGAGCCT TGTGGGGACA AGACCTCCAA CTGTGGATGA AGTCAAAAAA
TATAAAACCT ATCACAGAAG AAGAATAAGC TTCAAGCCGG GTCTTACCGG AATGTGGCAG
GTAAGCGGAA GAAGCAACAT TACAGATTTT GAAGAAGTTG TAAGACTTGA TACAAAATAT
ATAGATGAAT GGTCAATCTG GCTTGATATA ATTATAATTT TAAAAACCAT CTGGGTAGTT
TTGAGAAAAA AAGATGCCTA CTAA
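
The sequence is printed in blocks of 10 bases, so the whitespace has to be removed before the length or GC content can be recomputed. The following is a minimal sketch of that step: the function name gc_stats is hypothetical, and only the first line of the sequence is used here as sample input.

```python
def gc_stats(formatted_seq: str) -> tuple[int, float]:
    """Return (length in bp, GC content in percent) for a nucleotide string.

    Whitespace (the 10-base block separators and line breaks) is ignored.
    """
    seq = "".join(formatted_seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return len(seq), 100.0 * gc_count / len(seq)


# First line of the gene sequence only; pass the full sequence to compare
# the result with the Gene Length and GC content fields of the record.
first_line = "ATGCATAGTT CAAACAAACA TGCTTTTGTA AGCATAGGAG AATTGTTTTT AGATATATTT"
length_bp, gc_percent = gc_stats(first_line)
print(length_bp, round(gc_percent, 1))
```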
 
Protein sequence
MHSSNKHAFV SIGELFLDIF SLILSFFVSY YIASHLRVLQ HITAFVWVLL LYIPMWLSCM 
GFLGMYNKTT FNYYDRVLRN ILLSSLIACM FVASFMFFIK ETMFSRTLYA VFTLTSIAFL
IFERFMYIYF VSKHRNKTTT NVIFVGDRNI ALKFIYFLQK TNITINVVGY VNVHKNGGNG
TFNSKKTLGY IEDLEEILKN HVVDEVIFAL PKDYVGDVEK YVCICEEMGI TVRVILDLYN
LKVAKTHFSC MGTLPMLTFN SVSINQFQLM IKRLMDIVGA LIGLAFTAVA SIFIVPAIKL
TSPGPVLFKQ DRVGMNGRIF KIYKFRTMYV DAEERKAELM AQNEIKGGLM FKIKSDPRVT
PVGRILRKTS LDELPQFFNV LKGDMSLVGT RPPTVDEVKK YKTYHRRRIS FKPGLTGMWQ
VSGRSNITDF EEVVRLDTKY IDEWSIWLDI IIILKTIWVV LRKKDAY
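
The protein sequence can be reproduced from the gene sequence by codon translation. Translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which codons may act as start codons), so a standard codon table is enough for a gene that starts with ATG, as this one does. The following is a minimal pure-Python sketch: the names CODON_TABLE and translate_cds are hypothetical, and alternative start codons are not handled.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(formatted_seq: str) -> str:
    """Translate a CDS up to (not including) the first stop codon.

    Whitespace is ignored; codons with unexpected characters become 'X'.
    Assumes the sequence starts in frame with an ATG start codon.
    """
    seq = "".join(formatted_seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate_cds("ATGCATAGTT CAAACAAACA TGCTTTTGTA"))  # MHSSNKHAFV
```

Passing the full gene sequence instead of the first 30 bases allows a comparison with the complete protein sequence listed above.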