Gene Cthe_2414 details


Gene Information

Locus tag: Cthe_2414
Symbol:
ID: 4808129
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2882849
End bp: 2884000
Gene Length: 1152 bp
Protein Length: 383 aa
Translation table: 11
GC content: 40%
IMG OID: 640107827
Product: monogalactosyldiacylglycerol synthase
Protein accession: YP_001038809
Protein GI: 125974899
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0707] UDP-N-acetylglucosamine:LPS N-acetylglucosamine transferase
TIGRFAM ID:
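
The coordinate and length fields in this record agree with each other if the start and end positions are read as inclusive, 1-based coordinates. That reading is an assumption here, though the numbers fit it. The sketch below checks the arithmetic; the function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2882849, 2884000)
print(length)                  # 1152
print(protein_length(length))  # 383
```

Both results match the Gene Length and Protein Length fields listed above.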


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.798325
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATGTTC TGTTTTTGTC AATTTCACTG GGGTCGGGGC ACATCAGGGC GGCCGAAGCT 
TTGCAAAAGT TCGTCGTGCA AAAGTACCCC AAATCCAAAA CTTTGATAGT GGATACTTTC
AAATACATAA ATCCTTTGAT TCACACCGTC GTCGTAGACG GATATCTTAA TATTGTCAAA
TATGTACCCG AAATTTACGG TGGGCTTTAC AGAATGTCCG AACATATAAA AAACATAGAC
AGAATGAGCA GGGGTTTTAG CAATCTTTTG ACTCCCAAAA TTCATAGACT GATACAAAGC
TTCAAACCCT CAATTATAGT GTGCACTCAT CCTTTTCCCC TGCAAATGAT TGCACACCTG
AAAAAACATT ACAACCTCGA TGTGCCGTCA ATAGCAATTG TAACTGATTT TGTAAATCAC
CCTTTCTGGT TTCAAAATAA TATTGAGGCC TATATTGTTG CCCATGACTA TATAAAAAGA
GACATGATTG AATGCGGGAT TTCCGAAGAC CGAATATTTA CTTATGGACT TCCCGTGGCA
CCGGAGTTTT TGAAGAGCAT ACCCAAGGAA CAGGCAAGAA AAGAACTGTC GCTGGAAAAC
ACCCTGACTG TGCTTTTAAT GGGTGGAAGC CTCGGAATTG GTGATATTGA AAATACCTTC
AAATCCTTTG CAAAATGCAA AAGGGACATC CAAATTATTG CCGTTGCGGG CAAGAACACA
GCCTTAAAAA AAAGGCTTGA CAGTCTGGCG GCTTCTTTTC CCATGCCGGT CAAAATTTTC
GGTTATACAG ACAGTATTCC CATGCTCATG GACGCATCAG ACTTTATTGT CACCAAACCG
GGAGCAATGA CAATATCCGA AGCTTTGGTA AAAAGGCTTC CCGCACTTAT AATATCTCCA
ATTCCCGGTC AGGAGGAAAG AAATGAACAG TTCCTTGTAA ACAGCGGCAC CGCGGTACGG
ATATATAAAA ATACAAAAAT CGACAGTGTT TTGTGCCAGG TCTATGACAA CAAACTAAGG
TACAAACAAA TGAAAGAAAT AGCCGGAAAT CTCGCCAATC CCGATTCCGG CCGCAATATT
CTAAGTCTCA TTGAAAAACT GGTAAATGAC AACGAAAAAG GGTTATTCAA ATATTCTTTT
AATGCCTTTT AA
 
Protein sequence
MNVLFLSISL GSGHIRAAEA LQKFVVQKYP KSKTLIVDTF KYINPLIHTV VVDGYLNIVK 
YVPEIYGGLY RMSEHIKNID RMSRGFSNLL TPKIHRLIQS FKPSIIVCTH PFPLQMIAHL
KKHYNLDVPS IAIVTDFVNH PFWFQNNIEA YIVAHDYIKR DMIECGISED RIFTYGLPVA
PEFLKSIPKE QARKELSLEN TLTVLLMGGS LGIGDIENTF KSFAKCKRDI QIIAVAGKNT
ALKKRLDSLA ASFPMPVKIF GYTDSIPMLM DASDFIVTKP GAMTISEALV KRLPALIISP
IPGQEERNEQ FLVNSGTAVR IYKNTKIDSV LCQVYDNKLR YKQMKEIAGN LANPDSGRNI
LSLIEKLVND NEKGLFKYSF NAF
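
The protein sequence follows from the gene sequence through translation table 11. That table assigns the same amino acids to codons as the standard code and differs in which start codons it allows. The sketch below is a minimal illustration that ignores alternative start codons, which does not matter here because this gene begins with ATG. The names `CODON_TABLE`, `translate` and `gc_fraction` are hypothetical helpers, not part of any library.

```python
from itertools import product

# Codon-to-amino-acid map in the conventional T, C, A, G ordering.
# Translation table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the display layout."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base groups of the gene sequence above.
print(translate("ATGAATGTTC TGTTTTTGTC AATTTCACTG"))  # MNVLFLSISL
# Last four codons of the gene sequence, including the stop codon.
print(translate("AATGCCTTTT AA"))                     # NAF
```

Passing the full gene sequence to `translate` and `gc_fraction` is a simple way to check the record against its Protein Length and GC content fields.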