Gene Cthe_1523 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1523 
Symbol 
ID4810561 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1846955 
End bp1848835 
Gene Length1881 bp 
Protein Length626 aa 
Translation table11 
GC content45% 
IMG OID640106943 
Productglycosyl transferase, group 1 
Protein accessionYP_001037944 
Protein GI125974034 
COG category[I] Lipid transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0204] 1-acyl-sn-glycerol-3-phosphate acyltransferase
[COG0438] Glycosyltransferase 
TIGRFAM ID[TIGR00530] 1-acyl-sn-glycerol-3-phosphate acyltransferases 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTTATTTT TGTTGACGGT TTTCTTTTTT GTATTTGAAA AATTAAAAGT AAATGCGGCA 
AGAAAAGAGA GAGGAGTCAA ATTTCTGATG ATTATTACAC TGGTAAATGA CACCTTTAAC
ATAAACAACA ACGGGACCAC CATTTCTGCA ATGCGTTTTG CCGAGGCACT GTCACAACGC
GGCCACCAAA TCCGCATAAT TACATGCGGT GATCCTTTAA AAAGCGGCAA AGACCCTGAT
ACCGGTTTCG AAATGTTTTA TCTGCCGGAA CTCAAAATCC CCATTGCAAG CAGGCTGGCC
CACAAGCAGA ATACACTGTT TGCAAAGCCG GTTCGCTCCA TTTTGAAAAA AGCAATCTCA
GGGTCCGATG TTGTGCATAT ATATCAACCC TGGCCGCTTG GAAGCGCAGC CCAAAGAGTT
GCCAGGCAAA TGAACATCCC TGCAATCGCA GCTTTTCACA TACAACCTGA AAACATTACC
TTCAATATAG GTCTTAAGCG GTTTTCTCCG GCTGCCCATT TGACATATTT TTTGTTCTAC
CTGTTTTTCT ATCGCAGATT TTCACATATC CACTGCCCGT CAAAATTTAT TGCCGCGCAG
CTCAGGAGCC ACGGATACAA AGCACGGTTG CACGTCATCT CAAACGGCGT CCATCCGGCA
TTTTGTGCTC CCGCAAAGCC CAGGGAACAT ACTTTCAAAC CAATTAAGAT ACTTATGATT
GGCAGGCTTT CTCCCGAAAA AAGGCAGGAT GTTCTGATTC GTGCCGTCAT GAAATCCCGT
TATGCCGATC GTATTCAGCT GTATTTTGCC GGAAGCGGCC CCTGGGAGAA GAAACTTCGC
CGTCTTGGAA ACAAACTCCC CAATCCTCCT GTGTTTGGGT ATTACAATCG TGACGAGCTG
ATTAAGCTCA TACACGAATG CGACTTGTAT GTACACGCCT CAGATGCGGA AATTGAAGGC
ATCTCATTAA TTGAGGCGTT CGCATGCGGG CTGGTTCCGA TAATCTCCGA CAGCAAACAG
AGTGCCGCGG CGCAGTTTGC ACTCGGTCCC CAGAATCTTT TCAAAGCAGG GTCCCCTGAA
TCATTGGCGG AAAAAATCGA TTACTGGCTG GACCATCCGG AACAGCTGAA AGAAGCTGAA
AAGAAATATG CTCAATTAGG AAAGCAATAC GCCCTGGAAC ACAGTATCAG AAAAATAGAA
AAAGTATATT CATCCATGAC AAAAAATCAT AAAAATGAAT ACCATCGCAG TATTTTTTTC
AGACTATCCA CCCGCTTGTT CCAAATTGTA ATAGCCTGTC CCATCCTGCT GCTGTGGACA
CGTTTTGTTT TAGGTGCCAA AGTCTATGGC AGGGAAAATA TCCGTGGCCT CAAAAGTGGG
GTTACGGTAT GCAACCATGT CCACCTGCTG GACAGCGCTT TAATTGGCGT AACGTTTTTC
CCACGCAGGG TTGTTTTTCC CACACTCACC CAGAACGTAA AAACGCTCTG GCCGGGCAAG
CTTGTGCGAA TACTTGGCGG GTTTGCCATA CCTGATAATA TTATGGAGCT CAAAGCCTTT
TTTGACGAGA TGGAGTTTCT TTTGATGAAA AACTGTATCG TGCATTTTTT TCCCGAAGGG
GAATTAAGAC CCTATGATAC CGGTTTGCAA AACTTCAAAA AAGGGGCATT TTATCTTGCG
GCACAGGCTC AAGTGCCAAT TGTCCCTATG TTAATCACCT TTGAACCTCC AAAAGGACTG
ATAAAAATCA TACGAAAAAA GCCGGTTATG CGTCTTCATA TAGGAAAGCC AATACACCCG
ATGTCCAAGG ATATCGAAAT CGACTCAGAA CTTAGAATGA AAGCGGTCTG CAAAAAAATA
GAAGCCATTA CTTCCGTGTA A
 
Protein sequence
MLFLLTVFFF VFEKLKVNAA RKERGVKFLM IITLVNDTFN INNNGTTISA MRFAEALSQR 
GHQIRIITCG DPLKSGKDPD TGFEMFYLPE LKIPIASRLA HKQNTLFAKP VRSILKKAIS
GSDVVHIYQP WPLGSAAQRV ARQMNIPAIA AFHIQPENIT FNIGLKRFSP AAHLTYFLFY
LFFYRRFSHI HCPSKFIAAQ LRSHGYKARL HVISNGVHPA FCAPAKPREH TFKPIKILMI
GRLSPEKRQD VLIRAVMKSR YADRIQLYFA GSGPWEKKLR RLGNKLPNPP VFGYYNRDEL
IKLIHECDLY VHASDAEIEG ISLIEAFACG LVPIISDSKQ SAAAQFALGP QNLFKAGSPE
SLAEKIDYWL DHPEQLKEAE KKYAQLGKQY ALEHSIRKIE KVYSSMTKNH KNEYHRSIFF
RLSTRLFQIV IACPILLLWT RFVLGAKVYG RENIRGLKSG VTVCNHVHLL DSALIGVTFF
PRRVVFPTLT QNVKTLWPGK LVRILGGFAI PDNIMELKAF FDEMEFLLMK NCIVHFFPEG
ELRPYDTGLQ NFKKGAFYLA AQAQVPIVPM LITFEPPKGL IKIIRKKPVM RLHIGKPIHP
MSKDIEIDSE LRMKAVCKKI EAITSV