Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1523 |
Symbol | |
ID | 4810561 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 1846955 |
End bp | 1848835 |
Gene Length | 1881 bp |
Protein Length | 626 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640106943 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_001037944 |
Protein GI | 125974034 |
COG category | [I] Lipid transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0204] 1-acyl-sn-glycerol-3-phosphate acyltransferase [COG0438] Glycosyltransferase |
TIGRFAM ID | [TIGR00530] 1-acyl-sn-glycerol-3-phosphate acyltransferases |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTTATTTT TGTTGACGGT TTTCTTTTTT GTATTTGAAA AATTAAAAGT AAATGCGGCA AGAAAAGAGA GAGGAGTCAA ATTTCTGATG ATTATTACAC TGGTAAATGA CACCTTTAAC ATAAACAACA ACGGGACCAC CATTTCTGCA ATGCGTTTTG CCGAGGCACT GTCACAACGC GGCCACCAAA TCCGCATAAT TACATGCGGT GATCCTTTAA AAAGCGGCAA AGACCCTGAT ACCGGTTTCG AAATGTTTTA TCTGCCGGAA CTCAAAATCC CCATTGCAAG CAGGCTGGCC CACAAGCAGA ATACACTGTT TGCAAAGCCG GTTCGCTCCA TTTTGAAAAA AGCAATCTCA GGGTCCGATG TTGTGCATAT ATATCAACCC TGGCCGCTTG GAAGCGCAGC CCAAAGAGTT GCCAGGCAAA TGAACATCCC TGCAATCGCA GCTTTTCACA TACAACCTGA AAACATTACC TTCAATATAG GTCTTAAGCG GTTTTCTCCG GCTGCCCATT TGACATATTT TTTGTTCTAC CTGTTTTTCT ATCGCAGATT TTCACATATC CACTGCCCGT CAAAATTTAT TGCCGCGCAG CTCAGGAGCC ACGGATACAA AGCACGGTTG CACGTCATCT CAAACGGCGT CCATCCGGCA TTTTGTGCTC CCGCAAAGCC CAGGGAACAT ACTTTCAAAC CAATTAAGAT ACTTATGATT GGCAGGCTTT CTCCCGAAAA AAGGCAGGAT GTTCTGATTC GTGCCGTCAT GAAATCCCGT TATGCCGATC GTATTCAGCT GTATTTTGCC GGAAGCGGCC CCTGGGAGAA GAAACTTCGC CGTCTTGGAA ACAAACTCCC CAATCCTCCT GTGTTTGGGT ATTACAATCG TGACGAGCTG ATTAAGCTCA TACACGAATG CGACTTGTAT GTACACGCCT CAGATGCGGA AATTGAAGGC ATCTCATTAA TTGAGGCGTT CGCATGCGGG CTGGTTCCGA TAATCTCCGA CAGCAAACAG AGTGCCGCGG CGCAGTTTGC ACTCGGTCCC CAGAATCTTT TCAAAGCAGG GTCCCCTGAA TCATTGGCGG AAAAAATCGA TTACTGGCTG GACCATCCGG AACAGCTGAA AGAAGCTGAA AAGAAATATG CTCAATTAGG AAAGCAATAC GCCCTGGAAC ACAGTATCAG AAAAATAGAA AAAGTATATT CATCCATGAC AAAAAATCAT AAAAATGAAT ACCATCGCAG TATTTTTTTC AGACTATCCA CCCGCTTGTT CCAAATTGTA ATAGCCTGTC CCATCCTGCT GCTGTGGACA CGTTTTGTTT TAGGTGCCAA AGTCTATGGC AGGGAAAATA TCCGTGGCCT CAAAAGTGGG GTTACGGTAT GCAACCATGT CCACCTGCTG GACAGCGCTT TAATTGGCGT AACGTTTTTC CCACGCAGGG TTGTTTTTCC CACACTCACC CAGAACGTAA AAACGCTCTG GCCGGGCAAG CTTGTGCGAA TACTTGGCGG GTTTGCCATA CCTGATAATA TTATGGAGCT CAAAGCCTTT TTTGACGAGA TGGAGTTTCT TTTGATGAAA AACTGTATCG TGCATTTTTT TCCCGAAGGG GAATTAAGAC CCTATGATAC CGGTTTGCAA AACTTCAAAA AAGGGGCATT TTATCTTGCG GCACAGGCTC AAGTGCCAAT TGTCCCTATG TTAATCACCT TTGAACCTCC AAAAGGACTG ATAAAAATCA TACGAAAAAA GCCGGTTATG CGTCTTCATA TAGGAAAGCC AATACACCCG ATGTCCAAGG ATATCGAAAT CGACTCAGAA CTTAGAATGA AAGCGGTCTG CAAAAAAATA GAAGCCATTA CTTCCGTGTA A
|
Protein sequence | MLFLLTVFFF VFEKLKVNAA RKERGVKFLM IITLVNDTFN INNNGTTISA MRFAEALSQR GHQIRIITCG DPLKSGKDPD TGFEMFYLPE LKIPIASRLA HKQNTLFAKP VRSILKKAIS GSDVVHIYQP WPLGSAAQRV ARQMNIPAIA AFHIQPENIT FNIGLKRFSP AAHLTYFLFY LFFYRRFSHI HCPSKFIAAQ LRSHGYKARL HVISNGVHPA FCAPAKPREH TFKPIKILMI GRLSPEKRQD VLIRAVMKSR YADRIQLYFA GSGPWEKKLR RLGNKLPNPP VFGYYNRDEL IKLIHECDLY VHASDAEIEG ISLIEAFACG LVPIISDSKQ SAAAQFALGP QNLFKAGSPE SLAEKIDYWL DHPEQLKEAE KKYAQLGKQY ALEHSIRKIE KVYSSMTKNH KNEYHRSIFF RLSTRLFQIV IACPILLLWT RFVLGAKVYG RENIRGLKSG VTVCNHVHLL DSALIGVTFF PRRVVFPTLT QNVKTLWPGK LVRILGGFAI PDNIMELKAF FDEMEFLLMK NCIVHFFPEG ELRPYDTGLQ NFKKGAFYLA AQAQVPIVPM LITFEPPKGL IKIIRKKPVM RLHIGKPIHP MSKDIEIDSE LRMKAVCKKI EAITSV
|
| |