Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1363 |
Symbol | |
ID | 4809358 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1655927 |
End bp | 1657321 |
Gene Length | 1395 bp |
Protein Length | 464 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640106787 |
Product | lipopolysaccharide biosynthesis |
Protein accession | YP_001037788 |
Protein GI | 125973878 |
COG category | [D] Cell cycle control, cell division, chromosome partitioning [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0489] ATPases involved in chromosome partitioning [COG3944] Capsular polysaccharide biosynthesis protein |
TIGRFAM ID | [TIGR01007] capsular exopolysaccharide family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000148306 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAAAGG ATGATATCTT GGAAATTGAT TTGCGGGAGA TAGTATATAT TCTTCTTAAG AAGTGGTATC TCATTGTGTT GTGCTTTGTT CTATCAGCTG GCACAGCCTT TGTAGTTACT CAATTTTATA TAAAACCGGT CTATAAAGCT GAAACAACAC TGTTTCTTGG TAAGGAGAAG GATCAGGTTT CACTGAGCTT TTCCGATATA CAGGTAAATA ATCAACTTGT TGTGGACTAT AGAGAGATAC TTCAATCAAG GCGTGTGGCA GAGATTATTA ATCAAAAGTT TGGAGTTGGA ATACAGGAAT TTCAAAAAAA TGTAAACGTA AAAACAGTAA AAGATTCAAG ATTGTTCAGT ATAAGCTATG AGGATACTGA TCCAAAACGT GCTGCTGACA TAGTAAATGA ACTGGCAACG GTTATTCAGA AGATGGCAGA CGAGATTATA CAGGTCAAAA ATATTAAAGT TATAGATACT GCGAAAATAC CGGAAAACCC TATAAAGCCT AATAAGAAAA TGAATATATG TGTGGCAGGA TTGTTTGGTT TGGTATTGGG GATCGGATTG ATATTCTTAC TTGAATTGAT TGACCATACA TTTAAGAAGC CGGAAGAAGT TGAAAAGATG CTTGGAATTA ATGTTATTGG TACAATCCCG GCATTTGATG GCGGCAAACG AGGAAAGAAA AAAGCCAAGG ATGAAAAAGA ACTTCAAGAA CAATACCTTA AAAATCTTAT TGTACATAAT AATCCAAAGT CAGCCACTGC CGAGGCGTTT AGAGAGCTTC GCACAAATTT GTATTATTCA AGTGTAGACA GCGAAGTAAA GACTATAGTA GTAACAAGTC CAACTCTCGG AGACGGCAAA ACGGTTACTG CTGTGAATCT TGCAATAACT CTTGCACGTT CCGGCAAGAA AGTTCTTGTT ATAGATGCTG ACTTAAGAAA GCCAAAGGTT CATCATTATT TTGGTGTGAA AAATAAAGAG GGTTTAACTA ATCTTTTAAC TGATTCAAAG GAAGAAGTAA AAATTAAAAC AACAGAGAGA AGCGATATTT CAAATCTGTA TATAATTACA AGTGGTCCGA TTCCGCCAAA TCCGGCAGAA ATGCTTAATT CAAACAGGAT GAAAAGCCTT TTGGAAAAAG TACGGGAGGA ATATGATATT GTTATCATAG ACACTCCTCC GGTGGGGCAG GTTACAGATG CAGCAATTCT TGCCGGAATT ACGGATGGAG TTATCCTTGT ATTGGCAAGT GGCCAAACAA GAATAGAAAT GGCGAAGCGA GCTTTTAAAT CCCTTGAGAG TGTAAAAGCA AGGTTTATTG GTGCAGTTCT TACGAAATTG GATACTGAAA GAACTGGTTA TTATTACAGT TACAAGTATG AGTGA
|
Protein sequence | MEKDDILEID LREIVYILLK KWYLIVLCFV LSAGTAFVVT QFYIKPVYKA ETTLFLGKEK DQVSLSFSDI QVNNQLVVDY REILQSRRVA EIINQKFGVG IQEFQKNVNV KTVKDSRLFS ISYEDTDPKR AADIVNELAT VIQKMADEII QVKNIKVIDT AKIPENPIKP NKKMNICVAG LFGLVLGIGL IFLLELIDHT FKKPEEVEKM LGINVIGTIP AFDGGKRGKK KAKDEKELQE QYLKNLIVHN NPKSATAEAF RELRTNLYYS SVDSEVKTIV VTSPTLGDGK TVTAVNLAIT LARSGKKVLV IDADLRKPKV HHYFGVKNKE GLTNLLTDSK EEVKIKTTER SDISNLYIIT SGPIPPNPAE MLNSNRMKSL LEKVREEYDI VIIDTPPVGQ VTDAAILAGI TDGVILVLAS GQTRIEMAKR AFKSLESVKA RFIGAVLTKL DTERTGYYYS YKYE
|
| |