Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1890 |
Symbol | |
ID | 4809221 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2244759 |
End bp | 2246891 |
Gene Length | 2133 bp |
Protein Length | 710 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 640107309 |
Product | cellulosome enzyme, dockerin type I |
Protein accession | YP_001038304 |
Protein GI | 125974394 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAAAAA AAGGACTACT ATTACTACTA ACAGTTATTG CAGCAACCAT TGTTGTTAGC ATGATGTCTG CCGGTGCTAC TACATTATAC GGTGACTTAA ATGCAGATGG TTCAATCAAC TCAACCGATT TAATGATAAT GAAGAGAGTA CTGCTCAAGC AAAGAACTCT TGATGACATT ACTCCCGCTG ATTTGAATGG TGACGGTAAA GTAACCTCAA CAGATTATTC GTTGATGAAA AGATACTTAC TCAAGGAAAT AGACAAATTT CCGGTTGAGG ATATAGAACC GACTCCTACA CTGGAGGTTA GCCCAACTCC TACGGAAACC AGTGAAGAGG TATTTGCTTT TAAAATTAAA CTATTTTCGG ATGGCGATAC ATACAGGTTT CCTATTCAAG AGATATCAGA GAATAATAAT ATTGTTGTTG ACTGGGGTGA TGGTACAACA AGTACTATAA CTGATTATTC AACATTGAGG CATAAGTATG AAAAAGCAGG TGTATATACA ATAAAAGTAC TTTGGTTTGA TCACATACCA ATTCGGTTTA CAGGAGATAA GTATGTGATT GAAATACTTA CACCCCTTCC AGATATCGGA TTAACTGACT TTAGCTCTTT TTTTAAAAAC TGTAGTAACC TAGAAAGAAT TCCAGACAGA TTATTTTCAA ACAATATTAA TGCAACAGAC TTTAATTTCT GTTTTAGCGG CTGTACTAGC TTGACAGAAA TACCTGAGAG TTTGTTTGCA GGCAATGTTA ATGCAACTAC CTTTGTTCGG TGTTTTTACC GTTGTAGCAA CTTAATAAAA GTTCCGGAAG GGTTGTTTGA AAATAATGTT AATGCGACTA ATTTTTTGGG CTGTTTCGAT GAATGTAGTA GCCTGAAGGA AATTCCAGAA GGATTATTTT CAAATAATGT TAATGCAGCA AACTTTAGTT GGTGCTTTAG TGAATGTGTT AGTTTAGCAA AAATTCCTGA AGGATTGTTT AGAAATAATA CTAATGCAAC AGATTTTAGT TACTGTTTTT ATGGTTGTAC TAGCATAACA AAAATTCCCG GAGGGCTGTT TGAAAATAAT ATTAATGCGG AAGACTTTGG TAATTGCTTT AGTGGATGCA GTAGCATAAC GGAAATTCCC GGAGGGCTGT TTGAAAATAA TATTAATGCG GCAAACTTTG GTAGTTGCTT TAGTGGATGT AGTAGCATAA CGGAAATTCC AGAAGGGCTA TTTGAAAATA ATATTAATGC GGAAGACTTT AGAGGTTGCT TTAGTGGATG CAGTAGCATA ATGGAAATTC CAGAAGGGCT ATTTAAAAAT AATATTAATG CGGAAGACTT TAGAGGTTGC TTTAGTGGAT GCAGTAGCAT AACGGAAATT CCCGGAGGGC TGTTTGAAAA TAATATTAAT GCGGAAGACT TTGGAGGTTG CTTTAGTGGA TGCAGTAGCA TAACGGAAAT TCCCGGAGGG CTGTTTGAAA ATAATATTAA TGCATCAGAC TTTAGTAGTT GTTTTAGTGG ATGCAGTAGC ATAACGGAAA TTCCTGGGGG TTTGTTTAGA AATAATATTA ATACAACAAG ATTTATGGAG TGCTTTAAAG GATGTAGCAG CGTAACAGAA ATCCCTGAAG AGCTATTTGC CAATAATGTT GATACAGCTA TCTTTATAGG TTGTTTTAGT GAATGCATCA GTTTGAGAAA AATTCCAGAA GGACTGTTTA AAAATAATAT TAACGTAATA AGCTTTATGG AGTGCTTTAA AGGATGTAGT AACCTAACAG AAATCCCTGA AGGGCTATTT GTAAATAATA CTAATGCAAC AGACTTTCAA GGTTGTTTTT ATGGGTGCAG TAGTTTGACA GAAATTCCTG CGAGATTATT TACGAATAAT GTTAATGTAA CTAATTTTAG AGAGTGTTTT AGGGATTGTA CGAGCTTAAT AGAAATTCCA GAGAGTCTTT TTGATAGCAA TGTTAATGTC ACCAATTTTT ATAGATGTTT TTATGGGTGC AAAAACTTAA CAGGTGTAGC ACCTGCTTTA TGGCTGCGTA CAAATGTTAA AGAATTTTCG GGTTGCTTTG GAAGCTGTAC TAAACTGTCC AACTATAATG ATATTCCAAA AGGTTGGAAA TAA
|
Protein sequence | MRKKGLLLLL TVIAATIVVS MMSAGATTLY GDLNADGSIN STDLMIMKRV LLKQRTLDDI TPADLNGDGK VTSTDYSLMK RYLLKEIDKF PVEDIEPTPT LEVSPTPTET SEEVFAFKIK LFSDGDTYRF PIQEISENNN IVVDWGDGTT STITDYSTLR HKYEKAGVYT IKVLWFDHIP IRFTGDKYVI EILTPLPDIG LTDFSSFFKN CSNLERIPDR LFSNNINATD FNFCFSGCTS LTEIPESLFA GNVNATTFVR CFYRCSNLIK VPEGLFENNV NATNFLGCFD ECSSLKEIPE GLFSNNVNAA NFSWCFSECV SLAKIPEGLF RNNTNATDFS YCFYGCTSIT KIPGGLFENN INAEDFGNCF SGCSSITEIP GGLFENNINA ANFGSCFSGC SSITEIPEGL FENNINAEDF RGCFSGCSSI MEIPEGLFKN NINAEDFRGC FSGCSSITEI PGGLFENNIN AEDFGGCFSG CSSITEIPGG LFENNINASD FSSCFSGCSS ITEIPGGLFR NNINTTRFME CFKGCSSVTE IPEELFANNV DTAIFIGCFS ECISLRKIPE GLFKNNINVI SFMECFKGCS NLTEIPEGLF VNNTNATDFQ GCFYGCSSLT EIPARLFTNN VNVTNFRECF RDCTSLIEIP ESLFDSNVNV TNFYRCFYGC KNLTGVAPAL WLRTNVKEFS GCFGSCTKLS NYNDIPKGWK
|
| |