Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2590 |
Symbol | |
ID | 4809012 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3061974 |
End bp | 3063893 |
Gene Length | 1920 bp |
Protein Length | 639 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640108004 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001038983 |
Protein GI | 125975073 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3693] Beta-1,4-xylanase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.549614 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAA CAGCATCATT TATTCTTGTT CTTTCATTGC TGCTGGCTTT TATTATTCCG GCTGAAGGGG GGTTGGCAGC AGAAGGCAAT TTACTTTTCA ACCCGGGCTT TGAACTGGGA AGCACCGAAG GATGGTATCC TTACGGAGAG TGTACCATTG AGGCGGTCGG TACGGAAGCG CACAGCGGAA ATTATAGCGT TTTTGTTACG GACAGGACTC AGGATTGGAA CGGTGTGGCC CAGGACATGC TTGACAAGCT GACCGTAGGC ATGACCTATC AGGTTTCGGC ATGGGTCAAA GTTGCAGGGA CAGGAAGTCA TCAAGTCAAA ATATCCATGA AGAAGGTCGA AACCGGCAAG GAGCCGGTGT ATGACAACAT TGCGTCAATT ACCGTTGAGG GATCCGAATG GTACAGACTG TCGGGTCCGT ACAGTTATAC CGGCACGAAT GTTACAAACC TTGAACTTTA CATAGAGGGA CCCCAGCCGG GTGTCAGCTA CTATGTGGAT GATGTTACGG TAACTGAAGT GGGTTCCGCC GCAACATGGA AGGAAGAGGC GAATGCAAGA ATTGAGCAAA TCAGAAAAAG GGATGCGAAA ATAAGGATTG TGGATTCAAA CAATAAACCT GTTTCGGGAG TAAGCATTGA TGTTCGCCAG GTAAAGCATG AATTTGGTTT TGGATCGGCT ATTACCATGA ATGGAATACA TGATCCCCGT TATACCGAGT TCTTCAAAAA TAATTATGAG TGGGCGGTAT TTGAAAATGA AGCAAAATGG TATTCCAACG AAAGCAGCCA GGGCAACGTT TCTTACGCCA ATGCGGACTA CTTGTACAAT TGGTGTGCCG AAAACGGCAT AAAGGTAAGA GGCCATTGTA TATTCTGGGA GCCCGAAGAA TGGCAGCCTT CATGGCTTAA AGGACTTACC GGAGATGCTC TTATGAAAGC GATAGATGCA AGGCTTGAAA GTGTTGTTCC CCACTTTAGA GGCAAATTCC TTCACTGGGA TGTAAACAAT GAGATGCTCC ATGGAGATTT CTTCAAAAGC CGCTTGGGAG AGTCCATATG GCCTTACATG TTCAAAAGGG CCAGGGAGCT TGATCCGGAT GCAAAACTCT TTGTAAATGA TTATAACATT ATCACTTATG TTGAGGGAGA TGCATACATA AGGCAGATTG AATGGCTCCT GCAAAACGGT GCCGAGATAG ATGGCATAGG GGTGCAGGGA CATTTTGATG AAGATGTTGA ACCCCTTGTT GTAAAAGCCA GACTGGACAA TTTGGCGACT TTGGGAATTC CCATATGGGT AACCGAATAT GACTCTAAAA CGCCGGATGT AAACAAGAGA GCGGAGAATC TTGAAAACCT TTACCGTATC GCATTCAGTC ATCCGGCGGT GGAAGGTATT ATAATGTGGG GATTCTGGGC AGGTAACCAC TGGAGAGGTC AGGATGCCGC AATAGTAGAT CATGACTGGA CTGTAAATGA GGCAGGAAAG AGATACCAGG CTTTGTTGAA AGAGTGGACC ACGATTACCT CAGGTACTAC CGACAGCACA GGTGCATTTG ATTTCAGAGG TTTCCACGGC ACGTATGAAA TTACTGTGAG TGTTCCGGGG AAAGAGCCTT TTGTAAAGAC CATTGAGCTT ACCAAAGGGA ACGGAACGGC TGTATATACA TTTACTGTGG ACGGAACTGA TGCCGGAAAT GTTTTGTATG GGGATTTGAA CCAGGACGGC CAGGTAAGTT CAACGGACTT GGTTGCCATG AAAAGATACC TGTTGAAAAA TTTTGAACTG TCTGGCGTAG GGCTTGAGGC TGCAGATTTG AACAGCGATG GCAAAGTTAA TTCCACGGAT TTGGTTGCTC TTAAAAGATT TTTGTTAAAA GAAATAGATG AATTGCCTTT AAAACGTTAA
|
Protein sequence | MKKTASFILV LSLLLAFIIP AEGGLAAEGN LLFNPGFELG STEGWYPYGE CTIEAVGTEA HSGNYSVFVT DRTQDWNGVA QDMLDKLTVG MTYQVSAWVK VAGTGSHQVK ISMKKVETGK EPVYDNIASI TVEGSEWYRL SGPYSYTGTN VTNLELYIEG PQPGVSYYVD DVTVTEVGSA ATWKEEANAR IEQIRKRDAK IRIVDSNNKP VSGVSIDVRQ VKHEFGFGSA ITMNGIHDPR YTEFFKNNYE WAVFENEAKW YSNESSQGNV SYANADYLYN WCAENGIKVR GHCIFWEPEE WQPSWLKGLT GDALMKAIDA RLESVVPHFR GKFLHWDVNN EMLHGDFFKS RLGESIWPYM FKRARELDPD AKLFVNDYNI ITYVEGDAYI RQIEWLLQNG AEIDGIGVQG HFDEDVEPLV VKARLDNLAT LGIPIWVTEY DSKTPDVNKR AENLENLYRI AFSHPAVEGI IMWGFWAGNH WRGQDAAIVD HDWTVNEAGK RYQALLKEWT TITSGTTDST GAFDFRGFHG TYEITVSVPG KEPFVKTIEL TKGNGTAVYT FTVDGTDAGN VLYGDLNQDG QVSSTDLVAM KRYLLKNFEL SGVGLEAADL NSDGKVNSTD LVALKRFLLK EIDELPLKR
|
| |