Gene Information |
Locus tag | Cthe_1013 |
Symbol | |
ID | 4811307 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1211382 |
End bp | 1212464 |
Gene Length | 1083 bp |
Protein Length | 360 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640106431 |
Product | hypothetical protein |
Protein accession | YP_001037438 |
Protein GI | 125973528 |
COG category | [R] General function prediction only |
COG ID | [COG0628] Predicted permease |
TIGRFAM ID | [TIGR02872] sporulation integral membrane protein YtvI |
|
|
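The coordinate and length fields above can be cross-checked with a short calculation. The sketch below assumes the coordinates are 1-based and inclusive, and that the 1083 bp include one terminal stop codon that is not translated. The variable names are illustrative and not part of the database record.

```python
# Values copied from the Gene Information table above.
start_bp = 1211382
end_bp = 1212464
protein_length_aa = 360

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(gene_length_bp)           # 1083
print(gene_length_bp % 3)       # 0
print(expected_protein_length)  # 360
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.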
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000000395023 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATAAAA GAATAAAACT TGTTATCAGA CTGGTTCTTA TTTTTGGAGG TACGGTTATT GGAATATTGC TGGGACTTAG GCTTGCAATT TATTTTGCAC CTTTTCTTAT TGCTTTTGCC ATATCTTCAA TGATAGAGCC TGTAATCCGG TTTCTTATGA AAAAGTTAAA GTTTAGAAGA AAATTTGCGG CATTGGTTTC ATTGCTTTTG GGTTTGTCGA TGATTGCAAT ACTGTTGCTT ATGTTGTTTT CCAAACTTTA TAATGAAATA ACAAGCCTTT CAAGTTCTCA GCCGGAGTTT TGGAAAGAAG CATATCAAAA CATATCGAAT TTAATAAACA GAGGGTTGAA TATTTATTTT GGACTTCCGA GTGAGGTTAC CGCCCAAATT TCAAGTATGG TTTCAAGTCT TTCAAATTCC GTTTCTAGTT TGGTGGATTC TTTTGTGAAG GGAATTTACA ATACTGCAAT ATCCATACCT CAGATGGTGA TATTTATATT TGTGACAATT TTATCAACTT ATTTCATTTC CAGCGACAGG GACAGAATTT ATGACTATAT CAAAGATAAT GTGCCTGATG CATTATTAAA CAAGATAATA GATATAAAGG ACAGTATGTT TACAGCCTTG TTCGGGTATG TCAAAGCTCA GCTGATACTA ATGACAATTA CTTTTTGTGA GCTTTCCCTC GGGTTTACAA TAATTGGTGT AAAACGACCC ATTTTACTCG GATTGGTTAT CAGCTTCATA GATGCATTTC CGGTGCTTGG AACAGGCGGA GTACTTGTTC CCTGGGCCAT ATATGAGTTT TTAACCAAAG ACATCAGGAT GGGAGTGTCA CTGCTGATTT TGTATGTTAT AGTGCTTGTT ATCAGACAAA TGATAGAGCC AAAAGTGGTT GGAGAGCAGA TAGGCCTCCA TCCTTTGATG ACGTTGATAA CCATGTACCT CGGAATGAAG TTCTTTGGCT TGCCCGGCCT TATTTTGGGC CCCATTGTGA CATTGATTTT CAAAAATATA ATATCAGGAA TGATAAAGCA TAAGAATCTC AAAGAACTGA TAAACGAATT TATAAAAAAA TAA
Protein sequence | MDKRIKLVIR LVLIFGGTVI GILLGLRLAI YFAPFLIAFA ISSMIEPVIR FLMKKLKFRR KFAALVSLLL GLSMIAILLL MLFSKLYNEI TSLSSSQPEF WKEAYQNISN LINRGLNIYF GLPSEVTAQI SSMVSSLSNS VSSLVDSFVK GIYNTAISIP QMVIFIFVTI LSTYFISSDR DRIYDYIKDN VPDALLNKII DIKDSMFTAL FGYVKAQLIL MTITFCELSL GFTIIGVKRP ILLGLVISFI DAFPVLGTGG VLVPWAIYEF LTKDIRMGVS LLILYVIVLV IRQMIEPKVV GEQIGLHPLM TLITMYLGMK FFGLPGLILG PIVTLIFKNI ISGMIKHKNL KELINEFIKK
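The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is a minimal illustration, not the database's own method. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in permitted start codons), and that the gene sequence is already shown in the coding direction, since it starts with ATG. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical. Only the first three 10-base blocks and the last four codons of the record are used here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_blocks = "ATGGATAAAA GAATAAAACT TGTTATCAGA"
last_codons = "ATAAAAAAA TAA"

print(translate(first_blocks))  # MDKRIKLVIR
print(translate(last_codons))   # IKK
```

The two printed fragments match the start (MDKRIKLVIR) and end (IKK) of the protein sequence above. Passing the full gene sequence to `translate` and `gc_fraction` would be the way to compare against the Protein Length and GC content fields; that full comparison is not carried out here.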