Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1423 |
Symbol | |
ID | 4809084 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1741874 |
End bp | 1742980 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640106846 |
Product | hypothetical protein |
Protein accession | YP_001037847 |
Protein GI | 125973937 |
COG category | [R] General function prediction only |
COG ID | [COG0628] Predicted permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00001195 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAAGAA ATCTCATAAT TGTTATTACT TATGCAGTGG CTTTGGTTCT TATTGTCATA AATTTTGTAC CGATTATGCG CGCTGTATGG AAGTTTATTG TTTTATTCAA ACCGTTTTTT ATGGGAATTG CCGTTGCTTT TGTCCTAAAC AGGCCATGCA TGGCGGTTGA GAGGTTTTTG AATAAAAGGT TGTTCAAAAA TCGATTAAAA GTACTTGTCA GAGGAATAGC AATTACTGTT ACATATTTAG TGGTACTTTT GTTGATTACG CTGATAATAA GTTTTATAAT ACCGGAACTT ATAAAAAGTA TACAAGTGTT TTTAAGCAAT ATGGGAGCAT ATATAGATAA TTTCAGGGAT TTGACCAATG AGCTTTCCGA ACTCTTGGGA CTTGAAAGGA TTGACCTGTC GTCTTTGGAC AAATTGATTC TTGAGTACAC AAACAGATTG GGAAGCAGCT TGACCGAGCT GATGCCGAAA ATTATCAGCA TTACGACGGG GGTTTTGTCA TTCTTTGCAA CATTGGTAAT AACGGTGGTG TTCTCGATAT ATATTTTGGC GGGAAAAGAA AGACTTATCG GACAATGCAA AAAAGTTTTC AGCACTTATC TTCCCGAGTG CCTGTACAAG AAGGGAGCTT ATGTGTATCG TGTTGTGGTG GATGTGTTTA ACAAATATAT ATATGGACAG CTGGCGGAGG CTTTCATTTT AGGTTCGCTT TGCTTTATTG GGATGGTTAT TTTTCGGTTT GAATATGCAC TTCTCATAAG CGTTTTAATT GCAGTTACCG CATTGGTGCC GTATTTTGGA GCGTACATAG GCGGATTTTG TGCGTTCATG CTCCTTTTAA TGATTTCGCC CACTAAAGCT ATATGGTTTT TAGTTTACCT GGTAGTATTG CAACAGTTGG AAAATAATTT AATATACCCA AGGGTTGTCG GAAGCAGTCT TGGACTTCCC GGAATATGGG TTGTTTTGGC GGCAATTGTC GGTGCCGGAG TCGGGGGCCC GATTGGTGTT TTGACTGGGG TACCGATTGC AACAGTTCTT TTCACTTTGC TTAGAAATGA TGTTTTAAGA AGATCCGGAA AGCAGAATGT TAAATGA
|
Protein sequence | MTRNLIIVIT YAVALVLIVI NFVPIMRAVW KFIVLFKPFF MGIAVAFVLN RPCMAVERFL NKRLFKNRLK VLVRGIAITV TYLVVLLLIT LIISFIIPEL IKSIQVFLSN MGAYIDNFRD LTNELSELLG LERIDLSSLD KLILEYTNRL GSSLTELMPK IISITTGVLS FFATLVITVV FSIYILAGKE RLIGQCKKVF STYLPECLYK KGAYVYRVVV DVFNKYIYGQ LAEAFILGSL CFIGMVIFRF EYALLISVLI AVTALVPYFG AYIGGFCAFM LLLMISPTKA IWFLVYLVVL QQLENNLIYP RVVGSSLGLP GIWVVLAAIV GAGVGGPIGV LTGVPIATVL FTLLRNDVLR RSGKQNVK
|
| |