Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1480 |
Symbol | |
ID | 4810630 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 1798113 |
End bp | 1800218 |
Gene Length | 2106 bp |
Protein Length | 701 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640106901 |
Product | hypothetical protein |
Protein accession | YP_001037902 |
Protein GI | 125973992 |
COG category | [R] General function prediction only |
COG ID | [COG1033] Predicted exporters of the RND superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0156637 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATTCG GAAAAGCAGT GGTAAAATTC CGTGTGCCGA TTCTGATATT GACGCTGCTT TTAATGATTC CGTCCATATT GGGATATATC GGCACAAGAG TAAACTACGA CATGCTTGAA TATCTGCCGG AGGATATGGA AACAGTTATC GGTCAGAATG AACTGATGAA TGATTTCGGC AAAGGTGCTT TTTCCCTCGT TATCGTGGAG GATATGCCTG CCAAAGATGT TGCGGCATTG AAAGAAAAGA TTTCCAAAGT TGAGCATGTA GACACAGTTA TTTGGTATGA TTCCATTGCA GATTTGTCAA TACCAATGGA GATGCTGCCA AATAAACTCT ATTCTGCATT TAACACTGAA AATGCAACGC TAATGGCAGT ATTCTTCGAT TCTTCAACCT CTGCCGATGT CACAATGGAT GCAATAAGAG AAATTCGTTC TATCTGCGGC AAACAATGCT TTGTATCCGG TTTGTCGGCT CTTGTAACCG ATTTGAAGGA ACTTTGCGAA CGTGAAGAAC CGATTTATGT TACAATAGCC GTAATCCTGG CCTGTATTGC AATGTTTTTG TTCCTTGACA GCTGGCTTGT TCCTTTGGTG TTCCTTGCAT CTATCGGTAT GATGATTTTG CTCAATCTCG GAAGCAACTT TTTCTTAGGT GAAATATCAT ACATCACAAA AGCACTTTCC GCAGTTTTGC AGCTTGCCGT TACAATGGAT TACTCCATTT TCCTGTGGCA CAGTTACAAC GAACAATGCG AAACCTACAC CGACAAAGAA GATGCAATGG CGGTTGCAAT CAACAAAACA CTGGCAAGTG TTATAGGAAG TTCGGCCACC ACCATAGCAG GCTTTATTGC CCTTTGCTTC ATGTCTTTCA CGATGGGGCT GGACCTTGGT ATTGTAATGG CAAAAGGAGT GCTTCTCGGT GTTATAGGAG CTGTAACAGT GCTTCCCTCT CTCATTCTCA TACTCGACAA ACCGCTGCAA AAAACAAAAC ACCGCTCCTT AATTCCCAAC ATGGAAAAGG CAGCAAAAGG CATAGTAAAA GTATTTCCCC TGTTTCTTAT AATTTTTGCG CTTCTTATCG CACCGGCATA TTACGGTTAC AGCAAAACAA ACAGTGAAGT CTATTATGAT ATGGGGGAAT GTCTGCCGGA GGATATTCAA TATGTAATTG CCAATTCCAA GCTTCGGGAA AACTTTAACA TTGCCTCTAC ACACATGCTT CTTGTGGATA CCTCCGTACC GTCCAAAGAC GTTCGCTCCA TGATAAAAGA AATGGAACAG GTGGACGGAG TTAAATATGT AATAAGCCTT GAGTCGGTTA TTGGTTCCCG CGTACCGGAA GAAATTCTGC CGGAGTCGAT TACATCAATT GTAAAAAGTG ATAAATGGAG ACTTATGCTC ATAAGTTCTG AATACAAGGT TGCCAGCGAC AGCGTAAACA AGCAGATTGA CGAGTTAAAC ACCATACTGA AAAAGTATGA CAAAAACGGA ATGCTCATCG GAGAAGCTCC CTGCATGAAG GATATAATTG AAATCACTGA CCAGGATTTC AAAGTCGTCA ATACGGTGTC AATTTTTGCA ATCTTCGTTA TCATCGCATT GGTTTTAAGG AGCATTTCCC TGCCGTTTAT TCTGATTGCC GTAATTGAAC TTGCAATATT CATCAATCTC GGGCTGCCGC ATTATTTCGG TCAAAGTCTG CCGTTTATCG CGCCTATCTG TATCAGCACC ATCCAGCTGG GCGCCACAGT GGACTATGCG ATTTTGATGA CCACAAGATA CAAGTCGGAA CGTATTGAGG GAAATGACAA AACAACATCA GTACAAACAG CACTTGCCAC ATCAATTCCG TCAGTAATTG TGTCCGGTAT GGAATTGTTT GCAGCAACTT TCGGCGTTGC CATTTATTCC GATATTGACA TAATCAGCTC CATGTGTATG CTTATGGCCC GTGGTGCCAT CATCAGTATG CTGATGGTTA TTTTCGTGCT GCCGGCACTC TTGCTCCTCT GTGACAAAAT AATTTGCAAA ACAACTCTTG GTATGACAAA GATTAATAAT AACTTGAATT GGAGGTTGTC AGTAAATGAA AAGTAA
|
Protein sequence | MKFGKAVVKF RVPILILTLL LMIPSILGYI GTRVNYDMLE YLPEDMETVI GQNELMNDFG KGAFSLVIVE DMPAKDVAAL KEKISKVEHV DTVIWYDSIA DLSIPMEMLP NKLYSAFNTE NATLMAVFFD SSTSADVTMD AIREIRSICG KQCFVSGLSA LVTDLKELCE REEPIYVTIA VILACIAMFL FLDSWLVPLV FLASIGMMIL LNLGSNFFLG EISYITKALS AVLQLAVTMD YSIFLWHSYN EQCETYTDKE DAMAVAINKT LASVIGSSAT TIAGFIALCF MSFTMGLDLG IVMAKGVLLG VIGAVTVLPS LILILDKPLQ KTKHRSLIPN MEKAAKGIVK VFPLFLIIFA LLIAPAYYGY SKTNSEVYYD MGECLPEDIQ YVIANSKLRE NFNIASTHML LVDTSVPSKD VRSMIKEMEQ VDGVKYVISL ESVIGSRVPE EILPESITSI VKSDKWRLML ISSEYKVASD SVNKQIDELN TILKKYDKNG MLIGEAPCMK DIIEITDQDF KVVNTVSIFA IFVIIALVLR SISLPFILIA VIELAIFINL GLPHYFGQSL PFIAPICIST IQLGATVDYA ILMTTRYKSE RIEGNDKTTS VQTALATSIP SVIVSGMELF AATFGVAIYS DIDIISSMCM LMARGAIISM LMVIFVLPAL LLLCDKIICK TTLGMTKINN NLNWRLSVNE K
|
| |