Gene Information |
Locus tag | Cthe_1562 |
Symbol | |
ID | 4810069 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 1890324 |
End bp | 1891535 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640106980 |
Product | hypothetical protein |
Protein accession | YP_001037981 |
Protein GI | 125974071 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG4591] ABC-type transport system, involved in lipoprotein release, permease component |
TIGRFAM ID | |
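
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is a minimal illustration, not part of the source record. It assumes that Start bp and End bp are 1-based inclusive positions and that the gene span includes the stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for this example.

```python
# Values copied from the record above.
start_bp = 1890324
end_bp = 1891535


def gene_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the span ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

Under these assumptions the computed values agree with the Gene Length (1212 bp) and Protein Length (403 aa) fields.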
| |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00597787 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGGCAT ATGCATTAAC TTCGAATAGA ATCTCCTTTC TAAATCTCAG AAAAAAGTCC TTTCGCACCT TTGCATTAAT ATTGGTGGTT GCAATCCTGT CTTTTGCACT TTTTGGAGGA ACTGTCCTGT CCTTTAGCTT TAGAAACGGG CTTCGCAGTG TAGAAGCAAG GCTTGGTGCA GATTTGATTG TGGTTCCTCT CGGGCATGGG AAACAACAGG AATCCATTCT TTTAACCGGG GAGCCGTCCT ACTTTTATTT TAATAAAGAG ATTGTCCAAA AGCTTCAGGA AGTTGAGGGT ATTTCCAAAC TGTCGACACA ATTTTACCTC ATTTCACTAA GTACAGGATG CTGTTCTCTT CCTGTGCAGA TTGTGGGATT TGATCCGGCT ACGGATTTTT CAATTCAACC GTGGATTCAA GAAACTCTGG GGGGAAATCT TGAAAACGGT GCTGTTATTG TCGGAAGTGA TATTATGATT GAGGATAACA AGCACATTAA ATTTTTTGAT AAAGAATATC CCGTGGCGGC AAAGCTTGAC AAAACGGGGA CAGGATTGGA TCAATCCGTA TTTGCTACTA TGGAAACCCT TAAAGACCTA TATTCCGGTG CAAAAGAAAA AGGCTTTAAC TTTTTGGAGG ATACGGATCC TGATACCTTT ATTTCATCCG TACTGATTAA AGTTCGTGAA GGGTACAATA TAGATCAAGT CATAACCAAT ATCCGAAGAA AAACGGACGG TGTCCAAATT GTCAGGACAC AAAATTTAAT TACCGGCATT TCAAAGAGCC TTGGCAATAT TATCACCTTT TTTAATGTAT TTGCATTGGT GCTTTTGGGC GTAACCTTGG TAATTTTGAC GGTGGTATTT TCAGCCTCAG CCAATGAACG AAAAAAGGAA TTTGCTATTA TGAGGATACT GGGGGCCACA AGGAAAAAGC TTGCTGCCGT TTTGATTTGG GAGTCGCTGT ATATCAGTGT GTCAGGCGGT GCTATAGGAA CTATACTTGC GGCAATATTT GTATTTCCCT TTAACGTTTT CATTGGTGAC AGCATAGGAT TGCCTTACAT ACAGCCTTCT CTTCTTTGGA TTATAGCTAT TTTACTGGGA ACATTACTCG TGGCTTTTTC CCTTGGCCCA ATTGCTTCAG CCTATTCTGC CGTTAAAGTC AGCCGTTCCC AAACTTATCT GACATTAAGG GAGGGTGAGT AG
|
Protein sequence | MMAYALTSNR ISFLNLRKKS FRTFALILVV AILSFALFGG TVLSFSFRNG LRSVEARLGA DLIVVPLGHG KQQESILLTG EPSYFYFNKE IVQKLQEVEG ISKLSTQFYL ISLSTGCCSL PVQIVGFDPA TDFSIQPWIQ ETLGGNLENG AVIVGSDIMI EDNKHIKFFD KEYPVAAKLD KTGTGLDQSV FATMETLKDL YSGAKEKGFN FLEDTDPDTF ISSVLIKVRE GYNIDQVITN IRRKTDGVQI VRTQNLITGI SKSLGNIITF FNVFALVLLG VTLVILTVVF SASANERKKE FAIMRILGAT RKKLAAVLIW ESLYISVSGG AIGTILAAIF VFPFNVFIGD SIGLPYIQPS LLWIIAILLG TLLVAFSLGP IASAYSAVKV SRSQTYLTLR EGE
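
Both sequences above are printed in space-separated blocks of 10 characters, so the spaces have to be removed before any analysis. The sketch below is a minimal illustration that uses only the first three blocks of the gene sequence, not the full gene, so its GC fraction is not expected to match the GC content field of the record. The helper names `clean_blocks`, `gc_count` and `codons` are hypothetical and introduced only for this example.

```python
# First three 10-base blocks of the gene sequence, copied from the record.
prefix_blocks = "ATGATGGCAT ATGCATTAAC TTCGAATAGA"


def clean_blocks(blocked: str) -> str:
    """Remove the whitespace that separates the 10-character blocks."""
    return "".join(blocked.split()).upper()


def gc_count(seq: str) -> int:
    """Number of G and C bases in a nucleotide string."""
    return sum(1 for base in seq if base in "GC")


def codons(seq: str) -> list:
    """Split a sequence into complete triplets, reading from position 0."""
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


prefix = clean_blocks(prefix_blocks)
prefix_gc_fraction = gc_count(prefix) / len(prefix)
prefix_codons = codons(prefix)
print(len(prefix), gc_count(prefix), prefix_codons)
```

Applying the same helpers to the full gene sequence would give the whole-gene GC fraction, and the first codons of the prefix (ATG ATG GCA TAT ...) line up with the start of the protein sequence (M M A Y ...).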