Gene Information |
Locus tag | Cthe_1807 |
Symbol | |
ID | 4809791 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2141116 |
End bp | 2142780 |
Gene Length | 1665 bp |
Protein Length | 554 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640107221 |
Product | hypothetical protein |
Protein accession | YP_001038221 |
Protein GI | 125974311 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2834] Outer membrane lipoprotein-sorting protein |
TIGRFAM ID | |
|
|
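The coordinate and length fields above can be cross-checked against each other with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates (an assumption, though it matches the listed values) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of residues for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values copied from the record for Cthe_1807.
length = gene_length_bp(2141116, 2142780)
print(length)                           # 1665
print(expected_protein_length(length))  # 554
```

Both results agree with the Gene Length (1665 bp) and Protein Length (554 aa) fields in the record.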
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACAAAA ATGAAAAGAA ATTATCCGAA TACATTGATA AATTAAATGC CGAGAAAATG CCTGACGAGC ACGAGTGTCT GCCGGATTCA CCGGAATTGG AGGAACTTAT GGATACGGTA AGAAAAATTC GAAGTCTGAA GGAGCCTGCT CTGCCGGATG CGGATTATCC AAAAAAGCTG GCCCGGGTAG TCAGTGCTCA ATTATCGCAA AAATCCGCCG CCGGAAAAAG AAAATGGACA TGGCTGGCCG GAGCGGCTGC TGTTGCGGCA GTTGCTGTCC TGGTTTTTGT ACTGAATTTT GTACTGTATT CCGGCAGAAC CGACATTGTA TACGCCATGG AGCAGGCATA TAAGGAAGTT AAAGCATATC ACGGAATCCT CAGCATTGTT GAAACCAATC TCAATGGAGA AGAGACTTTG CAGGCAATGC GGGAGGTTTG GGCGGACAGC GAGGGACGCT ACTATGTAAA AGAGCTTCAG GGCTTTCAGA AAGGCTTGAT AACCGTAAAC AACGGCGAAA AAAAGTGGCA GGTGAGTCCT GCTGAAGAAC AAGTATACAT CTTTCCATCA TTCCCCGATC CATACAAATT CACCTTGGAA CTTGGCAATG AAATAAAAGA TGCCAAAAAT GCCGAACAAA TCAAAGCCGT GGGAGAAGAG ATGGTTGCGG GAAGAGAAAC CTCTGTATTT GAGGTACTGC CCAGAGGAGG GGAATCCTAC AAAATATGGA TTGACAAGGA GACGAATCTG CCGCTTCAAA AAGAGAGTGC TATGATGAAT GCAATTCAAT ACAGGGTAAC CTATACCAGC ATTGAGTTTG GCGACAATAT ACCCGGTGAG CTTCTTGCTT ATAGCTTGCC GCAAGGCTTT AAGGAAATAG ATAAGAATCC CGAACTGCAG GTCGGCAGCG TTGAAGAAGC TGCGGAAACA GCCGGTTTTA CTCCCCAAAT ACCCCAAAAT GTTCCCGGGG GATATACAAG AAACGGCATG GCAGTTACAG GGGATATGAA AACCGTCAAG CTAAGCTATA TATCCCAGGA TAAGAAAAGC CGGGTAATTA TTTTGCAGAA AAAAGCAACG GATGAGTTTA AACCTGCATC AACAGCGGTT TTAGGCAAGG TGGGCGGCAA TACTGCCGAA ATTCAGTCTC CTGTGCAGGA CAGTCCTGGA GTGCTTGAAG GAGGAATGTA TTCAGGGATG GCGGATATCC GCTCGATTCG CTGGCAGGAA TCCGGATTTG AATATGCTGT GATAGGCGAT GCGCCAATGA ATGAATTGAT TTCATTCATT GAAAGTATAA CAACAGGTCC GGTTGAGATA CCGCCGGAAA ACGAAGAAAC CCCAGAGAAG CCTCAGATTG AAGTTCCGGT TGATCTGAAA GTCGAGAAAA ATGAGCAAAA AAGCGTGGAT GCGGGACATT CACCGTGGAA ACTGGATCCT GTTTATGTCG CACAAGTATT TGTAAGCCTG AAAATTTCTC CTGAAGGCAT TGAAGGAGAA TATCCGGTAA GTTATGAAGA CATGGAGGTT GTAAAAAACA ACGGCATAGA GGCGGTAGTG GAGATAAGCG GTGATAACAC ACCTGTGCGC AGGGTTTATT TAAAAAGACT GATAAGACAG GACAGCACGG GAATATGGAC TGTGGTCGGA TATGATCCGG TTTAA
|
Protein sequence | MDKNEKKLSE YIDKLNAEKM PDEHECLPDS PELEELMDTV RKIRSLKEPA LPDADYPKKL ARVVSAQLSQ KSAAGKRKWT WLAGAAAVAA VAVLVFVLNF VLYSGRTDIV YAMEQAYKEV KAYHGILSIV ETNLNGEETL QAMREVWADS EGRYYVKELQ GFQKGLITVN NGEKKWQVSP AEEQVYIFPS FPDPYKFTLE LGNEIKDAKN AEQIKAVGEE MVAGRETSVF EVLPRGGESY KIWIDKETNL PLQKESAMMN AIQYRVTYTS IEFGDNIPGE LLAYSLPQGF KEIDKNPELQ VGSVEEAAET AGFTPQIPQN VPGGYTRNGM AVTGDMKTVK LSYISQDKKS RVIILQKKAT DEFKPASTAV LGKVGGNTAE IQSPVQDSPG VLEGGMYSGM ADIRSIRWQE SGFEYAVIGD APMNELISFI ESITTGPVEI PPENEETPEK PQIEVPVDLK VEKNEQKSVD AGHSPWKLDP VYVAQVFVSL KISPEGIEGE YPVSYEDMEV VKNNGIEAVV EISGDNTPVR RVYLKRLIRQ DSTGIWTVVG YDPV
|
| |
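The gene and protein sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how one might strip that formatting, compute GC content, and translate the CDS is shown below. It assumes the gene sequence is given in coding orientation (it begins with ATG even though the record lists the minus strand) and relies on translation table 11 using the same codon-to-amino-acid assignments as the standard code. All function names are hypothetical, and ambiguous bases such as N are not handled.

```python
# Standard codon assignments in NCBI order (first, second, third base over T, C, A, G).
# Translation table 11 uses these same assignments for elongation.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing used in the record and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean_sequence(dna)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
prefix = "ATGGACAAAA ATGAAAAGAA ATTATCCGAA"
print(translate(prefix))  # MDKNEKKLSE
```

The translated prefix matches the first block of the listed protein sequence. Passing the full gene sequence to `gc_content` and `translate` would allow the same comparison against the GC content and Protein sequence fields.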