Gene Information |
Locus tag | Cthe_0779 |
Symbol | |
ID | 4810397 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 942509 |
End bp | 943861 |
Gene Length | 1353 bp |
Protein Length | 450 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640106196 |
Product | copper amine oxidase-like protein |
Protein accession | YP_001037207 |
Protein GI | 125973297 |
COG category | [S] Function unknown |
COG ID | [COG2340] Uncharacterized protein with SCP/PR1 domains |
TIGRFAM ID | |
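
The coordinate and length fields above agree with one another, which a little arithmetic confirms. The sketch below is only an illustration. It assumes the start and end coordinates are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon. The variable names are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 942509
end_bp = 943861

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not
# counted in the reported protein length.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # 1353 450
```

Under these assumptions the result matches the Gene Length (1353 bp) and Protein Length (450 aa) fields.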
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000247024 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAT TAATGCTAAA AAGGTTGGCT CTTTTTCTTA CTGTATTTAC GGTTTTGTTT TCAATCGGTG CTTTTGATAT TTATTCTTTA AGTGATATAA AAATATGCGT AAATGGTCAA TACTTACAAT TGGATGTGCC GCCTGTTATA GAAAACGGTA GGACCCTTGT GCCTGTTCGT GGGGTGTTTG AGTCTATCAA TGCAGATGTT GAATGGGTTC CGGAATCGAA AAAAGTAAAT GTTTATAAAA ATGACATTGT AATCACTCTT AAAATCAACA GTAACATAGC TTATATAAAT GACAATGCGG TGAGGCTTGA CGTACCGGCC AGGATTGTAG AAGGAAGGAC CCTTGTTCCT GTAAGATTTA TATCTGAAAG TATTGGCGCA AGTGTAGGAT GGGATGATGC TACAAGAACG GTTATTATCA ATACATATGA CGAAAATGTT GAGGACGAAG ATCCTGAAAC TCAGGAATTT GTATTTGACG GGAAGGTAAG CACGTTAATT GGGGCATCCC TAAGTGACGT AATAGAGACA TTTGGGGAAC CTGACCGTAT TGATTTGAGC AAATACGGTT TTGACTGGTA TATTTATAAT AATAATCTTC TGAAATATAT TCAAATAGGG ATAGAAGATG ACAAAGTGGT TGGAGTATTT ACCAACTCTC CTTACTACCG GCTCAATGAA GCCATAGGGG TTGGAACGGA TGGTATAAGT GCGGAAAAAG AACTGGGAAA ACCTTTGGAA TACATAAAAA AAGGAAATAC TTATTACATG ATGAAAAATT CGAATGAACA GAAGGTGTTC AATGTAAATG ATCAATATTT TGTGACTGTA TTTTTTGATA TCTTTGATGA GAACAAAGTT ACGTCGTACC TTTTGATTGA TTGTGAGACG GAAAGAGCCC TTCACGGTTA TTACGGCAAG CCTTCCGAAG AGCTTAGAAT AAGCTTTGAG AGGGAAGTTT TTGATCTTGC AAATACCGTC AGAGCAAGGT ATGGATTAAA ACCCTTTGAA TGGGATGATG AGATAGCAAA AGTTGCAAGG GCTCACAGTG AAGATATGGT TTTAAACAAT TACTTTTCCC ATACAAATTT GCAAGGGGAA AGTCCTTTTG ACAGAATGAA AAAAGCCGGA ATATCGTATT CATCTGCTGG AGAAAACATT GCAATGGGAC AAACGGAGGC GATATTTGCC CATGAGGGAT GGATGAATTC CCAGGGACAC AGGTTAAATA TACTTGGAAA CTTTGAGCGC CTTGGAGTCG GAGTTTACAT TGGTAATGAA AACGAAATTA CCTATACTCA AAATTTCTAT ACCCCTATGA GATTTAATAA ATTTTTCTAC TAG
|
Protein sequence | MKKLMLKRLA LFLTVFTVLF SIGAFDIYSL SDIKICVNGQ YLQLDVPPVI ENGRTLVPVR GVFESINADV EWVPESKKVN VYKNDIVITL KINSNIAYIN DNAVRLDVPA RIVEGRTLVP VRFISESIGA SVGWDDATRT VIINTYDENV EDEDPETQEF VFDGKVSTLI GASLSDVIET FGEPDRIDLS KYGFDWYIYN NNLLKYIQIG IEDDKVVGVF TNSPYYRLNE AIGVGTDGIS AEKELGKPLE YIKKGNTYYM MKNSNEQKVF NVNDQYFVTV FFDIFDENKV TSYLLIDCET ERALHGYYGK PSEELRISFE REVFDLANTV RARYGLKPFE WDDEIAKVAR AHSEDMVLNN YFSHTNLQGE SPFDRMKKAG ISYSSAGENI AMGQTEAIFA HEGWMNSQGH RLNILGNFER LGVGVYIGNE NEITYTQNFY TPMRFNKFFY
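
The gene and protein sequences above are printed in space-separated groups of ten characters. As a minimal sketch of how the two relate, the code below strips that grouping, translates the first 30 bases of the gene sequence, and computes a GC fraction. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. The codon map is built from the standard NCBI amino-acid ordering, which translation table 11 shares with the standard table for amino-acid assignments; start-codon handling is not modelled here.

```python
# First three 10-base groups of the gene sequence, copied from the record above.
gene_prefix = "ATGAAAAAAT TAATGCTAAA AAGGTTGGCT"

# Codon -> amino acid map in NCBI "TCAG" order ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the whitespace used for 10-character grouping and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a DNA string codon by codon, ignoring any trailing partial codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[p:p + 3]] for p in range(0, usable, 3))


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


peptide = translate(gene_prefix)
print(peptide)  # MKKLMLKRLA
print(round(gc_fraction(gene_prefix), 3))
```

The translated prefix matches the first ten residues of the protein sequence listed above. Applying `gc_fraction` to the full gene sequence would be the way to compare against the GC content field; the value for this 30-base prefix alone is not expected to match it.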