Gene Information |
Locus tag | Cthe_1832 |
Symbol | |
ID | 4809816 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2172278 |
End bp | 2173807 |
Gene Length | 1530 bp |
Protein Length | 509 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 640107246 |
Product | hypothetical protein |
Protein accession | YP_001038246 |
Protein GI | 125974336 |
COG category | |
COG ID | |
TIGRFAM ID | |
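The coordinate and length fields above are tied together by simple arithmetic, which can be checked with a minimal sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in Protein Length; neither convention is stated on this page, although both are consistent with the listed values. The function names are hypothetical.

```python
# Values copied from the Gene Information table above.
START_BP = 2172278
END_BP = 2173807
GENE_LENGTH_BP = 1530
PROTEIN_LENGTH_AA = 509


def span_length(start, end):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Codon count minus one stop codon (assumes a complete, unspliced CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the table: 2173807 - 2172278 + 1 = 1530 bp, and 1530 / 3 - 1 = 509 aa.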
| |
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAA TATTATGTTG TATCATATCG TTATTTGTTT TAGCATCGTA TTTGAGTACA TATACATATG CAATAAGCTA TAATAGTGCA AAAGAAGCAA TAGATGATGC TAATAATTTT TTATTAGAAA AAATGGGCTA TGAGAATTAC TATTCATTAG AAGTAAATGG TATGAATATA AATGATAAAC TTGCACAATA TGGTTTGGAT GTGTTTTCAA ACAGGCCAGT ATTTGTATAC GGTGATAATG TAGAAGCTAG TAAAAAGACA ACGACAGCAG GGCGAGATAT GGTTAAAAAA GTTAATGGTA AGGACGAATA CCGTGCTTTA GGGTATGCAG TGGATGGTTC TGTTTTTCCA AATCCCTCTT TTCCATATGA TAATGAGGGT CATGCTGCAA AAGATAAGAT GTGGGTTAAA GAACCATGGA ATGGTTCAAA AGTAAAATAT CTATATAGCG AGAATGGAAA TATTGTAAAG AGAACGTTAA CGGATAATGC TTTTCAGTAT ATAGAAAAAT GGATCAAGTT TACCAGTTTT AAACCTCATG AAGTTGAAGC TTGTACAGGT AAGAAAAACT ATTTTGTACA AAATGCAGTT GATGTACCGG AAGGATTAAA GGAAAATTTT GAAGACTTTT TATATATAAT ACAACCTCCA ACAGAACATG CGTGGGGACT CGGTATAGCA TTTTACTACT GGAATGGATT TAATAATCTC AACTATAGAT CTTTTCTCAT TAGGCCGTTT GATATGAATG ATGATTTGGA TGTTAGTTTC CATGTAATAC CAGATAGTTC AACCGAAGGC AACGAAGTAT TGGTTGGTGT AAAAGTTAAA TCTCACTTCG ATACAGACTT AGAAGGAGTT AAATTTAGGT GGAGTATTAC TACAAAAAAC AGCGATGGTC AAGATGTTCC GTTGGATGCT GATGCTTATG AACTTGAATT TGGAGGTTCG TCAACCAGTC AGAGCGGGAC TATAAATATA TCAGCAGAAG ACAAGGAAGC ATGTTTATAT GCTGGGTTTA GAATGCCCAA TACTGATGTA TATATAGAAT TTGCAATTAA TGAAGACGGA GAAAATCCTT TAGAAAATGA TTTGAAAAAT AATATTGTTT CTACAGTGGT TAAAGCCGAA AAGCCTATAA ATTCTACTCT TAGGAAATTT GATTTACCAT ACTATGCATT ATCGAGAGAA ATAAGCTATC CATTAGCTGA TTCAGATATT GTATTTAATC TAAACAATAT TAATGGTGAT TGGCTGGATG GAAGTGCCAG AATAGATAAA TTGAATGTTA ATGTAAATGC AGGATTTTTG CATAATTATC AAGTTGGAAG TTCGAGGATT GAAGATAACG AAAATACAAT AACTGTAAGT TTGCCAAGCG TGAAAGCAAA GGTCGAAAGG AAAGATTTTG GAGATAATCC TGGAGAGAAG AAGTGGTTGG TCAGCAACAA CACTGTTGAT GTAATAAAAA GAATTCTAGA TACATCTTAT TATCTTTCTG TATCTAAAAA ATATAGATAA
|
Protein sequence | MKKILCCIIS LFVLASYLST YTYAISYNSA KEAIDDANNF LLEKMGYENY YSLEVNGMNI NDKLAQYGLD VFSNRPVFVY GDNVEASKKT TTAGRDMVKK VNGKDEYRAL GYAVDGSVFP NPSFPYDNEG HAAKDKMWVK EPWNGSKVKY LYSENGNIVK RTLTDNAFQY IEKWIKFTSF KPHEVEACTG KKNYFVQNAV DVPEGLKENF EDFLYIIQPP TEHAWGLGIA FYYWNGFNNL NYRSFLIRPF DMNDDLDVSF HVIPDSSTEG NEVLVGVKVK SHFDTDLEGV KFRWSITTKN SDGQDVPLDA DAYELEFGGS STSQSGTINI SAEDKEACLY AGFRMPNTDV YIEFAINEDG ENPLENDLKN NIVSTVVKAE KPINSTLRKF DLPYYALSRE ISYPLADSDI VFNLNNINGD WLDGSARIDK LNVNVNAGFL HNYQVGSSRI EDNENTITVS LPSVKAKVER KDFGDNPGEK KWLVSNNTVD VIKRILDTSY YLSVSKKYR
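The sequences above are printed in space-separated blocks of 10 residues. A minimal sketch of how a GC fraction could be computed from text in that layout is shown here; the function name is hypothetical, and the exact method and rounding behind the GC content value in the Gene Information table are not stated on this page, so the result is only a value to compare against it.

```python
def gc_fraction(sequence):
    """Fraction of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


if __name__ == "__main__":
    # First two 10-base blocks of the gene sequence above, as a short demo.
    demo = "ATGAAAAAAA TATTATGTTG"
    print(gc_fraction(demo))
```

The demo string contains 3 G or C bases out of 20. Pasting the full Gene sequence field into the same function gives a figure for the whole gene.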