Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1197 |
Symbol | |
ID | 4810149 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1426715 |
End bp | 1428130 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640106619 |
Product | hypothetical protein |
Protein accession | YP_001037622 |
Protein GI | 125973712 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01445] intein N-terminal splicing region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00000101518 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCTGCCG CAGTAGTTAT CACAGGATTG GGAATAGCAG CGGCGCTGAC AGGAGGAGTA TTGGGAGTTA TACTGGCAGG AGCGTTCTGG GGAGCATTGG CCGGAGGCTT GATAGGAGGA GCAGTCGGAG GAATAGCAGC GGCGATAAAT GGAGGTTCGT TCCTAGAAGG ATTTGCGGAT GGCGCTTTAA GCGGAGCGGT TTCCGGAGCG GTGACAGGAG CGGCATGTGC CGGGCTTGGT GCTTTGGGAG CAGCGGCAGG GAAAGGCATC CAATGCATGA GCACTGTAGG AAAAGCAATA AATGTTACTT CGAAAGTAAC TGCAGCCCTT TCGTTGGGTA TGGACGGATT TGACATGCTG GCAATGGGAG TATCGCTGTT TGATCCATCC AACGCATTGG TTGAATTTAA CCAGAAGCTG CATTCCAATG CACTTTACAA TGGATTCCAG ATTGCAGTAA ACGCGCTGGC TGTTTTCAGT GCCGGGGCGG CATCTACAAT GAAGTGCTTT GTTGCGGGTA CGATGGTATT GACAGCGGCA GGCTTGGTTG CGATAGAGAA TATCAAGGTA GGAGATAAGG TAATTGCGGC GAATCCGGAG ACTTTTGAAG TAGCCGAGAA GACAGTGCTT GAGACATATG TGAGAGAGAC AACGGAGCTT TTGCATTTGA GAATTGGAGG CGAAGTAATC AAAACAACCG TTGACCATCC ATTTTATGTA AAAGATGTTG GCTTTGTTGA AGCGGTGAAT CTGCAAGTCG GAGACAAGTT GGTTGATTCA AAAGGCAATG TTTTGGTGGT AGAAGAGAAA AAGCTCAAAA TAACTGGTAA ACCTGTGAAA GTTTACAACT TTAAAGTTGA TGACTTTCAT ACTTATCATG TTGGGAATAA AGGGATATTG GTACATAATG CGAATTATAA TCCTAAAACT ACCTTTGAAA ATCTGGATTT GGAAACCGCC AGTAACAAGC AAAAGGGTAA TTATGGAGAA TATCGAGCGG ATGATAATCT TATAAATAAT CCAAAATTGA AGGAAGTAGG ATATGATTTG GAACAGATAG GAGGGAAAGT TCCGACATCA CCGGATGATA AAATCACAAA AGGGATAGAT GGTATATATG TAAACAAGAA TCCTAATTCA AATATTAAAT ATGTGATTGA TGAGTCAAAG TTTAATACTG CACAATTGGG GAAAACGAAA AAAGGCATAA AGCAAATGTC GGATGAGTGG CTCCGTGAGA AACAAGGTAA AAGAATTTTA CAAGCAGTTA ATGGTGATAG AAGACTGAAA GATGATATAA TAGAAGCATT AAACAACGGT GCAGTAGAAA AAGTTTTATC ACGAGTTGGC AAGGATGGAA AAGTAACGAC GTATAGGTTA AACAGCAATG GTGAAATAAT TGGATTCTGG CCATAA
|
Protein sequence | MAAAVVITGL GIAAALTGGV LGVILAGAFW GALAGGLIGG AVGGIAAAIN GGSFLEGFAD GALSGAVSGA VTGAACAGLG ALGAAAGKGI QCMSTVGKAI NVTSKVTAAL SLGMDGFDML AMGVSLFDPS NALVEFNQKL HSNALYNGFQ IAVNALAVFS AGAASTMKCF VAGTMVLTAA GLVAIENIKV GDKVIAANPE TFEVAEKTVL ETYVRETTEL LHLRIGGEVI KTTVDHPFYV KDVGFVEAVN LQVGDKLVDS KGNVLVVEEK KLKITGKPVK VYNFKVDDFH TYHVGNKGIL VHNANYNPKT TFENLDLETA SNKQKGNYGE YRADDNLINN PKLKEVGYDL EQIGGKVPTS PDDKITKGID GIYVNKNPNS NIKYVIDESK FNTAQLGKTK KGIKQMSDEW LREKQGKRIL QAVNGDRRLK DDIIEALNNG AVEKVLSRVG KDGKVTTYRL NSNGEIIGFW P
|
| |