Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2620 |
Symbol | |
ID | 4809042 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3098687 |
End bp | 3099466 |
Gene Length | 780 bp |
Protein Length | 259 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640108034 |
Product | protein of unknown function DUF1078-like protein |
Protein accession | YP_001039013 |
Protein GI | 125975103 |
COG category | [N] Cell motility |
COG ID | [COG4786] Flagellar basal body rod protein |
TIGRFAM ID | [TIGR03506] fagellar hook-basal body proteins |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000000392095 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCAGAG GTCTTTACAC GTCAGGTTGG AGCATGCTTG CCAACAGCAA GCAAATGGAT GTTGTATCCA ATAATCTTGC AAATGTCAAT ACCACTGCAT TCAAAAAAGA TACGTTGATA TTGGAAAGCT TTCCTGATAT GCTGGTAAGA AGAATCAATG ACAGCCGCAG CGCAAGCAAT CCTTCAGGAA GGTTGGGGAA TGTGCAGCTT GGAAATGATG TGGGAGAGGT TTTTACATAC TATACCCAGG GGCAGCTCCA GAAGACGGAC AACAATTTGG ACATGGCTAT TTCAAATTGT GACACTGCCT TCTTTACCGT TGCGGTGCCC GGCGGGAACG AAGAGTATAC AGAGTACTAT ACAAGGGACG GTTCTTTTGC ACTGAATGCA TACGGGCAAC TGGTGACAAA GGAAGGTTAC CTGGTAATGG GCCAAAACGG TGTCATTACC CTGAACTCGG AGAATTTCAG CGTTAGCGAT GACGGTACGA TTATTCAGGA CGGCGAGGCT GTTGCAAGAC TTTTAATAAG GAATTTTACC GATACTACAA CTCTCAGGAA AGTGGGCTCA AACCTTGTGC AAAGAACTGA AAATACCCAG GAACAGCCCT TTGACGGAGT GGTAAGGCAG GGCTATCTTG AACAGTCAAA TGTTAATTCC ATTAATGAAA TGATTAATAT GATTACAATA ATGCGTTCTT ATGAGGCAAA TCAAAAAATC CTTCAGGCCC AGGACGGAAC GCTTGAAAAA GCGGTAAACG AAATAGGAGT TGTAAGATAG
|
Protein sequence | MIRGLYTSGW SMLANSKQMD VVSNNLANVN TTAFKKDTLI LESFPDMLVR RINDSRSASN PSGRLGNVQL GNDVGEVFTY YTQGQLQKTD NNLDMAISNC DTAFFTVAVP GGNEEYTEYY TRDGSFALNA YGQLVTKEGY LVMGQNGVIT LNSENFSVSD DGTIIQDGEA VARLLIRNFT DTTTLRKVGS NLVQRTENTQ EQPFDGVVRQ GYLEQSNVNS INEMINMITI MRSYEANQKI LQAQDGTLEK AVNEIGVVR
|
| |