Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2488 |
Symbol | |
ID | 4809426 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 2957545 |
End bp | 2959380 |
Gene Length | 1836 bp |
Protein Length | 611 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640107903 |
Product | phage tape measure protein |
Protein accession | YP_001038883 |
Protein GI | 125974973 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02675] tape measure domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAACTT TAAAAGCAGT TATGGCTTTA TTGACTGGAG GTTATACATC AGGAATCAAT AAAGTCATTA AAAATACAGA TAAAGCAACA GATAAAATTT TAAAAGCTAG CGGTGCCACA GATGAATTTA ATAAAAAATT AGAAGTCACT GGCGCAAGTG CTAATACTGC AAGTGGTGGA TTGGGGAAAT TACTTAAAAC TTTTATAAGT TTAGCAGCGA TAAAAAAAGG AATAGATATT ACAGACGAAT ATAGTAATAT AGCTGCTAGA CTCGCACTTA TTAATGACGG CTTGCAAACA CAAGAAGAAT TGCAAAATAA AATCTTTGCA GCTGCTAATC GGTCTCGCGG TGTATACTCA GATATGGCCA GTGCAGTGGC CAAAATGGGG CTGCTAGCCA AGGACGCTTT TACCTCCAAT GATGAACTAA TTGCCTTTAC AGAGCTTGTA CAAAAATCAT TTAAAATTAG CGGAGCTGAC CCATCTGAAC AGGCAGGAGC AATGAGACAA TTAGCTCAAG CGATGGCTTC TGGTAGGCTT CAAGGTGATG AATTAGTATC AATAATGGAA AATGCTCCAA TGATATATGA GGCAATAGCA AAATATATGG GAAAGACAAA AGGAGAACTT AAAAAATTAT CTTCTGAAGG AGCTATAACG GCCGACATAA TCAAAAATGC CGTATTTGCC GCAGCGAAAA ACATCAACAC CAAGTTTGCA GAGATGCCAA TGACTTTCGG AGACATATGG AACAGGATTA AGAATGGTGC ACTTAAGGCC TTTGATAAAG TTATTGTAAA GGTAAATCAG CTTATTAATG CTGACAAGTT CCAGCGATTT GTAGACAGAA TGATTACTGG TTTTAGTCTT GCAGCATCTG CGGCAAGCTG GTTAATCGAT GCTATAATTA GAGGTTGGGA TACGATAGGG CCAATACTTG CAGTTATTGC TGGCATATGG CTTGTTTCTA TAATTGGAAA ACTGTGGGCA ATGATACCAC CACTGATTGC GCAAGCAGCA GCATGGTTAA GTGTATATTG GCCTATACTA CTGGTAATTG CTATTATAGG AATAGCAATA TCTGCAGCAA GACAGCTGGG AGCAACATGG GATGAGATTA TAGGATTTAT TGGAGGGCTA ATCGGTGTTT TTGCCACAAC TTTCTATAAC TATTTCGTCA TGATCTGGAA TCACATAGCC GCTTTCGTGA ATTTCTTCGG CAACGTATTC AAAAACCCAG TAGCTGCTGT ACAAGCGCTG TTTTTTGATC TAGCATCTAA CTTGCTTGGG TATATCGAAA AAGTGGCCCG GGGAATTGAA GATTTGCTGA ACAAGATCCC GGGCGTGAAC GTAAATATCG CCGGAGCCAT CACAAAACTG AGAGACAAAC TAAAAGCGGC ATCAACGCAG ATAAAAACCG AAGCCGACCT GAAAACCTAT GTTCAATCCA AAGAATTCAT GGATTTCTCT GAAGGTTGGA CGAAAGGCAG CACCATGGGG AAAAATCTTG TAGACAAGGT AAGCAACGCA TTGTCAGGGC TGACTGATAT AGGCAAAAGT TTTGACATGG GGCAATTCGG TACAAGTCAA AACCCGCTAT ATGTCACATC TAACGATAAG CTTAAGGTGG ACATGTCGGA TGAAGACTTG AAGTATTTGC GAGATATCGC AGAAAGAGAA TACATTGCCA AATTCAGCAC CGCAACGCTT GCACCTAACA TCAGTATATC CTTTGGAGAT GTACACGAAA CAGCGGATGC CAATAAGATA GCGGGAAGAA TTAGAAAAAT ACTCCAGGAA GAAATCGCTA TGGCGGCAGA GGGGGCATAT GCATGA
|
Protein sequence | MATLKAVMAL LTGGYTSGIN KVIKNTDKAT DKILKASGAT DEFNKKLEVT GASANTASGG LGKLLKTFIS LAAIKKGIDI TDEYSNIAAR LALINDGLQT QEELQNKIFA AANRSRGVYS DMASAVAKMG LLAKDAFTSN DELIAFTELV QKSFKISGAD PSEQAGAMRQ LAQAMASGRL QGDELVSIME NAPMIYEAIA KYMGKTKGEL KKLSSEGAIT ADIIKNAVFA AAKNINTKFA EMPMTFGDIW NRIKNGALKA FDKVIVKVNQ LINADKFQRF VDRMITGFSL AASAASWLID AIIRGWDTIG PILAVIAGIW LVSIIGKLWA MIPPLIAQAA AWLSVYWPIL LVIAIIGIAI SAARQLGATW DEIIGFIGGL IGVFATTFYN YFVMIWNHIA AFVNFFGNVF KNPVAAVQAL FFDLASNLLG YIEKVARGIE DLLNKIPGVN VNIAGAITKL RDKLKAASTQ IKTEADLKTY VQSKEFMDFS EGWTKGSTMG KNLVDKVSNA LSGLTDIGKS FDMGQFGTSQ NPLYVTSNDK LKVDMSDEDL KYLRDIAERE YIAKFSTATL APNISISFGD VHETADANKI AGRIRKILQE EIAMAAEGAY A
|
| |