Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1835 |
Symbol | |
ID | 4809819 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2175342 |
End bp | 2178650 |
Gene Length | 3309 bp |
Protein Length | 1102 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 640107249 |
Product | viral A-type inclusion repeat-containing protein |
Protein accession | YP_001038249 |
Protein GI | 125974339 |
COG category | [D] Cell cycle control, cell division, chromosome partitioning |
COG ID | [COG1196] Chromosome segregation ATPases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAATTT TTTTGTGTGT TGTTTTGTTG ATAACGATTG CCATCCCCGC TATATTGGTT GAAGGAACGA GAATAAAGGC GGCACAGACA CAAATCGAAA GTGCCCTGGA TATATCGGCA AAGTCAGCTC TTGCCAATTA TAATTATTTG TTGAAAGAGC TATATGGTCT TATGGCTCTG TCCAGTGATG ATCCCGACTT GTTGATGGAG GAAATAATAT ATTATCTTGA AAGAAATTTG ATGGTTGAAG GTATTAAAGA ACACAAAACA AAAGCGGAAT CAACTCTTGA CACCATAAAT GAATCGGAAA CCGTTAAAAA TATAAAGAAG TTTTTGGGAG TGGACAAGAA AAAAGAGGAA AAATCTTTGG ATTTGTATGA TTATAAAATT GAAAATGTTA AAGTTCAACC AATATATAAC CTGTCGGAGC CGGAGGTTTT AAGAGCGCAG ATACTGGAAT ATATGAAATA CAGAGGTCCG AAGGAATTGT CTGAAGGACT TATGGATAAA TTTCTTGCTT TGAAGGATTT TAAGAAGCAG GCTGATATTT TAAAGGAGAA GCTTGATGTT GACAAGAATT TAACGGAGAT TAAGGAAAAT GAGGTTAAAG CCTCAGATAA TATGGTGACG GTCAATAAAT TTGCCACAGA TGCAAATATA CCAAATCAGT TGGATTTGGC TGCGCAATAT ATATTGCAAA AAGTTAAGTT GGAAAAAGCG ATAAAGGACA TGGAGAAAGA GATAGAAGAT AAGGAAGAAA AGTTGGAAGA AATAGAAGAA GAAATAGATG GATACAAGAA TGAAATCAAA GACCTCAAGA AGCAGATAGA GGAAAAGAAA AAAGAGGCTG AAGATGATGA GTCCGGTGAA GTGGATGTAT CAAACGAGGA AAACAGGATA AAAGAGATTG AAAGTTTGAT AAAGGATCTT GAGGATTCGA AGGATGAGAT TGAAGAAGAA ATAGACGAGC TGAAAGAAAA AATAAAAGCA AATAAAAAAG AATTGGAGGA TAAGAAGACA CTTATAAAGA ACGTTAACGA TAGATTAATA GGTTATATTG ACAAAACCAT AAAAGCTGTA GAGGAAGCAA GGAATGCACT TGAGACTGTA ATGAAAAAGT CGGTAGAGAC TGTGGCAAAG ATTGACCAAA TAAATGAGAA GCTGGAAGGA GAAACGAATG AATTTTCGAA CACCACAAGG CTTGACCTTG GAAGCAAGAA AGAAAGGATA TCGGCAGAGG ATTTGACTCC CAAGATTGCC GAGCTTAATC ACAACCTTGC TCTGCTGAAT GATATAAAGA ATTGTATAGA AAATGCGAGA ATGAAAGAAC TTGGTTTGAG TGATTTTGGG GATGCAATTC CGGATATAGA TGAAGTGCGT GCAAGGCTTA ATATTGAGAC GGTGAAGGCG AAAATAGCAG AGTATAAAGG TAAAGTCAAT GGTGATCCTA TTGATTATTA TGCAGACAAG GGTATAATTT CTGACAAACC CAAGGAAGGA GAAAAGGATC CGAGGGATTC CATAAGTGAT TTTACGAAAA AAGGCGGACC CGAGGATGAA AATCCGGTAA AGGAAAACAA GAAGACAATG CCGGAGGATG TACCGTCGAA AAATGGGTAT AAACGGACGG AACTGGAAGA AATAAAAAAT GACATGGAAC ATGTAAAGAA TATACTTGGT ATTATAGCTT CAGGAGGGAA GAGTGGAGGA ACTTCAACCG GAAAAGAGCT TGGCGCGGTG GATTTGAGTA AGACATCGTT CTCGAAGGAT AATACGGAGT TTTCAAAGAG CGGATTGGAT TTTCTTTCGG GTATATCAGA AATATTTATG GACGGTATCT ATAATTTGAG AGATGAAATA TATATAAATG AGTACATAAT GGGAAGCTTT GCGAATTACA CCACGGATTT GGAGAAAGAC CTGGATTTAA GAGGAAATTT GATGAAAAAC AGGCCGGTTT TCTTTGACGC GAACCATGCG GATGTAGAGT ATATTTTAGG GGGATTTGAA AGTGAAAAGG AGAATATAAA TGCAGTAAAG GCACAGATTA CTCTTGTTAG GTTTGCGTTG AATGTTATAG CTATATATAC GGATCCATAT AAGTTTAATA CTGCATTAGA GGTTGCTACC GTTGTGGCTG GGTGGACCGG CGGTGTAGGT GTACCAATTG TACATACATT GATTATGATG GCTTGGGCAA TGGCTGAGTC TTTGTTTGAT GTGTACCTGC TGTTGAAAGG GGATAGTGTA CCGATATTTA AAACGAGAAA TACGTGGATA ACGGATATTG ATGGGTTTTC TAAAATAATT ACAAATGAAA TTGTAGAAAA TACAAAAAAT ATAGCACGGA ATGCAGCTAA GCAAGTTATT GACTATACAG AAAATAAAGT AGAAGATTTT CTAAAAAGTG CAAGTATAAC TATTTCTGAT TACATTGATT CTAAAGTTGA CCTGTTGGTT GATAAAGCTT TTGCAAGTAT TGAAAATCCT TTAAAAGAAA ATATGTATTC GGCTGAAAAT ATTTTTAGCG ATTTTGAAGC AAGCGTATAC TTTAATGGTG AAGGTGAAAT AGGAGATATA ATGAATCAAA TTAGCAACGA AGTTCAAGTT TTGCTTCAAG AAGAAATGAA AATTGTTGAG AATAGTACTT TGGGTAGCTA TGTGCCTGAA ATTATTCTGG GAGTAGATTA TACAGATTTT TTGCAAGAAT ATAAAAATAG AAATCTTAAA GAGGTAATTG CTGAGTTGAC TGTTGAACTT GCGGAAGGAA AGGAGTTGCT TGGATATAAT GTAGATGAAA CTAGCAAAAA AATAAATAGC ATGATTTTTA ATACTATTCA AAAAGCAAAA AATGAAGTGA AACTTAATAT AAAAACAAAA ATTAGAGAGT ATAAGAAAGA TTTAATTGAA AAATTTGAAA AAAAATTTAA AGAAACAGCT GAGAAAGGAA AAGAAGAAGT TGATAAATTT ATTGATTCTA TAGGTAATAC AAGTGATTCA GAAGTAATGA AAACAAACCT TAAAGGGTCT TTTCTCTCCA TGAAATACGT TGATTACCTT AGGTTATTTC TACTTTTTAC AGACAAAGAT GTGAAAATAA AAAGAATAGC TGATTTGATT CAAGTGAATA TGAGAAATGT ATCTGGGAAT AAAACTTTTA AAATGTCAGA ATGTAGTACG TATATGCGTA TTGAATCTTC AGTTTCTATA AAATATTTGT TTGCGACAAA ACCTTTTATG CCAAAAGAGT TCAGAACCGA AGATGGAAAA AGAATAGAGT TAGATGTTGT CCTGTATAAA GGATATTAA
|
Protein sequence | MSIFLCVVLL ITIAIPAILV EGTRIKAAQT QIESALDISA KSALANYNYL LKELYGLMAL SSDDPDLLME EIIYYLERNL MVEGIKEHKT KAESTLDTIN ESETVKNIKK FLGVDKKKEE KSLDLYDYKI ENVKVQPIYN LSEPEVLRAQ ILEYMKYRGP KELSEGLMDK FLALKDFKKQ ADILKEKLDV DKNLTEIKEN EVKASDNMVT VNKFATDANI PNQLDLAAQY ILQKVKLEKA IKDMEKEIED KEEKLEEIEE EIDGYKNEIK DLKKQIEEKK KEAEDDESGE VDVSNEENRI KEIESLIKDL EDSKDEIEEE IDELKEKIKA NKKELEDKKT LIKNVNDRLI GYIDKTIKAV EEARNALETV MKKSVETVAK IDQINEKLEG ETNEFSNTTR LDLGSKKERI SAEDLTPKIA ELNHNLALLN DIKNCIENAR MKELGLSDFG DAIPDIDEVR ARLNIETVKA KIAEYKGKVN GDPIDYYADK GIISDKPKEG EKDPRDSISD FTKKGGPEDE NPVKENKKTM PEDVPSKNGY KRTELEEIKN DMEHVKNILG IIASGGKSGG TSTGKELGAV DLSKTSFSKD NTEFSKSGLD FLSGISEIFM DGIYNLRDEI YINEYIMGSF ANYTTDLEKD LDLRGNLMKN RPVFFDANHA DVEYILGGFE SEKENINAVK AQITLVRFAL NVIAIYTDPY KFNTALEVAT VVAGWTGGVG VPIVHTLIMM AWAMAESLFD VYLLLKGDSV PIFKTRNTWI TDIDGFSKII TNEIVENTKN IARNAAKQVI DYTENKVEDF LKSASITISD YIDSKVDLLV DKAFASIENP LKENMYSAEN IFSDFEASVY FNGEGEIGDI MNQISNEVQV LLQEEMKIVE NSTLGSYVPE IILGVDYTDF LQEYKNRNLK EVIAELTVEL AEGKELLGYN VDETSKKINS MIFNTIQKAK NEVKLNIKTK IREYKKDLIE KFEKKFKETA EKGKEEVDKF IDSIGNTSDS EVMKTNLKGS FLSMKYVDYL RLFLLFTDKD VKIKRIADLI QVNMRNVSGN KTFKMSECST YMRIESSVSI KYLFATKPFM PKEFRTEDGK RIELDVVLYK GY
|
| |