Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2476 |
Symbol | |
ID | 4809856 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 2948505 |
End bp | 2949914 |
Gene Length | 1410 bp |
Protein Length | 469 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640107891 |
Product | SPP1 family phage head morphogenesis protein |
Protein accession | YP_001038871 |
Protein GI | 125974961 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG5585] NAD+--asparagine ADP-ribosyltransferase |
TIGRFAM ID | [TIGR01641] phage putative head morphogenesis protein, SPP1 gp7 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAATAAAA AGGACATAAC CTACTGGGAA AAACGACAGG AACGGAAATA TCTGGCCGGA GAGAAGAAGC TTGATGAATA TTATAAAGGT TTGCAGAAAG CGTTTAGACA AGCAAAACGA GAAATCCAGA GTGTTATAAA TGATTTCTAC ATGCGATATG CAAAAGAAAA CAAAGTATCC TATGCTGAAG CCCAAAAACT ACTTGATAAG GCAGAAATAG GCGAGCTGCA GGACTTTATA GACCTTGTTA ATAAGAATAT GGGCAAGTAT AATCGAAAGC TTAACAATAT GTCTATAAAA GCCAGAATTA CCCGCTATCA AGCGCTAGAA AAGCAGATAG ATGCTATACT ACAGCAATTA TATGCTATTG AGTATGAGTA TAAAGGTAAA GAGCTACTGA AGGAAGTATA TGAGGATTCT TATTATCGTA CCTGGTTTAA CATAGACCAG TACCACGGCT TTCATCAGGA GTTCGCACAG ATTAATCCTA GAACTATAGA AGAGTTGATA AAATATCCTT GGAATGGAGC AAGTTTTTCT GATAGGATAT GGAAGCAAAA AGACCATATG CTGCAGGTAT TAAAAGAAGA CATTACTACT ATGTTAATAC AAGGGAAAAA TCCTCAAACA TTAGCAAGAG ATTTCGCAAG AAGGTTTAAA ACAAAAGAAT ATGAAGCATA TAGGCTGCTA CATACAGAGA GCAGTTTTAT TATCGAACAG GGAACTTTAG CAGCATATAA AGAAGATGGG GTGGAGAAGT ATCAGATTCT GGCTACTCTG GACATGAGGA CATCGGATAT ATGCAGAAGT GAGGATGGGA AAATATATGA TGTGGATGAG GCGACAGTGG GAGTAAATTA TCCTCCATAT CATCCATTTT GTAGGACCAC AACAGTGCCA TATTATGAGG ATGCTGAGGT AGGTACAAGG GTTGCGCGTG ATCCGGTAAC AGGTAGAAGT TATGAAGTTC CAGCGAATAT GACATATGAG CAATGGAAAA ATAGATATAT AGATCAACCT GACAATATTA TTCGCCAAGA GATACTGAGT AATCCTGAAA GACTTGATAA TTATAGTATC CAACATTATA ATAAGCATAA AGAAGGAACC AAACAATATG AGCAGTATAA GCAATCAAGA CTTAAAAAAG GTCAAACTGA ACAAAGCAGT TTACTAATTT CTTACGATGA AGCTAAAGAA ATAATAAAAA AATATGCTGG TACTGGAGTA TTTAGTAGAG ACAGGAAAGG GAAATGGAGA AATGAGGAAT TTGTGGATGT AGATTCTATA ATTGGTGTTG TGCATAATAT TGATGGGACA GTAACGCCTA CTAATAGAAT TCAAATAAAA TATGGGAAGA ACAGCGTGCA CATTGTACCT GTATTACCAA GAAAGGAGAG AAATAAATGA
|
Protein sequence | MNKKDITYWE KRQERKYLAG EKKLDEYYKG LQKAFRQAKR EIQSVINDFY MRYAKENKVS YAEAQKLLDK AEIGELQDFI DLVNKNMGKY NRKLNNMSIK ARITRYQALE KQIDAILQQL YAIEYEYKGK ELLKEVYEDS YYRTWFNIDQ YHGFHQEFAQ INPRTIEELI KYPWNGASFS DRIWKQKDHM LQVLKEDITT MLIQGKNPQT LARDFARRFK TKEYEAYRLL HTESSFIIEQ GTLAAYKEDG VEKYQILATL DMRTSDICRS EDGKIYDVDE ATVGVNYPPY HPFCRTTTVP YYEDAEVGTR VARDPVTGRS YEVPANMTYE QWKNRYIDQP DNIIRQEILS NPERLDNYSI QHYNKHKEGT KQYEQYKQSR LKKGQTEQSS LLISYDEAKE IIKKYAGTGV FSRDRKGKWR NEEFVDVDSI IGVVHNIDGT VTPTNRIQIK YGKNSVHIVP VLPRKERNK
|
| |