Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1630 |
Symbol | |
ID | 4809325 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1956610 |
End bp | 1957932 |
Gene Length | 1323 bp |
Protein Length | 440 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640107046 |
Product | HK97 family phage portal protein |
Protein accession | YP_001038047 |
Protein GI | 125974137 |
COG category | [S] Function unknown |
COG ID | [COG4695] Phage-related protein |
TIGRFAM ID | [TIGR01537] phage portal protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.506997 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAATAT TTTCCCGCTT GTTCAAAGCA AGGGACAAGC CGAAAAACAG CCTGTTCGGT AATGCATATA GCTTTTTCTT CGGCGGCACA TCCAGCGGAA AAGCTGTCAA TGAGCGGACT GCTATGCAGA CAACTGCAGT GTATGCCTGT GTAAGGATAC TTGCAGAAGC CATCGCCGGG CTTCCGCTTC ATGTATACCG ATACAAGGAA GACGGTGGCA AAGAAAAAGC GTTGACCCAC CCGCTCTATT ATTTGCTCCA TGACGAACCA AACCCTGAGA TGACTTCATT TGTGTTCCGA GAAACACTGA TGAGTCATCT TCTTTTATGG GGAAATGCTT ATGCCCAGAT TGTCAGGGAC GGTTCCGGGC GAGTGCTGGC GCTTTATCCC CTTTTGCCAA ACAAAATGAC GGTAGACAGG GCTCCAAACG GAGAGCTGTA TTACACTTAT CGGCGCGACA GCGATGAGAG CAGGGTTAAT CCAAAAGCAG GCCTTATATA CCTACGAAGT GATGAGGTTC TTCACATCCC GGGACTCGGT TTTGACGGAC TGATCGGATA CTCCCCTATT GCTATGGCCA AGAATGCCAT AGGCATGGCT ATTGCCTGTG AGGAGTATGG TGCATCCTTT TTTGCCAACG GAGCAAATCC GGGTGGCGTT CTGGAACATC CCGGCGTATT AAAGGATCCG GCAAAGGTGC GTGAAAGCTG GAACGCTGTT TATCAAGGAA GTGCCAATGC TCACCGTATT GCAGTCCTGG AAGAGGGAAT GAAGTTTCAG CCAATCGGCA TTCCACCCGA ACAGGCACAG TTTTTGGAGA CAAGAAAGTT CCAGATAAAC GAAATTGCCC GGATATTCCG AGTACCTCCC CATATGGTTG GAGATCTTGA AAAGTCAAGC TTTTCAAACA TCGAACAGCA ATCTCTGGAA TTTGTTAAAT ACACGCTTGA CCCGTGGGTG GTGCGTTGGG AACAGGCTCT CCAAAAAGCG CTGCTTTTAC CATCAGAGAA GCGGGCATAC TTTGTCAAAT TCAATGTAGA TGGCCTTCTG CGCGGTGATT ATGCAAGCCG CATGAATGGT TATGCTGTAG CTCGCCAGAA CGGCTGGATG TCTGCTAACG ATATCCGCGA GCTTGAGGAC ATGAACCGGA TTCCGGCGGA GTTGGGCGGA GATCTGTATC TTGTTAACGG TAACATGACC AGGCTTGCCG ATGCAGGTAC ATTTGCAGGC AAAAACAATG CTGAAACGGA GGGATCAAAA GTTGAACAAA TCACAAAAAC AAAAACCGGT TCGCCGCTTC TGGAACTGGA TACAAAACGA TGA
|
Protein sequence | MRIFSRLFKA RDKPKNSLFG NAYSFFFGGT SSGKAVNERT AMQTTAVYAC VRILAEAIAG LPLHVYRYKE DGGKEKALTH PLYYLLHDEP NPEMTSFVFR ETLMSHLLLW GNAYAQIVRD GSGRVLALYP LLPNKMTVDR APNGELYYTY RRDSDESRVN PKAGLIYLRS DEVLHIPGLG FDGLIGYSPI AMAKNAIGMA IACEEYGASF FANGANPGGV LEHPGVLKDP AKVRESWNAV YQGSANAHRI AVLEEGMKFQ PIGIPPEQAQ FLETRKFQIN EIARIFRVPP HMVGDLEKSS FSNIEQQSLE FVKYTLDPWV VRWEQALQKA LLLPSEKRAY FVKFNVDGLL RGDYASRMNG YAVARQNGWM SANDIRELED MNRIPAELGG DLYLVNGNMT RLADAGTFAG KNNAETEGSK VEQITKTKTG SPLLELDTKR
|
| |