Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1100 |
Symbol | |
ID | 4600967 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1037629 |
End bp | 1039233 |
Gene Length | 1605 bp |
Protein Length | 534 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 639773877 |
Product | PTS system mannose/fructose/sorbose family IID component |
Protein accession | YP_920502 |
Protein GI | 119720007 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3715] Phosphotransferase system, mannose/fructose/N-acetylgalactosamine-specific component IIC [COG3716] Phosphotransferase system, mannose/fructose/N-acetylgalactosamine-specific component IID |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.220553 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGACCA TAGGCCCACT CGAGGTAGGG CTTCTAGGAC TGCTAGCCTT CATCTTCGGA CTGGACTACA TCTGGGTAAC ACCTCTCGGA ATCTGGCGCC CCGTGGTCGC GGGGACCCTC ACAGGCATTA TTCTAGGAGA CCCCTTGACA GGGCTACTCG TAGGCTCGCT ACTAGAGTTC GTGTTCGCTG GGCTATTCAC GATAGGAGGC GGGACCGTTC CGGAGGCTGC TAGCGGGACG ATAGCCTCCG TGGTTGTCGC CGTTACTACG GGGTTAAAGC CTGAGGCGGC GGTACCGCTG GCTATACCCG TAGCAGTGCT TACAATGAAC TTGGAGATAG TTGTCAGGTC TTTCGACGCG GTGTTCACGC ACTGGGCTGA CAGGGAGATA GAGAGGGGGA ACTACGGGGC AATCCCCCTG ATAAACATTC TCGGCGCGGT ACCATGGGGG CTCAGCAGGG CAATACCTAT CTGGCTATTC GCGGGAGCCT TAGCCATAAA CCCACAGGCC GTTAAAGCGG CGATAGATGC CCTCCAAGTG GTTCAAATCG GACCCTTCAC GGTGCGCTTC TGGGACGCGA TGGCAGTCGC GGGCGCCGTA CTCCCAGCGC TCGGTGCCGC CATTCTAATG AAACTCATGA TCTCGCGTAG GAACGTGATG TTCTTTGTAC TCGGCTTCGC TCTCGCAGCC TACCTGAAGC TAAGCCTCCT GGCGATAGCC CTCGTAGCGG GATCCATAAT CTTCGCTATC TACTACTTCA CCCACCGCGA GGCATTGGAG GCTGGAGCCG CGGTAACCAC AGCGGCACCA CCCACCGGCA AGGCAACGAC GAAGGACTTC ATAAGGTGGT TCGGGGTCTC ATGGTTCATA CAGTCGTCCT GGAACTACGA GAGAATGATG GGGACAGGCT TCGCGCACGG TATGCTTGAA ATAGAGAAGA AGCTTAGAAA GGACCCGGAG GAGCTGAAGT CCTGGATGAG GCTACACAAC GAGTTCTACA ACACCGAGCC CCACCTCCAC AACGCCATTT ACGGGATGGT GATATCCCTA GAGGAACAGG GGGCGGATCA GGATACGATA AGAGGAGTCA AAACAGCGCT TATGGGTCCA TTCGCAGGGC TCGGAGACTC GATAATGTGG TTCACGATCC TCCCGATAGC GTTCCTCTTA GGAGCCTCGC TGGGAGCCCA GGGCAACATA CTCGGCCCGG TAATAGCGCT ACTGATATGG ATACCAGTCT CCTGGGCCGT TAAGTACTAC ACGCTCGTCT ACGGGTACAA GTACGGCTTA TCCCTGGCGG AAATACTCAA GGGAGAAGTC CTGAAGATAT TTAGGGAGGG TATAGCAGCC TTCGCGATGG CAATGGTCGG AGGAATCGCG GCGACATACG TCAGGGCGAC AACCCCGATA GTACTGGCCC AGTACGCTGG TCACGCCATT AAGCTACAAC CAGTACTGGA CCAGTTGATG CCATCCCTGC TCCCACTGCT CTTCACCCTC TACGCCTACT GGCTAATAAA GGTCAAAGGC TACAGCTACG GTAAAGCCGT CGTCATACTC TTCCTCACGG CATTCATACT CGCACTGCTA GGAGTACTCG GTTAA
|
Protein sequence | MATIGPLEVG LLGLLAFIFG LDYIWVTPLG IWRPVVAGTL TGIILGDPLT GLLVGSLLEF VFAGLFTIGG GTVPEAASGT IASVVVAVTT GLKPEAAVPL AIPVAVLTMN LEIVVRSFDA VFTHWADREI ERGNYGAIPL INILGAVPWG LSRAIPIWLF AGALAINPQA VKAAIDALQV VQIGPFTVRF WDAMAVAGAV LPALGAAILM KLMISRRNVM FFVLGFALAA YLKLSLLAIA LVAGSIIFAI YYFTHREALE AGAAVTTAAP PTGKATTKDF IRWFGVSWFI QSSWNYERMM GTGFAHGMLE IEKKLRKDPE ELKSWMRLHN EFYNTEPHLH NAIYGMVISL EEQGADQDTI RGVKTALMGP FAGLGDSIMW FTILPIAFLL GASLGAQGNI LGPVIALLIW IPVSWAVKYY TLVYGYKYGL SLAEILKGEV LKIFREGIAA FAMAMVGGIA ATYVRATTPI VLAQYAGHAI KLQPVLDQLM PSLLPLLFTL YAYWLIKVKG YSYGKAVVIL FLTAFILALL GVLG
|
| |