Gene Information |
Locus tag | Tpen_1676 |
Symbol | |
ID | 4600928 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1622391 |
End bp | 1624301 |
Gene Length | 1911 bp |
Protein Length | 636 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 639774449 |
Product | extracellular solute-binding protein |
Protein accession | YP_921074 |
Protein GI | 119720579 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
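The Start bp, End bp, Gene Length, and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(1622391, 1624301)
print(length_bp, protein_length(length_bp))  # prints: 1911 636
```

Under these assumptions the computed values agree with the listed Gene Length (1911 bp) and Protein Length (636 aa).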
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.24706 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGG CAAAAGTTGC ACCAATACTG GTACTAGTAG TTCTCGCGAC AATCGCCCCT GTTTTCGCAG CGCCCGAGAC TATACCCAGG GAGGAAGCCA TCTACGTCGG CGGCGGAATG TGGTCTCCAC CGAACAACTG GAACCCCCTA ATACCGTGGT CCGCTGTAAC GGGTACAATA GGGCTTGTCT ACGAAACATT GTATCTATAT AACCCTGCAA ACGGAACTTT CATTCCTTGG ATAGCCGATG GTCAACCCAG CTTCCAGGTA AGCGGTAATA CAGTGAGAAT AACAATTAAG CTAAAAGAGG CAAGATGGCA GGACGGGCAA CCGCTTACAA GCGAAGATGT TGCGTACACG TTCTACGAGT TTCCGAAGAA AAACACAGCC GTCTATTATT CTAGCATTCT AAACTACCTG CTGTCCGTTG AAACACCCGA CACCAGAACC GTAGTTTTTG TCTGCAATGC CACAACCGTA AACTACCCGC AGATCTACGA TTTCCTGAGA TCAGTCGCTA TAATCCCGAA ACATGTATGG GTTAACAAGG AAAAACCATT GGAAGATGCA AACTGGCCGC CCATAGGCTC CGGGATGTAC AAGGCTAGTA GCTACACGTC TGACCGCATG ATATGGGTTC GCGACGATAA CTGGTGGGGC ACGAAGTACT TCGGAACACC CGGTCCAAGG TATATCGTAT ACGTCATGGT ATCGAGTAAC GCTGTTGCTC TAGCGATGCT GGCTCGAGGA GAGCTGGACT GGAGCAACTA CTTCCTGCCA GGCTTCTCCA GTCTTGTACA ACAATACCCG TTCTTAGTAA CCTGGTATAA TAAGTCACCG TGGAACCTTC CTGCAAACGT TGCATTCCTC TTTGTGAACA CGAAAAAAAC GCCCATGGAT AACCCTACTT TCCGCAAAGC TCTCTACTAC GCTATAGACG TGGATAAAAT AATCAACACA GTATTTGAGG GGGGCGTTAT CAAAGCCTTA CCTATAGGCA TACTCGACAT ACCTGGATAC AAGCCTTTCA TCGACACAGA ACTTATATCT AGGTATGGGT ATAAGTACGA CCCCGAAAAA GCAAAGCAAT TGCTTGACAG CATAGGGATA AAGGACTATA ATGGAGACGG GTGGAGAGAG CTACCCGGAG GCAAGCCTTT AAAACTTACT ATAATTGTGC CGTATGGCTG GACTGATTGG ATGGAGGCAG CCAGACTTAT CGCAAGCGAT CTCCAGAGAG TAGGACTTTA CGTCGAGGCG CAGTTCCCGG ACTACTCGGC GTATAGCGAG GCATTGTACA AAGGAACATT TGACATGTTG ATAAACAACT TTGGCAGCTT TGCCTCTATA TCCCCCTGGG TTATCTACAA CTGGGCACTA TGGCCCGATG CCCCACCCGT AGGCGAATAC TCCTGGAGCG GGAACTTTGG CAGGTACTCT AATCCGAAGG TAACAGAGCT TTTGCACACA ATAGCGAATA CACCTCTCAG CGATGTTACC AAGCTGAAGC AACTCTACGG TCAACTTGAA CAAATATACC TAGACGAGAT GCCTTACATA CCATTATGGT ACAACGGCTA CTGGTTCATT GGCTCGAAGC TGTACTGGAC AGGATGGCCT AGCGCCGATA ACCCGTACGG CGTACCGGTG ACGTGGCCTG GGAGGTGGCA AGACGGCGGC TTATTGGTGC TCCTTAAACT CAGACCTGTG AAAACTCCCA CAACTACTCC AACAACTACA CCGACCACTC CAACTACTCC GACTACCCCC ACTGCTCCGA CGACGCCTAC CGCTCCAGCA 
CCAGACTACA CTCCGTACAT AGTGGCGCTA ATAGTGATAG TCGCTATACT TGCCGTAGCT TACATGTTCT TTGTACAGCG CAAAAAGAAG GAAGAAACCA AGCCTCAATA A
|
Protein sequence | MKKAKVAPIL VLVVLATIAP VFAAPETIPR EEAIYVGGGM WSPPNNWNPL IPWSAVTGTI GLVYETLYLY NPANGTFIPW IADGQPSFQV SGNTVRITIK LKEARWQDGQ PLTSEDVAYT FYEFPKKNTA VYYSSILNYL LSVETPDTRT VVFVCNATTV NYPQIYDFLR SVAIIPKHVW VNKEKPLEDA NWPPIGSGMY KASSYTSDRM IWVRDDNWWG TKYFGTPGPR YIVYVMVSSN AVALAMLARG ELDWSNYFLP GFSSLVQQYP FLVTWYNKSP WNLPANVAFL FVNTKKTPMD NPTFRKALYY AIDVDKIINT VFEGGVIKAL PIGILDIPGY KPFIDTELIS RYGYKYDPEK AKQLLDSIGI KDYNGDGWRE LPGGKPLKLT IIVPYGWTDW MEAARLIASD LQRVGLYVEA QFPDYSAYSE ALYKGTFDML INNFGSFASI SPWVIYNWAL WPDAPPVGEY SWSGNFGRYS NPKVTELLHT IANTPLSDVT KLKQLYGQLE QIYLDEMPYI PLWYNGYWFI GSKLYWTGWP SADNPYGVPV TWPGRWQDGG LLVLLKLRPV KTPTTTPTTT PTTPTTPTTP TAPTTPTAPA PDYTPYIVAL IVIVAILAVA YMFFVQRKKK EETKPQ
|
| |
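The gene sequence is shown in space-separated blocks of 10 bases, so it has to be cleaned before any analysis. The sketch below shows one way to strip the spacing, compute a GC fraction, and translate codons. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the tables differ in permitted start codons, which this sketch does not model). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first three blocks of the sequence are used as input.

```python
from itertools import product

# Codon table built from the standard compact encoding (base order T, C, A, G).
# Assumption: table 11 shares these sense-codon assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(raw: str) -> str:
    """Remove all whitespace (spaces, line breaks) and uppercase the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed in this record.
fragment = clean("ATGAAAAAGG CAAAAGTTGC ACCAATACTG")
print(translate(fragment))  # prints: MKKAKVAPIL
print(round(gc_fraction(fragment), 3))
```

The translated fragment matches the first 10 residues of the listed protein sequence. Applying the same functions to the full cleaned gene sequence would allow the GC content and Protein sequence fields to be checked in the same way.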