Gene Information |
Locus tag | Tpen_1152 |
Symbol | |
ID | 4600958 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1090712 |
End bp | 1091893 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 639773928 |
Product | extracellular solute-binding protein |
Protein accession | YP_920553 |
Protein GI | 119720058 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0401617 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGATAAGGT TCGCCGGGTG GAGCGCCGGA GAAACAGAGA TGAAGAACTA CCAGAAGATA ATCGAGGACT TCCAAAAGGC TAACCCGGAT ATAATCGTCA AGTACGAGGT AATAACCCAG ATGTTCTCCG AGAACATTCT AGCAAGCTAC GCGGCCGGCG CCGCTCCCGA CATATTCTAC GTTGACTCTG CGTGGGCGCC CACGTTCATC AGTAAGGGCG CTCTCTACCC CATAGGCGAC AAGCTACCCA AGGACTTCAT CGACCAGTTC TACCCGTTCC TCCTCGAACC GTTCAAGGGG CCTGACGGGA AGATCTACGG ATTACCGAAG GACTGGTCCG TGCTCTCGCT GTTCTACAAC AAGAAGTTGT TCGCCCAGGC AGGAGTACCC GAGCCCACCG CCGACTGGAC CTGGGACGAC CTCTTCAACG CCGCTAAGAC CATCTACCAG AAGACCGGTA AGCCCGGGCT AGTCGTACAC GCAGAGCTCA ACAGGTGGGT ACCCTTCCTC GTCTCCAACG GTGCTCCCCC ACCGCGCTTC GACTCGGCGG CCGACGCCGC CTACTTCGAC AAGCCCGAGG TTAGGAACGC GATTTCGAAG ATGATAGCCA AGATACAGGA GGGGCGTAAA GAGGGCTACA TAGTCCTGCC CTCGGACGTG AACGCCGGCT GGAACGGGGA GGCCTTCGGC AAGCAGCTCG CCGCGATGAC TATCGAGGGT AGCTGGATGA TACCCTACCT CGCGGACCAG TTCCCCAACT TCAAGTACGG CTCCGACTGG GACCTCGCGA TGCTACCTAA GGGTCCCGCC GGCAGGGCAA GCATGGCTTA CACCGTGGCG CTCGGAGTGA ACTCCAAGAC CGAGAACCTG GACGCGGCGC TGAAGTTCCT GCAGTACGTT GAAGGCATTG AAGGGCAGAA GCTACTAGTG GTGAAGATGG GTCATACCCT CCCGTCCATA AAGGCGCTAG CAAACGACCC AGACCTCTGG CCCTCTCACG CTAAGGAGCT ATCGTTCGTG AACAAGTACG ACCGCGTGGC GCTCTTCTTC TACGGCCCGA AGACAGGACA GATAGAGGGA AGCATAAACC AGATCATCCA GTCTGCGGTC AGGGGGGAGA TAACGATAGA CGAAGCGCTA AGGCTGATGA AAGACAAGGT TGCAGAAGCC TTTAAATCCT AG
|
Protein sequence | MIRFAGWSAG ETEMKNYQKI IEDFQKANPD IIVKYEVITQ MFSENILASY AAGAAPDIFY VDSAWAPTFI SKGALYPIGD KLPKDFIDQF YPFLLEPFKG PDGKIYGLPK DWSVLSLFYN KKLFAQAGVP EPTADWTWDD LFNAAKTIYQ KTGKPGLVVH AELNRWVPFL VSNGAPPPRF DSAADAAYFD KPEVRNAISK MIAKIQEGRK EGYIVLPSDV NAGWNGEAFG KQLAAMTIEG SWMIPYLADQ FPNFKYGSDW DLAMLPKGPA GRASMAYTVA LGVNSKTENL DAALKFLQYV EGIEGQKLLV VKMGHTLPSI KALANDPDLW PSHAKELSFV NKYDRVALFF YGPKTGQIEG SINQIIQSAV RGEITIDEAL RLMKDKVAEA FKS
|
| |
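The coordinate and length fields in the record can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the source record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon while the protein length does not. The helper names `cds_lengths` and `gc_percent` are hypothetical and chosen only for illustration.

```python
from typing import Tuple


def cds_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumption: coordinates are 1-based and inclusive, and the final codon
    is a stop codon that is counted in the gene but not in the protein.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len, gene_len // 3 - 1


def gc_percent(seq: str) -> float:
    """GC percentage of a nucleotide string, ignoring whitespace between blocks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


# Values taken from the Start bp and End bp fields of the record.
gene_len, protein_len = cds_lengths(1090712, 1091893)
print(gene_len, protein_len)  # 1182 393
```

The result agrees with the Gene Length (1182 bp) and Protein Length (393 aa) fields. Passing the space-separated Gene sequence string to `gc_percent` would give a figure to compare against the rounded GC content field.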