Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1257 |
Symbol | |
ID | 4600423 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1193429 |
End bp | 1194832 |
Gene Length | 1404 bp |
Protein Length | 467 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 639774033 |
Product | extracellular solute-binding protein |
Protein accession | YP_920658 |
Protein GI | 119720163 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCCAGC AGGCAACCAA GAAGAAAGTT TCGCCGCTTC TAATCGCCGG AATCCTGGTA GTCATAGTGG TCCTCGTCGC CGCGGCGTTC CTGCTCTACA AGCCTCCAGC CGCCCCCACT AAGCCCTCGA AGATAACGTT CTACACGTGG TGGGCCGGGC TCGAGAGGTT CGCCATAGAC GCGCTTATCG GCAACTTCAC GAAGAATACG GGGGTTGCTG TTGAAAAGAC GGCGGTACCC GGAGGCGCGG GGGTAAACGC TAAGTACGCC ATCATAGCGC TGATAATGGC CGGGAAGCCC CCGGAGGCCT TCCAGGTACA CTGCGGGCCC GAGATGATTA GCTACTTCAT GGCGGCGCCA CACGGAAAAG ACGACTTCGT CGACCTGACC TCCGTCGGGC AGGAGATAGG CCTCACGGCG ACTCCTCCCG GGCAGGTGTG CATGCTGAGC GGGCGCCTCT ACACGCTCCC AGTGAACCTC CACAGGGCTA ACCTGATCTT CATGAACAAG CAGGTACTCG ACAAGTACGG CGTAAAGCCT CCGACCACGA TCGACGAGCT GAACGCCGCT TGCAGCAAGC TCAAGGCGGC GGGAGTACCG TGCCTCGTGC AGGCAGGAGC GGACCAGTTC ACGGTGCTAC ACCTCTGGGA GCAGATATTC CTCGCCGTGG CGGGACCCGA TAAGTTCATA AAGTTCATGT ACGGGACCCT CGACCCCAAC GACCCCAGCA TAACCCAGGC CACCCAGATA TTCCTCGGCT ACGTTGACAC GTTCCCGCCG GACTGGATGG CCCTCGACTG GACCTCCGCG GTAGACAGGG TGGTAAAGGG CATGGGAGCC TTCCACGTGG ACGGGGACTG GGCTGTCGGG CTTATCTACA ACGTCTACCC GAACGTGAAG ATGTGCCCCA TAGACGCCAT TACCCCTGAC TGCAACATCA TAGTGGCGCC GTTCCCGGGC ACGCAGGGCA TCTACAACAT GGTCATCGAC GCCGTAGCCG TGCCCAAGGG TCCCGCCCAG GACCTCGGAG TCCAGTTCGC CAAGTTCTTC GCCTCGAGGG ACGGTCAGAA GATATTCAAC CCGCTTAAAG GCTCGATAGC GTGCTACGCG GACCTACCGA CCGACATATA CCCGACCTCG ATACAGAAGT GGGAGGTAAG CCAGTACGCG GCTTCCAAGT CGCAGGTATT CAGCATCACG CACGGTGCCC TGTTCTCCGA CGTCTGGAGC AAGCTTCTGA GCGGCGCAGT GCTCCTAGCG CAGACAAAGC AGACCTCGAT GTGGTACTCG ACCGTCAGCG ACGCGATTAA GCTCGAGAGA CAGCTCTGGG AGCAGAGCGG GCTCTTCCTG GGAACCCCCG AGAAGCCGTT CGCCGGCTAC CTCCCGCCCT GGGCAAAGAA GTAG
|
Protein sequence | MSQQATKKKV SPLLIAGILV VIVVLVAAAF LLYKPPAAPT KPSKITFYTW WAGLERFAID ALIGNFTKNT GVAVEKTAVP GGAGVNAKYA IIALIMAGKP PEAFQVHCGP EMISYFMAAP HGKDDFVDLT SVGQEIGLTA TPPGQVCMLS GRLYTLPVNL HRANLIFMNK QVLDKYGVKP PTTIDELNAA CSKLKAAGVP CLVQAGADQF TVLHLWEQIF LAVAGPDKFI KFMYGTLDPN DPSITQATQI FLGYVDTFPP DWMALDWTSA VDRVVKGMGA FHVDGDWAVG LIYNVYPNVK MCPIDAITPD CNIIVAPFPG TQGIYNMVID AVAVPKGPAQ DLGVQFAKFF ASRDGQKIFN PLKGSIACYA DLPTDIYPTS IQKWEVSQYA ASKSQVFSIT HGALFSDVWS KLLSGAVLLA QTKQTSMWYS TVSDAIKLER QLWEQSGLFL GTPEKPFAGY LPPWAKK
|
| |