Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tneu_1089 |
Symbol | |
ID | 6165555 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | - |
Start bp | 972979 |
End bp | 973995 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641668241 |
Product | ABC transporter periplasmic-binding protein |
Protein accession | YP_001794466 |
Protein GI | 171185547 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG4143] ABC-type thiamine transport system, periplasmic component |
TIGRFAM ID | [TIGR01254] ABC transporter periplasmic binding protein, thiB subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.167186 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.0159055 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCATTA GGGGTTTATT GGCGGCTGTG GTGGTCGTCG CCGCCGTGTT GGCGGCCCTC ATGGCGTGGC AGTCCCTCCA GCAGCAGATG GAGAGGAAGC TCGTGATCGT GGGGCCGGCC GGCATCGGCG ACTTGGGCAG GGAGCTGGCC AGGAGGTTCA GCGAGAGGTA TGGGGTAAAC GCCACCTTTG TGGCGCTTGG GGGGGCGGTG GAGATGGTGA ACGAGCTGGT GAGAAACAGG GACAACCCGC CGTGGGACGT GGCCATCGGG GTGCCCGAGT TCTACTACAC CGTTCTGGTG GAGAAGGGCG TGCTCTACTG CCCCGGCTTC AAGGTGGAGG GGGTGCCGGC CGAGGAGTAC TGGGATCCCC ACGGCTGCGT CTACCCGCTT GACAAGTCCT ACATCGGGAT CGTCTACAAC GAGTCGGCCC TCGCCGCGCG GGGCCTCAAG CCGCCTCAGA CCCTCGACGA TCTTCTGAAG CCGGAGTACA GGGGGCTTAT CACATATCCC AACCCGGTCC AGTCGGGCAC CGGCCTCGCC GTGCTCTCCT GGGTCATGTC TGTGAAGGGG GAGGAGGAGG GCTGGCGCTA CCTCAAGCAG CTGTCCAGCC AGATCTCCAA GATCGGCTAC CCGTCCGGCT TCACGCCGTT GAGAAGCGCG TTGAAGAGGG GGGATGTTTT GATCGCCCTC TCGTGGTACA GCCACGCCAT CGACCCCGGG ACCCCCAGCA TGAAGGCCGC GACGTACAGC GCCTTCCTAT ACAAGGAGGG GGTGGCCGTG TTGAAAAACG CCAGGAACAG GGACCTGGCC CTGGAGTTCG TCAAATTCGC GCTGAGTAAG GAGGGGCAGG ACCTGGTCGA CCCCTACAAC TACATGCTCC CGGTTAGGCC AGACGCCGTG GTTAAAAACA ACCTGGGCCT CCCGAGGCCG CAGTCCGTCG TCGTCTACAA CCCGGCGCTG GGGTCCAAAG CCGACGAGTG GAGGCTGAGG TGGCAGAGGG AGATCGCGTC TGGGTGA
|
Protein sequence | MSIRGLLAAV VVVAAVLAAL MAWQSLQQQM ERKLVIVGPA GIGDLGRELA RRFSERYGVN ATFVALGGAV EMVNELVRNR DNPPWDVAIG VPEFYYTVLV EKGVLYCPGF KVEGVPAEEY WDPHGCVYPL DKSYIGIVYN ESALAARGLK PPQTLDDLLK PEYRGLITYP NPVQSGTGLA VLSWVMSVKG EEEGWRYLKQ LSSQISKIGY PSGFTPLRSA LKRGDVLIAL SWYSHAIDPG TPSMKAATYS AFLYKEGVAV LKNARNRDLA LEFVKFALSK EGQDLVDPYN YMLPVRPDAV VKNNLGLPRP QSVVVYNPAL GSKADEWRLR WQREIASG
|
| |