Gene Information |
Locus tag | Pisl_1958 |
Symbol | |
ID | 4616339 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum islandicum DSM 4184 |
Kingdom | Archaea |
Replicon accession | NC_008701 |
Strand | + |
Start bp | 1773830 |
End bp | 1774846 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 639785049 |
Product | ABC transporter periplasmic-binding protein |
Protein accession | YP_931448 |
Protein GI | 119873441 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG4143] ABC-type thiamine transport system, periplasmic component |
TIGRFAM ID | [TIGR01254] ABC transporter periplasmic binding protein, thiB subfamily |
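The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The record does not state either assumption, but both agree with its listed lengths. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1773830
end_bp = 1774846

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: one codon (3 bp) per residue, with the terminal stop codon not counted.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1017
print(protein_length)  # 338
```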
| |
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0213047 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 1.86869e-16 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCCATTA GGACTTTATT GGCAGTTGTA ATAATAATAG CCGCCCTCTT GGCGGCGCTC TTGGCTTGGC AAGCGTTTCA ACAACAGTCG GCGAAGAGAC TTGTGATAGT GGGCCCCGCC GGCATCAGCG ATTTGGGCAA GGAGTTGGCT AAGAAGTTCA GCGAGAAATA TGGCGTAAAC GCCACCTTTG TCCCCCTAGG CGGCGCTGTG GAGATGGTAA ACGAGTTGGT TAGAAACAGA GACAACCCGC CCTGGGACGT GACCATTGGG GTGCCTGAGT TCTACTACAT GGTGCTTATC GAAAGGGGCG TGCTCTATTG TCCGGGCTTC AAGGTGGAGG GGGTGCCGGC CGAGGAGTAC TGGGATCCCC ACGGCTGCGT CTACCCGCTT GATAAGTCGT ACATAGGGAT TGTCTACAAC GAGACAGCTC TCGCCGCGCG GGGCCTCAAG CCGCCTCAGA CCCTCGACGA TCTTCTGAAG CCTGAGTACA AGGGGCTTAT CACATATCCC AACCCGGTTC AGTCCGGCAC CGGCCTCGCC GTGCTCTCGT GGGTGATGTC TGTGAAGGGG GAGGAGGAGG GCTGGCGCTA CCTCAAACAG CTGGCCGGCC AGATCTCTAA GATCGGCTAT CCTTCAGGAT TTACGCCATT GAGAAACGCA TTGAAGAGGG GTGACGTATT GATCGCCCTC TCGTGGTACA GTCACGCCAT CGACCCAGGC ACGCCGAATA TAAAGGCCGC GACGTACAGC GCCTTCTTGT ATAGGGAGGG GGTGGCTGTG TTGAAAAACG CCAGGAATAG GGATCTGGCT GTGGAGTTCG TCAAGTTCGC ACTGAGTAAA GAGGGGCAAG ATCTTGTCGA CCCATACAAC TACATGCTCC CGGTTAGGCC AGACGCCGTT ATTAAAAACA ACAAGGGCCT CCCGCAGCCC CAGTCTGTGG TGGTGTACAA CTCTGCCCTG GGCGCAAAGG CCGACGAGTG GAGGCTGAGG TGGCAGAGAG AAATCGCCTC TGGGTGA
|
Protein sequence | MSIRTLLAVV IIIAALLAAL LAWQAFQQQS AKRLVIVGPA GISDLGKELA KKFSEKYGVN ATFVPLGGAV EMVNELVRNR DNPPWDVTIG VPEFYYMVLI ERGVLYCPGF KVEGVPAEEY WDPHGCVYPL DKSYIGIVYN ETALAARGLK PPQTLDDLLK PEYKGLITYP NPVQSGTGLA VLSWVMSVKG EEEGWRYLKQ LAGQISKIGY PSGFTPLRNA LKRGDVLIAL SWYSHAIDPG TPNIKAATYS AFLYREGVAV LKNARNRDLA VEFVKFALSK EGQDLVDPYN YMLPVRPDAV IKNNKGLPQP QSVVVYNSAL GAKADEWRLR WQREIASG
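As a quick consistency check between the two sequences, the first 30 bases of the gene sequence can be translated and compared with the start of the protein sequence. This is a minimal sketch: the `CODONS` dictionary is deliberately partial (only the codons that occur in this prefix) and `translate` is a hypothetical helper, not part of any library. The codon assignments used here are the same in the standard code and in translation table 11.

```python
# First three 10-base blocks of the gene sequence above, with spaces removed.
prefix = "ATGTCCATTA GGACTTTATT GGCAGTTGTA".replace(" ", "")

# Partial codon table: only the codons present in this prefix.
CODONS = {
    "ATG": "M", "TCC": "S", "ATT": "I", "AGG": "R", "ACT": "T",
    "TTA": "L", "TTG": "L", "GCA": "A", "GTT": "V", "GTA": "V",
}

def translate(dna: str) -> str:
    """Translate complete codons from the first base; trailing bases are ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODONS[dna[i:i + 3]] for i in range(0, usable, 3))

peptide = translate(prefix)
print(peptide)  # MSIRTLLAVV
```

The result matches the first ten residues of the protein sequence listed above.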