Gene Pisl_1958 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPisl_1958 
Symbol 
ID4616339 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum islandicum DSM 4184 
KingdomArchaea 
Replicon accessionNC_008701 
Strand
Start bp1773830 
End bp1774846 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content57% 
IMG OID639785049 
ProductABC transporter periplasmic-binding protein 
Protein accessionYP_931448 
Protein GI119873441 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4143] ABC-type thiamine transport system, periplasmic component 
TIGRFAM ID[TIGR01254] ABC transporter periplasmic binding protein, thiB subfamily 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0213047 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value1.86869e-16 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCCATTA GGACTTTATT GGCAGTTGTA ATAATAATAG CCGCCCTCTT GGCGGCGCTC 
TTGGCTTGGC AAGCGTTTCA ACAACAGTCG GCGAAGAGAC TTGTGATAGT GGGCCCCGCC
GGCATCAGCG ATTTGGGCAA GGAGTTGGCT AAGAAGTTCA GCGAGAAATA TGGCGTAAAC
GCCACCTTTG TCCCCCTAGG CGGCGCTGTG GAGATGGTAA ACGAGTTGGT TAGAAACAGA
GACAACCCGC CCTGGGACGT GACCATTGGG GTGCCTGAGT TCTACTACAT GGTGCTTATC
GAAAGGGGCG TGCTCTATTG TCCGGGCTTC AAGGTGGAGG GGGTGCCGGC CGAGGAGTAC
TGGGATCCCC ACGGCTGCGT CTACCCGCTT GATAAGTCGT ACATAGGGAT TGTCTACAAC
GAGACAGCTC TCGCCGCGCG GGGCCTCAAG CCGCCTCAGA CCCTCGACGA TCTTCTGAAG
CCTGAGTACA AGGGGCTTAT CACATATCCC AACCCGGTTC AGTCCGGCAC CGGCCTCGCC
GTGCTCTCGT GGGTGATGTC TGTGAAGGGG GAGGAGGAGG GCTGGCGCTA CCTCAAACAG
CTGGCCGGCC AGATCTCTAA GATCGGCTAT CCTTCAGGAT TTACGCCATT GAGAAACGCA
TTGAAGAGGG GTGACGTATT GATCGCCCTC TCGTGGTACA GTCACGCCAT CGACCCAGGC
ACGCCGAATA TAAAGGCCGC GACGTACAGC GCCTTCTTGT ATAGGGAGGG GGTGGCTGTG
TTGAAAAACG CCAGGAATAG GGATCTGGCT GTGGAGTTCG TCAAGTTCGC ACTGAGTAAA
GAGGGGCAAG ATCTTGTCGA CCCATACAAC TACATGCTCC CGGTTAGGCC AGACGCCGTT
ATTAAAAACA ACAAGGGCCT CCCGCAGCCC CAGTCTGTGG TGGTGTACAA CTCTGCCCTG
GGCGCAAAGG CCGACGAGTG GAGGCTGAGG TGGCAGAGAG AAATCGCCTC TGGGTGA
 
Protein sequence
MSIRTLLAVV IIIAALLAAL LAWQAFQQQS AKRLVIVGPA GISDLGKELA KKFSEKYGVN 
ATFVPLGGAV EMVNELVRNR DNPPWDVTIG VPEFYYMVLI ERGVLYCPGF KVEGVPAEEY
WDPHGCVYPL DKSYIGIVYN ETALAARGLK PPQTLDDLLK PEYKGLITYP NPVQSGTGLA
VLSWVMSVKG EEEGWRYLKQ LAGQISKIGY PSGFTPLRNA LKRGDVLIAL SWYSHAIDPG
TPNIKAATYS AFLYREGVAV LKNARNRDLA VEFVKFALSK EGQDLVDPYN YMLPVRPDAV
IKNNKGLPQP QSVVVYNSAL GAKADEWRLR WQREIASG