Gene Pisl_1331 details


Gene Information

Locus tag: Pisl_1331
Symbol:
ID: 4617492
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum islandicum DSM 4184
Kingdom: Archaea
Replicon accession: NC_008701
Strand:
Start bp: 1205540
End bp: 1206808
Gene Length: 1269 bp
Protein Length: 422 aa
Translation table: 11
GC content: 50%
IMG OID: 639784420
Product: major facilitator transporter
Protein accession: YP_930837
Protein GI: 119872830
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
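
The start and end coordinates, gene length, and protein length listed above are mutually consistent. A minimal sketch of that check follows. It assumes inclusive 1-based coordinates and a single terminal stop codon that is not counted in the protein length; the function name `cds_lengths` is illustrative and not part of any database API.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes inclusive 1-based coordinates and one terminal stop codon
    that is excluded from the protein length.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return gene_length, gene_length // 3 - 1


gene_len, protein_len = cds_lengths(1205540, 1206808)
print(gene_len, protein_len)  # 1269 422
```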


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 85
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTATAA GACTTGTAAC TTCAGCATCG ATGGCCGGAA CTCTAATTGA ATGGTATGAG 
TTTTTTGCAT ATGCCTCGTT ATCTCCCTAC ATAATGGCGA ATTTCTTCCC AAAGGGAGAC
CCCATAGCAG CTAGTTTACT CACGTGGCTA GTGTTTGCAA CAGGTTTTGT CGTCAGGCCG
GTCGGTGCCG TGGTTTTTGG CCACCTCGGC GATAAAGTGG GTAGAAAAAC CACCTTTATT
GCCACTCTTC TCCTAATGGG CATGGCGACG TTTTTCATGG GGCTTCTGCC GACGTATAGC
CAAGTGGGAA TTCTAGCGCC TATATTGCTG ACTTTATTAC GAATCCTACA GGGTATTTCC
CTCGGCGGCG AATACGGCGG GGCTATAACA TATGTACTTG AACATGCGGC CGGCCGTAAT
AGGGCGTTTT ATGCTGGTTT TGTTGCCGCC ACTCCGCCGT TGGGCCTTGG CCTTTCTTCG
CTGACGCTTG TAACGGTGGC CAAACTTCTG CCGCCCCAAG ACTTTCAGAC ATACGGCTGG
CGCATGCCAT TTCTCGTCTC TATCGTGTTA ACTATACTTG GCTTAGTGTT ACGTCTAAAG
CTGACGGAGA CCCCCCTCTT TGAGAAAATT AAGAGCGAGA AAGCCGTGGC GAAGATACCT
CTGATAGAGG CGGTGGTTAA GTATCCACGT TATATCTTAG TCGGCATAGC TGTGGCGGCG
GGACATTCTG TCCTTGCATA TACAGCGACC GGCTATATAT TCCCATTTCT CACAAGCGTG
CCTAAGCTCA GCCCTGTCGA TGCCAGCTTC GCTGTGGGAA TAGCCGGCGT ACTCCAGCTA
CCGTTCTACA TACTCAACGC CTGGCTTTCA GAAAAAGTAG GCCGGAGGGT AATCTACTCG
GCCGGCCTCG CTATGGCGTT GGCCACGTAC TATTCAATTT TCAGTTGGCT GACAGGAGTT
AAAGACCTAG CTCTAGTCAC GCTTGGCGTC TTTGTTCTCA TACTCGCCAC GGCGTTTACC
TTCAGCGTGC TTGGCACGGC GCTAGCTGAG CTATTCCCCA CTAAGGTGAG ATACACGGCC
ATGTCTCTCT CATTTAACCT AGGCATTGGC GTCTTTGGCG GCTTTACGCC ATCGATAGTA
CAAGCGATTT CCTACTGGCT TAACAACCCA ATTGCAGGCG TGCTATTTTA CACATATATA
GTAGCCGGCT TCGCCCTGGC TGTAGCACTT ACACTCATGC CTGAGACTAA GGACAAACCT
CTAGACTAG
 
Protein sequence
MSIRLVTSAS MAGTLIEWYE FFAYASLSPY IMANFFPKGD PIAASLLTWL VFATGFVVRP 
VGAVVFGHLG DKVGRKTTFI ATLLLMGMAT FFMGLLPTYS QVGILAPILL TLLRILQGIS
LGGEYGGAIT YVLEHAAGRN RAFYAGFVAA TPPLGLGLSS LTLVTVAKLL PPQDFQTYGW
RMPFLVSIVL TILGLVLRLK LTETPLFEKI KSEKAVAKIP LIEAVVKYPR YILVGIAVAA
GHSVLAYTAT GYIFPFLTSV PKLSPVDASF AVGIAGVLQL PFYILNAWLS EKVGRRVIYS
AGLAMALATY YSIFSWLTGV KDLALVTLGV FVLILATAFT FSVLGTALAE LFPTKVRYTA
MSLSFNLGIG VFGGFTPSIV QAISYWLNNP IAGVLFYTYI VAGFALAVAL TLMPETKDKP
LD
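
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is one way to do that under stated assumptions: it builds the codon map from the standard 64-letter amino acid string in TCAG order (translation table 11 uses the same codon assignments as the standard code and differs only in the start codons it permits), ignores the whitespace used to group the sequence in blocks of ten, and stops at the first stop codon. The names `CODON_TABLE` and `translate` are illustrative. Only the first 30 bases and the final 9 bases of the gene sequence are translated here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ...
# "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGTCTATAA GACTTGTAAC TTCAGCATCG"))  # MSIRLVTSAS
print(translate("CTAGACTAG"))  # LD
```

The two outputs match the first ten residues and the last two residues of the protein sequence above. This sketch does not handle the alternative start codons that table 11 allows, which is sufficient here because the gene begins with ATG.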