Gene Pisl_1021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPisl_1021 
Symbol 
ID4617320 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum islandicum DSM 4184 
KingdomArchaea 
Replicon accessionNC_008701 
Strand
Start bp921661 
End bp922743 
Gene Length1083 bp 
Protein Length360 aa 
Translation table11 
GC content43% 
IMG OID639784118 
Productmajor facilitator transporter 
Protein accessionYP_930538 
Protein GI119872531 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.394996 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value1.70461e-18 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGAGAATAA ACTTCGCAAC TCTCCTTTTT TTCATAGCTA ACGGTATAGT TGTAGTTGCT 
ATTCCACCCT ATTTACGCGA CTTAGGAGTG ATCAGCGAAT CTACGATAGG CACAATAATA
TCTACGGCGT TTTTCGTCTC CGTTATAGTA AGGCCGTTAA GCGGCTTTAT AGGAGATAAA
GTGGGTTATG TAAAAGTGAT GAGGATAGGC GTTGTCTTTG CTGTTATGTC GCAAATTATG
TATCTACTTA GCAACCCGCT ATGGGTTCAA ATTGGCAGAG TTTTTCATGG TTTTGCAATA
GGAACTTTTC TTCCGATGTC TATAGCTATT TCAGTCACAG AGGGAGCCAA AGCCATGGCA
ACACGCTCTT TAGCTGTTGG TATCGGAAAC GTAGTAGGCC CCCTTATAGG TTCCATATTA
TACGACCTTG GCGGGGGCCG TCTATCCATC ACGGTGGCAT TACTCTTACA TACAATCAAT
TGGTTTTTTA TAAATGGGGC CGTATCTACA GAAGCTCGGG GTAAGGGAGG GGACATACTT
ATGCCTGAGA CACGTGTATT TTTCTTCACG GCGTTGTTAA CAATTTATGC AACTGTCTAT
ATGGGCATTT CTACTTTTAC GCCACTTCGT TTAAAAGACG AGGGGTTGCC AATAACCTAC
TGGGGTCTTT TTTCTTCAAT TGCAGCAATT TCTAGCCTAA TACCTCGGGC TTTTTTACTT
AGGATGGGGT TTGTTAATTA TATTACTGCC GGACTTGCAT CTGCTATAAC AATGGCTGGT
TTAGCGCTTG TAGCTGTGGC ATGGGATCTA CCTCTTTTTT CAGTGGCCGG GGCGATATAC
GGCCTGGGGC AAGGCGCCGT TGTTGTTACA TATCAAATAT TGGCTCTTGC TGGTAGTAGA
AACGCAGGTC TTGCTAGTGC TGTTTATACC ATGGGTTGGG ATTTGGGGTC GATCATTGGG
CCAATTTTTG CAGGCGTGCT AGTAGAACAT TTTGGCTATG GCGTTTTATA CTATGTGCCA
CTACTTCTTT TAGCAAACAT GGGGACTCTG TTTATATATG CATTACATAA GCGGAAAATG
TGA
 
Protein sequence
MRINFATLLF FIANGIVVVA IPPYLRDLGV ISESTIGTII STAFFVSVIV RPLSGFIGDK 
VGYVKVMRIG VVFAVMSQIM YLLSNPLWVQ IGRVFHGFAI GTFLPMSIAI SVTEGAKAMA
TRSLAVGIGN VVGPLIGSIL YDLGGGRLSI TVALLLHTIN WFFINGAVST EARGKGGDIL
MPETRVFFFT ALLTIYATVY MGISTFTPLR LKDEGLPITY WGLFSSIAAI SSLIPRAFLL
RMGFVNYITA GLASAITMAG LALVAVAWDL PLFSVAGAIY GLGQGAVVVT YQILALAGSR
NAGLASAVYT MGWDLGSIIG PIFAGVLVEH FGYGVLYYVP LLLLANMGTL FIYALHKRKM