Gene Pisl_0602 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPisl_0602 
Symbol 
ID4617606 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum islandicum DSM 4184 
KingdomArchaea 
Replicon accessionNC_008701 
Strand
Start bp552674 
End bp554311 
Gene Length1638 bp 
Protein Length545 aa 
Translation table11 
GC content48% 
IMG OID639783695 
Productextracellular solute-binding protein 
Protein accessionYP_930123 
Protein GI119872116 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.497567 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.000000266408 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAATAAGT CTCTAGCTGT AGGAATAGTT GTTTTAGTTG TCGTGTTGGC AGTTATAGCA 
ATATTCCTAT CTCAACAACA AGCCCCTGCG CCCCGTCCGT TGCCACAGTC TTCAAGTCCC
ACTACAACGC AGACTCCTAT TCCCACCACG CCGAAGTCTT CTCCCACTTC TACAACGCCG
CAACCAACTT CGCTTACGAT CGGAGTGACA GATAAAGTCA CCGACTTAGA TCCAGCGAAC
GCCTACGACT TCTTTACATG GGAAATTTTC TACAACACAA TGGCGGGGCT CGTTAGATAC
AAGCCGGGAA CAACAGAGCT AGAGCCCGAC CTAGCTGAGA ATTGGACGGT GCTAGAGGGG
GGCAAGGTGT GGGTGTTTAA ACTAAGGCCA AATCTTAAGT TCTGTGACGG TACCCCGCTT
ACTGCCAGTG ATGTAAAGCG GTCTATCGAG CGCGTAATGA AGATAAACGG GGACCCTGCT
TGGCTTGTCA CAGATTTTGT AGAAAAGGTA GAGACACCCA ACGCAACTAC AGTGATCTTC
TACTTGAAGA CACCTGTCTC CTACTTCCTA GCGCTTGTAG CAACGCCGCC TTACTTCCCC
GTACATCCAA AATATGTACC CGATAGAATC GACTCGGATC AGACGGCAGG CGGCGCAGGG
CCCTACTGTA TAAAGTCATT TGTGAGAGAC CAACAGATAG TGCTAGAGGC AAATCCATAC
TACTATGGGC CTAAGCCGCA GATATCTAAA GTTGTCATAA AGTTTTATAA AGACGCTACG
ACGCTGAGAC TTGCCCTAGA AAGAGGCGAG GTCGATATAG CCTGGAGGAC TCTTAATCCG
CCTGATATTG AGGCGCTAAA GGCATCAGGC AAGTTTAAAA TCGTCGAAGT GCCTGGTTCT
TTTATACGTT ACATAGTACT TAATCTCAAT ATGCCAGAGC TAAAGGACGT TAGAGTAAGA
CAAGCGCTGG CCGCCGCCGT GTGTAGAAAG GACATTGTGC AGACGGTATT CCGTGGCGCT
GTTACGCCGC TTTATACTCT AATACCAGAG GGAATGTGGG GTTCTTACCC AGTCTTTAAG
GAGAGGTACG GGGACTGTAA CATAAGCCTA GCCAGATCCC TTCTACAGCA GGCTGGGTAT
AGCCAGAGCA AGAAACTGTC GATCGAGCTC TGGTATACGC CGACACACTA CGGCGATACC
GAAAAGGATC TTGCAGCAGT GTTGAAAGAG CAGTGGGAAG CCACAGGGCT TGTTTCTGTT
ACCGTGAAGT CGGCCGAGTG GGCTACATAT GTACAGCAAC TTAGAAGCGG CGCTATGATG
GTATCTCTGC TGGGTTGGTA TCCCGACTAC CTGGATCCAG ATGATTACAC AACGCCGTTT
CTCAAAAGCG GCTCTAACAA ATGGTTGGGT AATGGCTATA GCAATCCAAA AATGGACGAG
ATACTCACCA AAGCCTCTGT TGAGGAAAGT CGGTTAGTTA GAGAACAGTT ATATAGACAG
GCACAACAAC TGTTGGCGGA AGATGTGCCC ATAATTCCTC TTGTACAAGG TAAGCTGTTT
ATAGCAACGA AGCCAAATAT ACAAGTCGTA GTAGATCCGA CAATGATCCT GAGATATTGG
GCAATAAAAA TAGCCTAG
 
Protein sequence
MNKSLAVGIV VLVVVLAVIA IFLSQQQAPA PRPLPQSSSP TTTQTPIPTT PKSSPTSTTP 
QPTSLTIGVT DKVTDLDPAN AYDFFTWEIF YNTMAGLVRY KPGTTELEPD LAENWTVLEG
GKVWVFKLRP NLKFCDGTPL TASDVKRSIE RVMKINGDPA WLVTDFVEKV ETPNATTVIF
YLKTPVSYFL ALVATPPYFP VHPKYVPDRI DSDQTAGGAG PYCIKSFVRD QQIVLEANPY
YYGPKPQISK VVIKFYKDAT TLRLALERGE VDIAWRTLNP PDIEALKASG KFKIVEVPGS
FIRYIVLNLN MPELKDVRVR QALAAAVCRK DIVQTVFRGA VTPLYTLIPE GMWGSYPVFK
ERYGDCNISL ARSLLQQAGY SQSKKLSIEL WYTPTHYGDT EKDLAAVLKE QWEATGLVSV
TVKSAEWATY VQQLRSGAMM VSLLGWYPDY LDPDDYTTPF LKSGSNKWLG NGYSNPKMDE
ILTKASVEES RLVREQLYRQ AQQLLAEDVP IIPLVQGKLF IATKPNIQVV VDPTMILRYW
AIKIA