Gene Information |
Locus tag | Pisl_0602 |
Symbol | |
ID | 4617606 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum islandicum DSM 4184 |
Kingdom | Archaea |
Replicon accession | NC_008701 |
Strand | + |
Start bp | 552674 |
End bp | 554311 |
Gene Length | 1638 bp |
Protein Length | 545 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 639783695 |
Product | extracellular solute-binding protein |
Protein accession | YP_930123 |
Protein GI | 119872116 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.497567 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.000000266408 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | ATGAATAAGT CTCTAGCTGT AGGAATAGTT GTTTTAGTTG TCGTGTTGGC AGTTATAGCA ATATTCCTAT CTCAACAACA AGCCCCTGCG CCCCGTCCGT TGCCACAGTC TTCAAGTCCC ACTACAACGC AGACTCCTAT TCCCACCACG CCGAAGTCTT CTCCCACTTC TACAACGCCG CAACCAACTT CGCTTACGAT CGGAGTGACA GATAAAGTCA CCGACTTAGA TCCAGCGAAC GCCTACGACT TCTTTACATG GGAAATTTTC TACAACACAA TGGCGGGGCT CGTTAGATAC AAGCCGGGAA CAACAGAGCT AGAGCCCGAC CTAGCTGAGA ATTGGACGGT GCTAGAGGGG GGCAAGGTGT GGGTGTTTAA ACTAAGGCCA AATCTTAAGT TCTGTGACGG TACCCCGCTT ACTGCCAGTG ATGTAAAGCG GTCTATCGAG CGCGTAATGA AGATAAACGG GGACCCTGCT TGGCTTGTCA CAGATTTTGT AGAAAAGGTA GAGACACCCA ACGCAACTAC AGTGATCTTC TACTTGAAGA CACCTGTCTC CTACTTCCTA GCGCTTGTAG CAACGCCGCC TTACTTCCCC GTACATCCAA AATATGTACC CGATAGAATC GACTCGGATC AGACGGCAGG CGGCGCAGGG CCCTACTGTA TAAAGTCATT TGTGAGAGAC CAACAGATAG TGCTAGAGGC AAATCCATAC TACTATGGGC CTAAGCCGCA GATATCTAAA GTTGTCATAA AGTTTTATAA AGACGCTACG ACGCTGAGAC TTGCCCTAGA AAGAGGCGAG GTCGATATAG CCTGGAGGAC TCTTAATCCG CCTGATATTG AGGCGCTAAA GGCATCAGGC AAGTTTAAAA TCGTCGAAGT GCCTGGTTCT TTTATACGTT ACATAGTACT TAATCTCAAT ATGCCAGAGC TAAAGGACGT TAGAGTAAGA CAAGCGCTGG CCGCCGCCGT GTGTAGAAAG GACATTGTGC AGACGGTATT CCGTGGCGCT GTTACGCCGC TTTATACTCT AATACCAGAG GGAATGTGGG GTTCTTACCC AGTCTTTAAG GAGAGGTACG GGGACTGTAA CATAAGCCTA GCCAGATCCC TTCTACAGCA GGCTGGGTAT AGCCAGAGCA AGAAACTGTC GATCGAGCTC TGGTATACGC CGACACACTA CGGCGATACC GAAAAGGATC TTGCAGCAGT GTTGAAAGAG CAGTGGGAAG CCACAGGGCT TGTTTCTGTT ACCGTGAAGT CGGCCGAGTG GGCTACATAT GTACAGCAAC TTAGAAGCGG CGCTATGATG GTATCTCTGC TGGGTTGGTA TCCCGACTAC CTGGATCCAG ATGATTACAC AACGCCGTTT CTCAAAAGCG GCTCTAACAA ATGGTTGGGT AATGGCTATA GCAATCCAAA AATGGACGAG ATACTCACCA AAGCCTCTGT TGAGGAAAGT CGGTTAGTTA GAGAACAGTT ATATAGACAG GCACAACAAC TGTTGGCGGA AGATGTGCCC ATAATTCCTC TTGTACAAGG TAAGCTGTTT ATAGCAACGA AGCCAAATAT ACAAGTCGTA GTAGATCCGA CAATGATCCT GAGATATTGG GCAATAAAAA TAGCCTAG
Protein sequence | MNKSLAVGIV VLVVVLAVIA IFLSQQQAPA PRPLPQSSSP TTTQTPIPTT PKSSPTSTTP QPTSLTIGVT DKVTDLDPAN AYDFFTWEIF YNTMAGLVRY KPGTTELEPD LAENWTVLEG GKVWVFKLRP NLKFCDGTPL TASDVKRSIE RVMKINGDPA WLVTDFVEKV ETPNATTVIF YLKTPVSYFL ALVATPPYFP VHPKYVPDRI DSDQTAGGAG PYCIKSFVRD QQIVLEANPY YYGPKPQISK VVIKFYKDAT TLRLALERGE VDIAWRTLNP PDIEALKASG KFKIVEVPGS FIRYIVLNLN MPELKDVRVR QALAAAVCRK DIVQTVFRGA VTPLYTLIPE GMWGSYPVFK ERYGDCNISL ARSLLQQAGY SQSKKLSIEL WYTPTHYGDT EKDLAAVLKE QWEATGLVSV TVKSAEWATY VQQLRSGAMM VSLLGWYPDY LDPDDYTTPF LKSGSNKWLG NGYSNPKMDE ILTKASVEES RLVREQLYRQ AQQLLAEDVP IIPLVQGKLF IATKPNIQVV VDPTMILRYW AIKIA
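The coordinate and length fields in this record can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes 1-based, inclusive coordinates and a CDS whose final codon is a stop codon, and the function names (gene_length, protein_length, gc_percent) are hypothetical helpers introduced here.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace between 10-base groups is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100 * gc_count / len(bases)


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(552674, 554311)
print(length, protein_length(length))  # prints "1638 545"
```

Under these assumptions the results agree with the Gene Length (1638 bp) and Protein Length (545 aa) fields. Passing the full Gene sequence string to gc_percent would give a value to compare against the GC content field.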