Gene Information |
Locus tag | Pisl_1940 |
Symbol | |
ID | 4617877 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum islandicum DSM 4184 |
Kingdom | Archaea |
Replicon accession | NC_008701 |
Strand | - |
Start bp | 1751781 |
End bp | 1753040 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639785031 |
Product | hypothetical protein |
Protein accession | YP_931430 |
Protein GI | 119873423 |
COG category | [S] Function unknown |
COG ID | [COG4938] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
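The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1751781
end_bp = 1753040
gene_length_bp = 1260
protein_length_aa = 419

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the gene is an unspliced CDS whose last codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1

print(span_bp == gene_length_bp)                 # coordinate span matches Gene Length
print(gene_length_bp % 3 == 0)                   # length is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # codons minus stop matches Protein Length
```

Under these assumptions all three checks agree with the listed values: the span is 1260 bp, which is 420 codons, leaving 419 residues once the stop codon is excluded.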
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.34914 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.0000000111377 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGCTACAT ACTTCGGTGG GGTTAGGGTG TCTGTGTCTA ATTTTGGGCC GGTGAGACGG GCTGAGCTGG TGGCGGGGGA CCTCACCGCC GTCATAGGCC CGAATAACGT GGGTAAGTCG TGGCTGGCTT CCCTCATATA CGCGGTTCAC AACCTCGTCT TCTTCCACTT GCCTGCTATT GCACACCACG CCCTGTCTGA GGTGGGTATG CGGAGGTGCT CCGGGGCCGG CGCCGAGTGT CTCAGGGAGC ATCTGGAAGA GGTGGAGAGG GTTGCACTGG AGATGTCGAA GATGTCTACA GGAGAAGCCT TAAAGCGAGA GTTGCAATCC GTGTTGGGGA CAAGCGACTT TGTCACATAC GGCGAAGAGG AGGCGATTGT GGAGGCTAGT GCACCATCGC TGGAGGTCTC TTTTAAGGCG CTTCTTCCCA GGGGAGAGAG GCCGCGCGTT GAGGTAAACA TATCCAGTAG TTTCGTAAAA CGCGCTGTAG AGGACGTCGA GTGGGCGGCG TTCTCAGCCC GCGGGGCGAG CCTCAGCTTT AAAGGGCTTC TCCCCATGCG ACGCTCCGTC TACATTCCCG CCGAGAGGAT TGCCCTCCTC ACGGCGTTCT ACGGAGTCGT GGAGGCGTTG ATTAGGCTGT ACGGAATTAG CCAGGTGGTT AGGGAGGAGC TTCCCAGGCA GTTTTTCAAG CCTTCTCTGT CGCTGTATGC AGCCGACGTG ATAAATCTCA TTAGGAGCGG CGCGGTGGAG AGAGTGGGTG GGCAGTTCGG CGGCGTTGAG ACTGCGCTTC TCGACGGCGA CGTGGCCGTG GACAGGGGGA CCCTGGCGAT TGTGTATGAA TACGCGAGCG GGCTGAAGCT GCCGGCGGCT TTTGCCTCGT CTGGCGTGGC GCAGGTGGCG GGGCTCCTGC TGGGCCTCAT GGTGAGGAGG GGCGACCTGG CGGTGGTGGA GGAGCCCGAG ATCAACCTGC ACGCCAATAG GCAGGTCCAG GTGGCCGAGC TGCTGGCCCG GGAGGCGCGG CGGAGGCCTC TCTTCGTGAC AACCCACTCG GACCTAGTGG TGATGAAGCT GGCCCACCTC TACGCGAAGG GGGAGGTGGA GAGCCTGAGG CTGTACTACC TACACGACGG CGAGCTGGAG GAGCTACCCG TCGACAGAAG CGGGGGCGTG CCCGAGATAA AGTCCATCTC CTCCGTGATA GAGGAGCTGA GCGGAGAGGC CCTAGAGCTA TATGCCCAAC TGCAACGCAG CGTGCGTTAG
|
Protein sequence | MATYFGGVRV SVSNFGPVRR AELVAGDLTA VIGPNNVGKS WLASLIYAVH NLVFFHLPAI AHHALSEVGM RRCSGAGAEC LREHLEEVER VALEMSKMST GEALKRELQS VLGTSDFVTY GEEEAIVEAS APSLEVSFKA LLPRGERPRV EVNISSSFVK RAVEDVEWAA FSARGASLSF KGLLPMRRSV YIPAERIALL TAFYGVVEAL IRLYGISQVV REELPRQFFK PSLSLYAADV INLIRSGAVE RVGGQFGGVE TALLDGDVAV DRGTLAIVYE YASGLKLPAA FASSGVAQVA GLLLGLMVRR GDLAVVEEPE INLHANRQVQ VAELLAREAR RRPLFVTTHS DLVVMKLAHL YAKGEVESLR LYYLHDGELE ELPVDRSGGV PEIKSISSVI EELSGEALEL YAQLQRSVR
|
| |
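The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the database's own method: it assumes the gene sequence is listed 5' to 3' on the coding strand, uses the codon assignments of NCBI translation table 11, and reads an alternative start codon in the first position (here GTG) as methionine. The function name `translate_cds` and the reduced `START_CODONS` set are hypothetical choices for this example.

```python
from itertools import product

# Amino acids for NCBI translation table 11, listed in TCAG codon order
# (TTT, TTC, TTA, TTG, TCT, ...). "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}

# Assumption: only the most common prokaryotic start codons are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    residues = []
    for index in range(0, usable, 3):
        codon = seq[index:index + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # A start codon in the first position is read as methionine.
        if index == 0 and codon in START_CODONS:
            aa = "M"
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence listed above.
print(translate_cds("GTGGCTACAT ACTTCGGTGG GGTTAGGGTG"))  # MATYFGGVRV
```

Passing the full gene sequence field to the same function should reproduce the protein sequence field, provided the assumptions above hold for this record.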