Gene Information |
Locus tag | PICST_81488 |
Symbol | NRP1 |
ID | 4836895 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | - |
Start bp | 2508099 |
End bp | 2509892 |
Gene Length | 1794 bp |
Protein Length | 460 aa |
Translation table | 12 |
GC content | 46% |
IMG OID | 640388210 |
Product | Asparagine-rich protein (ARP protein) |
Protein accession | XP_001383238 |
Protein GI | 150864428 |
COG category | |
COG ID | |
TIGRFAM ID | |
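The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, which is consistent with the listed gene length of 1794 bp. The function and variable names are illustrative and do not come from the source database.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the record above.
start_bp, end_bp = 2508099, 2509892
protein_length_aa = 460

gene_length = span_length(start_bp, end_bp)   # span on the replicon
coding_bp = protein_length_aa * 3             # bases needed to encode the listed protein
extra_bp = gene_length - coding_bp            # bases in the span not accounted for by the protein

print(gene_length)  # 1794
print(coding_bp)    # 1380
print(extra_bp)     # 414
```

The 414 bp difference between the gene span and the bases needed for 460 residues would be consistent with the gene being flagged as spliced. The record does not list exon boundaries, so that reading remains an inference.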
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.771675 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0322455 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | CCAACACCCA TGAGTGACGT CTACATTGTT GTTCACATTG CAACCACCTG CGATGAGTCC ACCACGTATG TGACAAAGGA CTCGTCTGAG TTGATTGAGT TTGCCTGGTC AGTAGTCGAT GCCACAACAC TTGAAAACTT GGACAAGGAG TCAGTTTTGG TGAGACCGGT CAATACACCC ATCACGCCGT ACTGCTCACA GCTCCACAGA ATCACATGGG AACATGTCAG AAATGCTGGC TCGTTCAAAG ATGCTATCGT CAAGTTCGAC ACATACATCC AAGACAACAT CATCTCAAAA AATAAGGAGT TTTCGCTTGT CACTTTCGAC ATGTCCAAGT TGAGAGTACA GTTACCTCGT GAAGCAAGAG ACAAGGCTGT AGTTTTGCCT CCATATTTGC AACATCCTCG TGTGTTTGAC TTACTGACAG AATACGCCAA ATGGCAGATC ACACATCCGG AAGCTCTTTC ATACACAGCT TCATCTCTCT CTAATGTCAT CACTGCTCTT GAAGTAGAAA TCGATCTGGA TACTGAAGAC GCCTTGAAAA ACCCCTTGTC GTCATCGTCT ACTCCTCCAC CTACGGCGCT GTCGTCGTCC TCAACCACCT CTTTCTCGTC TGAACCGTCA ACGACCAACC CAGCCAGAAC GTCCCCACCA GCTGCTCCAG TATCGAGTGA CACTAAGACC ATGGCTACTG TTAACTTATT CACTAAGATC TTGGTGCAGC TCATCAGAAA GTCTATCCCC ATTGAAGACC ATGCATCTGT GCTTACAAAA CCATATGATT CTGCTAAAGA TGTCTCTGTG TTCTTGTCAG AAAGATCAAA AATCTTGTAT CTCTATAATT TGCCCAACGA TACCACCCAG TCAGAGTTGG AATCGTGGTT CACCCAGTTC GGTGGAAGAC CTATTGCATT CTGGACCTTG AAGAATCTAG ACACTGCTGA GGCTGGTAAA ACCAACGCTG CTTCTAACTC TCCACATAAG TCCAAGGGAA TTGCTGGTTT TGCTGTGTTT GCCACTCATG AAGAAGCTTC TGAATCCTTG TCTATGAACG GTAGAGTTTT GAACGATAGG GCTATCGAAG TGCAACCCTC GTCTACTAGA GTCTTGGATA AGGCTAGTGA CTTGTTGACT CCCTTCCCGC CTTCGAAGAA CAGAGCTAGA CCTGGTGACT GGACTTGTCC TTCGTGTGGA TTTTCCAACT TCCAGAGAAG AACACACTGT TTCAGATGCT CTTTCCCTGC TTCTAGTGCT GTGGCCATCC AGGAGTCGAT CTATTCCAAC AATAACCATA ATGGCAACAA CCGTCGTACC GGCAACAACA ATGGCTACAT CAACGGAGCC AATGGCAATC AAAACAATAA CAACAACAAT GGCTACCTAA GCGGAGTCAA CGGTAATCAA AACAATAACA ATAGCAATAG TACGAACGGT CAACCCAACT TCAAGTTGGC GTTGAACTCT GCTGTTGTCA ATGCTGCTGC TGCTGTTTTG AACTCCAACA ACTACGGTAA CCCGTCTTAC AATGATAACG GCTCCCAGCA GAGAGGTAGC GACTCTCCAT ATTCAGTTCA ACAAAGTCAA AGTGGCGTCA ACCACCACAA TAATAACTCG AACCACAACC AGGGACATCA TAACAACAAC AACAACAACC ATTCTCGCTT GCATTACAAC AATAGTGTTC CATTCAGAGC TGGAGACTGG AAGTGTGAAG TATGTATATA CCACAACTTC GCTAAAAACT TGTGTTGTTT GAAGTGTGGA GCTTCCAAGC CAGCACTTGC TATA
Protein sequence | MSDVYIVVHI ATTCDESTTY VTKDSSELIE FAWSVVDATT LENLDKESVL VRPVNTPITP YCSQLHRITW EHVRNAGSFK DAIVKFDTYI QDNIISKNKE FSLVTFDMSK LRVQLPREAR DKAVVLPPYL QHPRVFDLST EYAKWQITHP EALSYTASSL SNVITALEVE IDSDTEDALK NPLSTSPPAA PVSSDTKTMA TVNLFTKILV QLIRKSIPIE DHASVLTKPY DSAKDVSVFL SERSKILYLY NLPNDTTQSE LESWFTQFGG RPIAFWTLKN LDTAEAGKTN AASNSPHKSK GIAGFAVFAT HEEASESLSM NGRVLNDRAI EVQPSSTRVL DKASDLLTPF PPSKNRARPG DWTCPSCGFS NFQRRTHCFR CSFPASSAVA IQDGVNHHNN NSNHNQGHHN NNNNNHSRLH YNNSVPFRAG DWKCEVCIYH NFAKNLCCLK CGASKPALAI