Gene PICST_81488 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_81488 
SymbolNRP1 
ID4836895 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp2508099 
End bp2509892 
Gene Length1794 bp 
Protein Length460 aa 
Translation table12 
GC content46% 
IMG OID640388210 
ProductAsparagine-rich protein (ARP protein) 
Protein accessionXP_001383238 
Protein GI150864428 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.771675 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0322455 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CCAACACCCA TGAGTGACGT CTACATTGTT GTTCACATTG CAACCACCTG CGATGAGTCC 
ACCACGTATG TGACAAAGGA CTCGTCTGAG TTGATTGAGT TTGCCTGGTC AGTAGTCGAT
GCCACAACAC TTGAAAACTT GGACAAGGAG TCAGTTTTGG TGAGACCGGT CAATACACCC
ATCACGCCGT ACTGCTCACA GCTCCACAGA ATCACATGGG AACATGTCAG AAATGCTGGC
TCGTTCAAAG ATGCTATCGT CAAGTTCGAC ACATACATCC AAGACAACAT CATCTCAAAA
AATAAGGAGT TTTCGCTTGT CACTTTCGAC ATGTCCAAGT TGAGAGTACA GTTACCTCGT
GAAGCAAGAG ACAAGGCTGT AGTTTTGCCT CCATATTTGC AACATCCTCG TGTGTTTGAC
TTACTGACAG AATACGCCAA ATGGCAGATC ACACATCCGG AAGCTCTTTC ATACACAGCT
TCATCTCTCT CTAATGTCAT CACTGCTCTT GAAGTAGAAA TCGATCTGGA TACTGAAGAC
GCCTTGAAAA ACCCCTTGTC GTCATCGTCT ACTCCTCCAC CTACGGCGCT GTCGTCGTCC
TCAACCACCT CTTTCTCGTC TGAACCGTCA ACGACCAACC CAGCCAGAAC GTCCCCACCA
GCTGCTCCAG TATCGAGTGA CACTAAGACC ATGGCTACTG TTAACTTATT CACTAAGATC
TTGGTGCAGC TCATCAGAAA GTCTATCCCC ATTGAAGACC ATGCATCTGT GCTTACAAAA
CCATATGATT CTGCTAAAGA TGTCTCTGTG TTCTTGTCAG AAAGATCAAA AATCTTGTAT
CTCTATAATT TGCCCAACGA TACCACCCAG TCAGAGTTGG AATCGTGGTT CACCCAGTTC
GGTGGAAGAC CTATTGCATT CTGGACCTTG AAGAATCTAG ACACTGCTGA GGCTGGTAAA
ACCAACGCTG CTTCTAACTC TCCACATAAG TCCAAGGGAA TTGCTGGTTT TGCTGTGTTT
GCCACTCATG AAGAAGCTTC TGAATCCTTG TCTATGAACG GTAGAGTTTT GAACGATAGG
GCTATCGAAG TGCAACCCTC GTCTACTAGA GTCTTGGATA AGGCTAGTGA CTTGTTGACT
CCCTTCCCGC CTTCGAAGAA CAGAGCTAGA CCTGGTGACT GGACTTGTCC TTCGTGTGGA
TTTTCCAACT TCCAGAGAAG AACACACTGT TTCAGATGCT CTTTCCCTGC TTCTAGTGCT
GTGGCCATCC AGGAGTCGAT CTATTCCAAC AATAACCATA ATGGCAACAA CCGTCGTACC
GGCAACAACA ATGGCTACAT CAACGGAGCC AATGGCAATC AAAACAATAA CAACAACAAT
GGCTACCTAA GCGGAGTCAA CGGTAATCAA AACAATAACA ATAGCAATAG TACGAACGGT
CAACCCAACT TCAAGTTGGC GTTGAACTCT GCTGTTGTCA ATGCTGCTGC TGCTGTTTTG
AACTCCAACA ACTACGGTAA CCCGTCTTAC AATGATAACG GCTCCCAGCA GAGAGGTAGC
GACTCTCCAT ATTCAGTTCA ACAAAGTCAA AGTGGCGTCA ACCACCACAA TAATAACTCG
AACCACAACC AGGGACATCA TAACAACAAC AACAACAACC ATTCTCGCTT GCATTACAAC
AATAGTGTTC CATTCAGAGC TGGAGACTGG AAGTGTGAAG TATGTATATA CCACAACTTC
GCTAAAAACT TGTGTTGTTT GAAGTGTGGA GCTTCCAAGC CAGCACTTGC TATA
 
Protein sequence
MSDVYIVVHI ATTCDESTTY VTKDSSELIE FAWSVVDATT LENLDKESVL VRPVNTPITP 
YCSQLHRITW EHVRNAGSFK DAIVKFDTYI QDNIISKNKE FSLVTFDMSK LRVQLPREAR
DKAVVLPPYL QHPRVFDLST EYAKWQITHP EALSYTASSL SNVITALEVE IDSDTEDALK
NPLSTSPPAA PVSSDTKTMA TVNLFTKILV QLIRKSIPIE DHASVLTKPY DSAKDVSVFL
SERSKILYLY NLPNDTTQSE LESWFTQFGG RPIAFWTLKN LDTAEAGKTN AASNSPHKSK
GIAGFAVFAT HEEASESLSM NGRVLNDRAI EVQPSSTRVL DKASDLLTPF PPSKNRARPG
DWTCPSCGFS NFQRRTHCFR CSFPASSAVA IQDGVNHHNN NSNHNQGHHN NNNNNHSRLH
YNNSVPFRAG DWKCEVCIYH NFAKNLCCLK CGASKPALAI