Gene PICST_66987 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_66987 
SymbolNBP2 
ID4837596 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp1548554 
End bp1549822 
Gene Length1269 bp 
Protein Length237 aa 
Translation table12 
GC content40% 
IMG OID640388911 
Productprotein that interacts with Nap1, which is involved in histone assembly 
Protein accessionXP_001383070 
Protein GI126133090 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.315379 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
AACATAGAAC ACATAACAGA CACGTTTTAC AAAACAGCCG GGATATATAT CTAACTTTGG 
TTATAAGCTT TGTTTCTGTA GCTGCCAGAT TCAATAATTG TTAGAGATTC TTGAATTACA
ATTGTAGAAT CGTCAAATTG TTAACTTGCT GTCAATTTCA CTTTTGTCAT AAAATACAAA
AGAACAGAAG ATATCATTGA ACAACAAGAA TTGTTCTATT CACTAACAGG ATAACAAGAG
TCTGATCCAA TTCATTCTCA TCAATCTCAA TATACTTCAG TTTGAGGAAA GCCAGATCAT
AAAGATTGCC TCCTAGATAC AGAGACTTAT ACACACAACA CACGAGTTTC ACTGAACACT
TCATCAAACT ACTGAACTTT GATTCTATTG ACTTATTCTA GATTCGATCT TTCATCAAGT
CCATCAAATT TTTCATCATC GTATTTATCG TTCGCTGAAC CATGGCTGAC GAAGATCACA
AGAATATAAG TCTATACCTT CCCAACACCG TGATAAAGGA CTACGGGTAT CCCGAGAGCC
ATCCGTTGCA TACTGGCAAT TTTGGAGCTA TGGGCGATCC TGAGGATGTT GACGATGAAG
ACATCAACAG TGATGACGAC TATGGCTATC TCCTTTCTGC TGCCAACAAC AATACTACAC
ATTACCATAG TCTCGTGCGG AGCATTGACG ACGAGGATGA CGACGAAGAG ATGAACAGCA
AACTTAATGA CGATAACGAC TATATTTATA ATGGAGACGA CGATGGCGGT CGATCCAACG
ATGAAATCAA CTGTAGAGCC AGAGCGTTGT TCGATTTCCA GCCAGAAAAC GATAATGAAG
TGGCATTGAC AGAGGGACAG ATAATCTGGA TTTCGTATAG ACATGGACAG GGGTGGCTCG
TAGCAGAAGA TCCAGAGTCG GGCGAAAACG GCTTGGTTCC GGAAGAGTAT GTAGAGATCT
TCTACACCAA GGAAGAAGTA GCTGACGATG ATGTTCCCAA ACCGTTTCTA CCTGAGCTCT
TGCAAAATTT GGAAGAAGAC GACAGTGACG ACGCCGACTG GGTAGACACA GACTACGACG
AAGATGAAGA TGAAGATGAG CAATCGGAGG ATGAGCACAT AAACGAAGAG GCAGACATTT
CAGACAGATT ACACGATGTC AAGTTGGTAT CGTGACGTCG TTGTTAATAT ACACTATTGT
AATTAGAAAC AGTATATAAT AATAGTATTA ATAGTGAAGT GAACTATCAA ACACAGCATA
TGCTATCTG
 
Protein sequence
MADEDHKNIS LYLPNTVIKD YGYPESHPLH TGNFGAMGDP EDVDDEDINS DDDYGYLLSA 
ANNNTTHYHS LVRSIDDEDD DEEMNSKLND DNDYIYNGDD DGGRSNDEIN CRARALFDFQ
PENDNEVALT EGQIIWISYR HGQGWLVAED PESGENGLVP EEYVEIFYTK EEVADDDVPK
PFLPELLQNL EEDDSDDADW VDTDYDEDED EDEQSEDEHI NEEADISDRL HDVKLVS