Gene PICST_84561 details

Gene Information

Locus tag: PICST_84561
Symbol: NAG4
ID: 4839764
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009046
Strand:
Start bp: 11203
End bp: 13155
Gene Length: 1953 bp
Protein Length: 604 aa
Translation table: 12
GC content: 42%
IMG OID: 640391079
Product: Synaptic vesicle transporter SVOP
Protein accession: XP_001385337
Protein GI: 126137628
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00880] Multidrug resistance protein


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGATCCAA ATCATACAAT TAGCGCATCG GAAGATGACG ACTCGAACAC CATTGACAAT 
GCCTCTCTCG ACTCCGAAAA GGTGCAATCT TACGGTGATG CCTTGAACAC AGAACAAGAA
CACCAGAACT TGACACCACA CAGGTCTATC AGCAGAATCT TACATTCCAT TAACAGTCGT
GAGTCTGAAG ACAAAGAAGA TAGAGAAGAG TTGAAGAAGT TAATCTCCAA CAACAAGGGT
GTCGAAAGAA TTATTTCTGA ATTGGAAGAA GGTACCGGTA GATTAGGTCC TTTGGAACAG
CCTTACGACT TAAAGAGAAT AACAACCCAG GCTGACCCTA ATAGTGATTT CAATGAAGCC
GATCCCTGGA AATATCCCCT TGATTCAGAA TCGGGATTGA GAATTGTCGA GTTTGTTCCA
AATGACGAAA AGAACCCAAA GAACTTATCT ATAACCACCA AGTGGGTGTA CACTGGTGTA
TTGGGTTTCA TGTGTTTTGT GGTGGCTCTT GGTTCTGCCA TTGTCACGGG TGATTTAGAA
AGACCAGCTG AATACTTTGG AGTAAGTCAA GAAGTCATTA TTTTAGCTTC CGTAACGGTA
TTCGTCATTG GTTTCGGTGT TGGTCCATTA CTTTTCGCAC CAATGTCAGA AGAAGTCGGA
AGAAAGCCTA TATATGCAAC AACTTTGGCT ATTGCTGTTG TTTTCATTGT GCCATGTGGT
GCAGCTAAGA ACATCGGTAC TTTACTTGTT TGCCGGTTGA TCGACGGTAT TGCTTTCAGT
GCTCCAATGA CTCTTATAGG AGGTTCTCTT GCTGATATCT GGGAGGGTAA GGACCGTGGT
ACTGCTATGG CTATTTTTTC TGCTGCTCCT TTCTTGGGTC CCGTTACTGG TCCCATCTTC
GGTGGTCTTT TGGCTGATCA CGCTCCTACC TGGAGATGGA TCTACTGGAC TTTCTTGATT
ATTGCCGGTT TCGTCTATGT CTTGTTCATT TCCATTGTTC CAGAAACCCA TGCTGGTACC
TTGTTGAAGA AGAGAGCTAA GCAGTTGAGA AAGGAAACGG GCGACTCTAG ATACAGATCT
TTTAATGAAT TGAAGATCAG ATCCTTCAGT GAAGTAGCCA AATCTTCGTT GTTGAGACCA
TTCGTCTTGT TAAATGAATT GATTGTCTTC TTAATGACTT TGTACATGTC TGTTGTCTAC
GGTTTATTGT ACATGTTCTT CTTTGCATAC CCAATGGTTT TCCAAGAAGG AAAGGGTTTC
TCTGCATCTT TGACCGGTGT CATGTTTATT CCAATTGGTG CAGGTGTTGG TCTTGCCACT
TTGGCTGCCC CATTCTTCAA CAAAGACTAC AATAAGAGAG CTCAAGTCTA TAGAGACAGA
GGTGAATTAC CACCTGCCGA ATTGAGATTG ATTCCAATGA TGGTTAGTTG TTGGTTCGTT
CCAGCTGGTT TATTTGCCTT TGCCTGGTCT TCTTACCCAA CTATCTCCTG GGCCGGTCCA
TGTTTCTCAG GATTGGGGGT TGGATTCGGT TTCTGTTGTT TGTACAACCC TGCCAACAAC
TACATTGTTG ACTCTTACCA ACATTATGCC GCCTCTGCCT TGGCTGCTAA GACCTTCGTT
AGATCCATCT GGGGTGCTTG CGTTCCACTT TTCACAATTC AAATGTACCA CAGATTAGGT
TTGGAATGGG CCAGTACCTT GATGGCTTTC ATATCGTTGG CTTGTTGTGC TATTCCATTC
TTGTTCTTCA TCTACGGTGC AAGAATCAGA ACGTTCTCTA AGTACGCATA CTCTCCAAAC
ATGGATACCA AGAAAGATGA AGAGAACCAG AAGTAATTAA CTTCCTTCTG AGGTTTCATC
ATTTGCATTT TACAGTGATT TGGTAAGCCT AAATCAATAT TTATTACAGT ACAGACTAGA
TATAATAAAA TAATAATACA TATATACATA TTT
 
Protein sequence
MDPNHTISAS EDDDSNTIDN ASLDSEKVQS YGDALNTEQE HQNLTPHRSI SRILHSINSR 
EEELKKLISN NKGVERIISE LEEGTGRLGP LEQPYDLKRI TTQADPNSDF NEADPWKYPL
DSESGLRIVE FVPNDEKNPK NLSITTKWVY TGVLGFMCFV VALGSAIVTG DLERPAEYFG
VSQEVIILAS VTVFVIGFGV GPLLFAPMSE EVGRKPIYAT TLAIAVVFIV PCGAAKNIGT
LLVCRLIDGI AFSAPMTLIG GSLADIWEGK DRGTAMAIFS AAPFLGPVTG PIFGGLLADH
APTWRWIYWT FLIIAGFVYV LFISIVPETH AGTLLKKRAK QLRKETGDSR YRSFNELKIR
SFSEVAKSSL LRPFVLLNEL IVFLMTLYMS VVYGLLYMFF FAYPMVFQEG KGFSASLTGV
MFIPIGAGVG LATLAAPFFN KDYNKRAQVY RDRGELPPAE LRLIPMMVSC WFVPAGLFAF
AWSSYPTISW AGPCFSGLGV GFGFCCLYNP ANNYIVDSYQ HYAASALAAK TFVRSIWGAC
VPLFTIQMYH RLGLEWASTL MAFISLACCA IPFLFFIYGA RIRTFSKYAY SPNMDTKKDE
ENQK
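
The sequences above are displayed in blocks of 10 characters, 60 per line, so they must be joined before any length or composition check. The following is a minimal Python sketch of how the record's figures could be sanity-checked. The helper names (clean_sequence, span_length, gc_percent) are hypothetical and not part of any database tool, and the sketch assumes the Start bp and End bp coordinates are 1-based and inclusive, which is consistent with the reported gene length of 1953 bp.

```python
def clean_sequence(blocks: str) -> str:
    """Join whitespace-separated display blocks into one uppercase string."""
    return "".join(blocks.split()).upper()


def span_length(start: int, end: int) -> int:
    """Length of a span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in an uppercase DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First display line of the gene sequence from this record.
first_line = "ATGGATCCAA ATCATACAAT TAGCGCATCG GAAGATGACG ACTCGAACAC CATTGACAAT"
dna = clean_sequence(first_line)

print(len(dna))                    # 60 bases on one display line
print(span_length(11203, 13155))   # 1953, matching the reported gene length
```

To check the reported GC content, pass the full cleaned gene sequence to gc_percent and compare the result with the 42% given in the record.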