Gene Information |
Locus tag | PICST_84561 |
Symbol | NAG4 |
ID | 4839764 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | + |
Start bp | 11203 |
End bp | 13155 |
Gene Length | 1953 bp |
Protein Length | 604 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640391079 |
Product | Synaptic vesicle transporter SVOP |
Protein accession | XP_001385337 |
Protein GI | 126137628 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00880] Multidrug resistance protein |
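The Gene Length and GC content fields above can be cross-checked against the coordinates and the gene sequence. The Python sketch below assumes that Start bp and End bp are 1-based and inclusive, which matches the reported 1953 bp, and that GC content is the fraction of G and C bases. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose coordinates are 1-based and inclusive (assumed)."""
    return end_bp - start_bp + 1


def gc_fraction(sequence: str) -> float:
    """Fraction of G/C bases; whitespace from the 10-base block layout is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


print(gene_length(11203, 13155))  # 1953
# First two blocks of the gene sequence, used here only as a short input.
print(gc_fraction("ATGGATCCAA ATCATACAAT"))
```

Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the rounded 42% in the table.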
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATCCAA ATCATACAAT TAGCGCATCG GAAGATGACG ACTCGAACAC CATTGACAAT GCCTCTCTCG ACTCCGAAAA GGTGCAATCT TACGGTGATG CCTTGAACAC AGAACAAGAA CACCAGAACT TGACACCACA CAGGTCTATC AGCAGAATCT TACATTCCAT TAACAGTCGT GAGTCTGAAG ACAAAGAAGA TAGAGAAGAG TTGAAGAAGT TAATCTCCAA CAACAAGGGT GTCGAAAGAA TTATTTCTGA ATTGGAAGAA GGTACCGGTA GATTAGGTCC TTTGGAACAG CCTTACGACT TAAAGAGAAT AACAACCCAG GCTGACCCTA ATAGTGATTT CAATGAAGCC GATCCCTGGA AATATCCCCT TGATTCAGAA TCGGGATTGA GAATTGTCGA GTTTGTTCCA AATGACGAAA AGAACCCAAA GAACTTATCT ATAACCACCA AGTGGGTGTA CACTGGTGTA TTGGGTTTCA TGTGTTTTGT GGTGGCTCTT GGTTCTGCCA TTGTCACGGG TGATTTAGAA AGACCAGCTG AATACTTTGG AGTAAGTCAA GAAGTCATTA TTTTAGCTTC CGTAACGGTA TTCGTCATTG GTTTCGGTGT TGGTCCATTA CTTTTCGCAC CAATGTCAGA AGAAGTCGGA AGAAAGCCTA TATATGCAAC AACTTTGGCT ATTGCTGTTG TTTTCATTGT GCCATGTGGT GCAGCTAAGA ACATCGGTAC TTTACTTGTT TGCCGGTTGA TCGACGGTAT TGCTTTCAGT GCTCCAATGA CTCTTATAGG AGGTTCTCTT GCTGATATCT GGGAGGGTAA GGACCGTGGT ACTGCTATGG CTATTTTTTC TGCTGCTCCT TTCTTGGGTC CCGTTACTGG TCCCATCTTC GGTGGTCTTT TGGCTGATCA CGCTCCTACC TGGAGATGGA TCTACTGGAC TTTCTTGATT ATTGCCGGTT TCGTCTATGT CTTGTTCATT TCCATTGTTC CAGAAACCCA TGCTGGTACC TTGTTGAAGA AGAGAGCTAA GCAGTTGAGA AAGGAAACGG GCGACTCTAG ATACAGATCT TTTAATGAAT TGAAGATCAG ATCCTTCAGT GAAGTAGCCA AATCTTCGTT GTTGAGACCA TTCGTCTTGT TAAATGAATT GATTGTCTTC TTAATGACTT TGTACATGTC TGTTGTCTAC GGTTTATTGT ACATGTTCTT CTTTGCATAC CCAATGGTTT TCCAAGAAGG AAAGGGTTTC TCTGCATCTT TGACCGGTGT CATGTTTATT CCAATTGGTG CAGGTGTTGG TCTTGCCACT TTGGCTGCCC CATTCTTCAA CAAAGACTAC AATAAGAGAG CTCAAGTCTA TAGAGACAGA GGTGAATTAC CACCTGCCGA ATTGAGATTG ATTCCAATGA TGGTTAGTTG TTGGTTCGTT CCAGCTGGTT TATTTGCCTT TGCCTGGTCT TCTTACCCAA CTATCTCCTG GGCCGGTCCA TGTTTCTCAG GATTGGGGGT TGGATTCGGT TTCTGTTGTT TGTACAACCC TGCCAACAAC TACATTGTTG ACTCTTACCA ACATTATGCC GCCTCTGCCT TGGCTGCTAA GACCTTCGTT AGATCCATCT GGGGTGCTTG CGTTCCACTT TTCACAATTC AAATGTACCA CAGATTAGGT TTGGAATGGG CCAGTACCTT GATGGCTTTC ATATCGTTGG CTTGTTGTGC TATTCCATTC TTGTTCTTCA TCTACGGTGC AAGAATCAGA ACGTTCTCTA AGTACGCATA CTCTCCAAAC 
ATGGATACCA AGAAAGATGA AGAGAACCAG AAGTAATTAA CTTCCTTCTG AGGTTTCATC ATTTGCATTT TACAGTGATT TGGTAAGCCT AAATCAATAT TTATTACAGT ACAGACTAGA TATAATAAAA TAATAATACA TATATACATA TTT
|
Protein sequence | MDPNHTISAS EDDDSNTIDN ASLDSEKVQS YGDALNTEQE HQNLTPHRSI SRILHSINSR EEELKKLISN NKGVERIISE LEEGTGRLGP LEQPYDLKRI TTQADPNSDF NEADPWKYPL DSESGLRIVE FVPNDEKNPK NLSITTKWVY TGVLGFMCFV VALGSAIVTG DLERPAEYFG VSQEVIILAS VTVFVIGFGV GPLLFAPMSE EVGRKPIYAT TLAIAVVFIV PCGAAKNIGT LLVCRLIDGI AFSAPMTLIG GSLADIWEGK DRGTAMAIFS AAPFLGPVTG PIFGGLLADH APTWRWIYWT FLIIAGFVYV LFISIVPETH AGTLLKKRAK QLRKETGDSR YRSFNELKIR SFSEVAKSSL LRPFVLLNEL IVFLMTLYMS VVYGLLYMFF FAYPMVFQEG KGFSASLTGV MFIPIGAGVG LATLAAPFFN KDYNKRAQVY RDRGELPPAE LRLIPMMVSC WFVPAGLFAF AWSSYPTISW AGPCFSGLGV GFGFCCLYNP ANNYIVDSYQ HYAASALAAK TFVRSIWGAC VPLFTIQMYH RLGLEWASTL MAFISLACCA IPFLFFIYGA RIRTFSKYAY SPNMDTKKDE ENQK
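Translation table 12 is the NCBI alternative yeast nuclear code, in which CTG is read as serine rather than leucine, so translating this gene with the standard table can give wrong residues. The Python sketch below uses illustrative names and translates an already spliced coding sequence. It does not remove introns, which a spliced gene like this one needs before translation, and it does not handle ambiguous bases.

```python
BASES = "TCAG"

# Standard genetic code in NCBI order (first base T, C, A, G; then second; then third).
STANDARD_AAS = (
    "FFLLSSSSYY**CC*W"  # T..
    "LLLLPPPPHHQQRRRR"  # C..
    "IIIMTTTTNNKKSSRR"  # A..
    "VVVVAAAADDEEGGGG"  # G..
)

# Table 12 differs from the standard code only at CTG (Leu -> Ser).
CTG_INDEX = 16 * BASES.index("C") + 4 * BASES.index("T") + BASES.index("G")
TABLE_12_AAS = STANDARD_AAS[:CTG_INDEX] + "S" + STANDARD_AAS[CTG_INDEX + 1:]


def build_table(amino_acids: str) -> dict:
    """Map each of the 64 codons to its one-letter amino acid."""
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    return dict(zip(codons, amino_acids))


STANDARD_TABLE = build_table(STANDARD_AAS)
TABLE_12 = build_table(TABLE_12_AAS)


def translate(cds: str, table: dict = TABLE_12) -> str:
    """Translate a spliced coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = table[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence; matches the start of the protein sequence.
print(translate("ATGGATCCAA ATCATACAAT TAGCGCATCG"))  # MDPNHTISAS
```

The first 30 bases lie upstream of any intron, which is why they translate directly to the opening residues of the listed protein.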