Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_27987 |
Symbol | VCC3.1 |
ID | 4850782 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 1841 |
End bp | 8191 |
Gene Length | 6351 bp |
Protein Length | 2116 aa |
Translation table | |
GC content | 42% |
IMG OID | 640392490 |
Product | vesicle coat complex AP-3 |
Protein accession | XP_001387243 |
Protein GI | 126273507 |
COG category | [R] General function prediction only |
COG ID | [COG5271] AAA ATPase containing von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.643082 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATACTC AAGCACTAGG CTATTCAATT CATAATCCAA TAACTTTGGA TCTGGACCTG GATCTGGATC TGGACCAGAA CTTCGAAGAT GCTCTAGATG ACTTCAGCCA AGTAGATATC AAATGTGAAA ATTCCAATCT CATTTCTACC GATGAGCATA TTTGGTCGAG TCCTCAATCA TTTAGACAGA GTCTGCCATT CTCATCAATA CAAGCTACTC CATTGAGGAC AAATCTGTTC AAACACGAAC TTGACTACTT GGCCAAAAAT TACTTTGTGT TTGTTGAGCC TCTAGGATAC TTCTATCTGG AAAAAAGACA TTTTTTCATC TCTTCAGATT CAAAATATGC TACTTCCAAG TTATATATAA AGTCGAAAGA AAGATTTCTT GACCTCATTA GCAAACTAAC AAAACAATCA CCTTATTACA ACCAAATAGA ACGTGAGTCA AATATTGTGA ATGTGGCTAC TATTCCAAGC CCTATCCCTG GCATTGAAAT AGAAGAATTT GCCTTCTGTG ATTCCTGTAA ATACACCAGC TGTATTGGGC GCGTCCCAGT GAAATGTACC AGTCATGGAT ACATGAGAAG GGGTGAAGGC TATAGAGTGG AACTGGGAGG TTTCTTTCTA TTGAAGGAAT TCCCACGTAT GCCTTCTTTA CCTGATACTC CTCCACCAAG CACTCCAAAT ATGTATGATC GTAGTCCGAC CATACCTTGC TCTTCTCCCA TCCGCCGCCG ATCTCGCTTC ACCAATAGAG ACTTGCAAGT CGAATTCAAG GGCCCTCTTA TCACCGAAAT CCGTGTGAGA GAATCTGAGA ATATGACCGC TGATGCCTTT TTGCAAAAAT ACGCCAATAT CAAATTCATT CCTGATGTGG GGTTATACTG GCACGAACAT GAACGTGTGT TTTTGGATAG AAACTCTGAA ATTGTTCACA AATCATTGTC TTCTATCATT GACAAATGCA AGTTTCTTGA AACACACATG CATGCAGTAA TGCTGAATTC CTACTACTGG TATTTGTTAG CCAGTGCTAA TACGCTGTCC GAGGTACATA TATGTCTTGA ATGGTTGAAG GAACCTCTAA GTGGCATAGA ACCTTCAAAG ATCCAGTACT GCCCTGATTG CATCAATTCG GCTATCATCG ATGACCATTG TGAGGTCGAT AACCTTAATG TTCTAACTGG GATGGGATAC AGAATGGAGC TGGGGTCTAT TTTCTTATTA GCACTGGGTG GAGGAATGAC AGACATTTTG GAAGATCAAC GAGAGGAGGC TCCCGATCTC GAATATTATG CCTCTGATTC TGGTGCTGTC TTCCTGAGCG ATGAACAAGA AGCTGAGGGG TCAGTAGACT ATGCTTCTGA TATGGAAAAT CTCACAGTAG ATGGTCAAGA AGGTGACGAA TTGACAGCAG TAAATGGAGA ATCTGAGTCG GATACTCTCT TAGAAGATGA AGTTCAAGAA GGTGACTTTA GATCATTTGA CGATTCTTCT GATTCGGATA ATCTCTTTGT AGATGATGCA CAAGAAGATG ATGAAGAAAT GATTGACATT GCAATAGATA ATGATACTGA ATCGGATAAT GACGAATCAG ACATTTCAGC CTATGATACC ACCGATGATG GTCTTGATAA CGAACGTGAT TCAGAAGAAA ATATATTATA TGATGAAGAT CCTGTTAGGA TCTCCACTGT TAGTGGATGG TCTCAGCTAA AAAAGAAAGG TATTTGTCAA ATTAGCAATC TTCTTTGGCT GAGAGGACAA CAACACTTCC TTGACATCAC CTCCAGGGCT TTCAGATCGT TTCTCATGGG TACAGTGTCA AATGCACAAT ACACCCAGAT GTTGGAAGAT GTCGAGGAAT GGGTCGATCT AATGCCTCAG TACTATAGAA TTCCAAAAGA TACCAAGGGT AGAACATTTG AAATTCCCCT GCCCTTTTCA AGACTCCCTC CAAAAATACA TACCTACTGT TCGGAATGTG GATTTGTAAT AGGATTCAAC GAGAACAATG AAAACTTGAG AAGCTCTTCC CGTCATGCAA AGAATACTGG TCATGATAGA TTCATCAGTC GCCAAGGGTA TGCTATCAGA GCCGCAACAG GATTCAATCT CTTGGTCATA TTTGGGTCTA ACGAAACAGT CAGTACATGT GAATCGGAAG ATGGCGATTA TGATGATGAA GATATTGTTA TTCCTTCATT CGAAGAGGGA CTGGGTTCAC CATTCACACC CCCTGAATTC ATAGTGGAGC CGTACTCCAC AGACGAAGAG GCCAGCGACT ACGAACAAGA GCCTGAATCT CTTCCACCTC CATCTAAGGA AGAACAATAT GAAGTGTCCG AACTATTGAA ACATTCCCAT GGGTTAGTTT ACGATCCTAT CTTCTACTTG TACTACTACA TAACAGAAGA GCGGTATGTT GACGTGAATT CAAAGTTTTT CAAAAACACA TTGGAAGATA TTGGTGACCA CAAAAGCCGC ATTCGAAACT TGGTGGACCG TGTCATTAGC ACTCAAATCA TTGACTCTCG AAAATTCGAG AAGAAAGTAA CCAGAGGTCA CATCCACTGG GCCAAAGAGT TGTCCGAAGT AGCTCGCTGG AAGAAGCTAT GCTTTTGTAA ACAATGCTAT CAGGTATTCA AAACAAAAAC TGCATTTTCC AAGCACAGGC GTGAATATGC GAACATGAGC GAAAGAGAAC ATGCTGGTCA TGATATAAGT GCGGATTTTA AAGGATACCT TTTGGAGCGA CCAAAGGGCC TCTACTATGC AATTGGGATT CCAGATGATG TTGCTCAAGC CGAAGCTGCT GAAGCCGAAG CTACCGCTAC TGCTGAAGAA ATTGCAAGTC TGAAATCGCA ATGGGTTATG GGTGGGCCTG GGTGGACGGT GGAAGAATAT GATATCAAAG TGCAATGGTT GGAGGGCAAG CTAGGTATGC CTCCCGACCA ATTCTTGGCC AAGCACAACT ACTTACTAGT GAAGGAGGTT GGTTTATACT ACGATTTCAC GCAGAAGTGG TTCCGAAGTA TAACACACAG GTTGCTGAGG AAGGTCTTCT TCGGCGACTT AATACTGACG GTGAGGCGTG AGTTAATATC TTACATGCTC CACAATATGG ATAGAAAGTA TGAATACATG TTGGCACCCC GTCACCAGCA GCAAGTTGCT ACTTTGGAAG CAGCAGTAAA ATATGACGGT CCAATGGTGG CGGTGGAAAG TGGCAACGAA GAGCATCTAG ATTTCGAGTT GGTAGATAAT TGCATACAGC CAACTAAATA CTACATGTGT CATGTGTGTA AGTTCATTGC CCACCACTCC AGCGTCTCCG CTCATATCAA ACAAGCCAAG CACCAACGAG AAGAGGGACC TGAAAAACAT GTTGTGTATG GTTACAGTGT TCGGAATGTA GAGGCTAAGA CAAACTACTT CCGAATTCAA CACAATCTAG CCAGTAGTTT TCACGAACCA ACAGACAGCA CTGCTGACTC TCAAATTGTA GATCCGTTTT TATCCAGATA CAAAGCGCAT GAGTATACAA AGTTCCGTTC ATTTTTGAAC ATGTCTAATG ACAAAAGATG GCCACGTTTG GTACAATTGT GCAAGAAATA TGTAACCAAG AAAAGATTAG CTTTGAGCAT CGGATCTTCT CTCTCAGCAC TCATCAAAAA CCCCCGGTTC TGTAGCACTC CAAACTCAGC CCTGCGATAC GGACCCATGC TTGCCAGATT CCTCTTTGTT GCCATGAAAA TGTGCGAGTT GAAGGAACAC AAAGCAATGG AAGAGTTTGA GCAAGTATTC TGTGCCAACA AAAAACAGGG ACTACGAAGA TTTTTCGGTC AAATGACGAA ATTTGGTGCG TTCTGGTTGT TGTCTCCGGT ACTAGTGTTG CAGGAAGACG ATTCAAACGC CATGGAAGAT GTCTTTATGA GCAGTCTAAT GGTGTGTCTT CTTGATGAAG GAGGAATAGG ATTCAAGATC AAGTCGTACC GCTGCGATTT TTGCAGTGGT TTATCTTTTG CATTCAAATG CATAGCTGTT GATGTGATGA AAAAAGACAA GTCTTATGTC CGACCTTTTG TTGAGGATTG TAACAGGGCG GCATCCATAG CACTCAACTT TGGCCGATTT TGCAAGTTCA GTGCCGATGT ACACTCGTTG AGTCCTCGTA CATTTGGAAG GACCATGAAC GTGCTACATG TTGGAGACAA AATAAGAATA AATACATTTG AAGTCGGACT CGACGACATC AAAAAATGGG TCCATACTGC CATTCAGCAA TATGTACTTG AAGTTGAGGA ACTTTTCATG GACATCAACA TCACGCCGAC CATGGCCATG AATGAGATGA TGGCAGCACA AAAACCAAGA GTTGTTGGAA TGCCTCAAAC TAGTTACAAC TGGGATCAAA GGGATATCGA CAGATGCTTG GAAATTAACC TGGTTGATAA GTATGACATC AAGAGTCTTC AGTATTTCCG TCGCCGACTC TCTAGATTGA ACGACTTGTT TTTTTACATG ATCTTGTTGA CGTGTGGGTC TCCGTATCGT TTAACGGAAT TATACCTGGT TCTTATTCGT AATGAAGAAA GATTGGGAAG TAATGTGTTT GTTGCTGATG GGATGCTTGG ACTCTTTACC AACAATGGAA AGAACAGTAT GAAGTATAGT CGAGAACGTC CTATTCTCAA GATGTTGCCC GAAGAGGTGT CAGAGTGCTT GGCTCACTAT ATCTATTTTG TCAGGCCATT CGAACACACT CTTGTCGAAT ACCAAGAACA AACCGGTTCA TTGAGCCACA TAGAGTTCAA GAAACTAAGT GATAAATACA AGCTCCATTT GTTCGTGGGA GAGAAACAAA TCAAAAACAG CACTACTCTT GGAAAATCGT TCCGTAAGTT CATAAATAAA ACAACTCCAA AGCTTAGAGG AATCATGTAC GGTGAAATAA GACAAGTGCT TTCATATTTT GTTGGGGCGA ACGTCTCTGG GATGATTGGA TTACAGATCA AGATTGACAA CACCCTTGCC GAACAGGCTG GGCATTCGTT TAGTAGGTTT GTCGAAAGTT ATGCCACAAA TAGAGATGGT TACAATCATG TAATGCTTCA ACGAATGAGA GACTGTCTGG CAGCCTGGCA TACGTGTTTG GAGATGAGTA CTCTTCAGAT ATTTACGCCT AGCGTATTTA CTAGGAACGA ACCACCGAAA TTGACAGGTG AAGAACTTAT AGAGGCTGGC AGACAGATGT TTGGACAACA CTTCAAGTTC AAGGAGGGCC AGCAACAGGC TACATCGGAT GTGGCTAATT GCTTCAGTAT GATGGTTGCT ATTGACCTGG GAAAGACGAC TTGTTGCATC ATTGCGATGT TGGCGGAGCG TACCCACAGT CTTCGCCAGA CCGACCGAGG TAAAAAGACT CACTGTGTTA CGATATTTGT GGTACCTTAC ATTTCTACAC TCAACTCCAC CATCCGCCAA TTGACCGAAA ATTTCAATGT GCGTACGTTT GATGGCTCTG TTGGACAGGT CAACGACTAT GATGTGCTAT TGATGCTGCT TGAAGACGGC CACCTTAGAT ATCTTAATCA GGTAGTCAGA CATTTTCAGT CTACGAACTA TCAAGGCAAG ACCTATCTCC GCCGGGTAGT GGTTGACGAT GCCCATATCT TGCAGATGAA ACAATATGTG GTCGATGGAA ACAAAAATGT ACCTGTTGTA TTTTTGACGT CATTTCTTCC ACAGAAGGAA GACGCTACGA TGAGTAGATT ATTTAATATC AATTCCTTGG TAAGGGTGTC GTCACAGGAA CCACTCTTGC CACATAAAGA ATTCACTCTT TTCAAGAAGC CTAATACCAC TTCAATATGT CAGACGGTTC TAGAAAAAGT TAGGAGTTTC GGTTCGGCAA TTGTGATTGC TGAGTCTTCT GATAAAGTGT CATACTTGGT AAGTCAAATG GCCGACGAGG TTCCATGTAT AGGTATCTAC GGATCCCCAG TCAGCAGAAC AGCTGCTTTG AATAAGATGG CCAATGAGAA TATCAGGGTT GTGGTTACTA CGGCAGAAGC AATGGTGGGT CTTCATATGT TGAAACCTAC AACCGTCTTG TTTGCATATT CGATTAATAA TCCAATAGAA TTGCTTGTTG GCAGTCAAAT GAGCAGTGGA AAGGTAGACA TGGTATTGCT CAACTTTTAT TCGTCTGAGT TTCAACGAGA CTTATTGGAC ACTTGTGTTA ATTTAATGGT AACGAGACAT ATGCGTTTAA CGGAACAAAC TTGTTATGAG GCTGGTAAGG AGCGGTGTCA TGCTTGTGTC AAAAAGAGAA GACATGCTTG A
|
Protein sequence | MNTQALGYSI HNPITLDLDL DLDLDQNFED ALDDFSQVDI KCENSNLIST DEHIWSSPQS FRQSLPFSSI QATPLRTNLF KHELDYLAKN YFVFVEPLGY FYLEKRHFFI SSDSKYATSK LYIKSKERFL DLISKLTKQS PYYNQIERES NIVNVATIPS PIPGIEIEEF AFCDSCKYTS CIGRVPVKCT SHGYMRRGEG YRVELGGFFL LKEFPRMPSL PDTPPPSTPN MYDRSPTIPC SSPIRRRSRF TNRDLQVEFK GPLITEIRVR ESENMTADAF LQKYANIKFI PDVGLYWHEH ERVFLDRNSE IVHKSLSSII DKCKFLETHM HAVMLNSYYW YLLASANTLS EVHICLEWLK EPLSGIEPSK IQYCPDCINS AIIDDHCEVD NLNVLTGMGY RMELGSIFLL ALGGGMTDIL EDQREEAPDL EYYASDSGAV FLSDEQEAEG SVDYASDMEN LTVDGQEGDE LTAVNGESES DTLLEDEVQE GDFRSFDDSS DSDNLFVDDA QEDDEEMIDI AIDNDTESDN DESDISAYDT TDDGLDNERD SEENILYDED PVRISTVSGW SQLKKKGICQ ISNLLWLRGQ QHFLDITSRA FRSFLMGTVS NAQYTQMLED VEEWVDLMPQ YYRIPKDTKG RTFEIPLPFS RLPPKIHTYC SECGFVIGFN ENNENLRSSS RHAKNTGHDR FISRQGYAIR AATGFNLLVI FGSNETVSTC ESEDGDYDDE DIVIPSFEEG LGSPFTPPEF IVEPYSTDEE ASDYEQEPES LPPPSKEEQY EVSELLKHSH GLVYDPIFYL YYYITEERYV DVNSKFFKNT LEDIGDHKSR IRNLVDRVIS TQIIDSRKFE KKVTRGHIHW AKELSEVARW KKLCFCKQCY QVFKTKTAFS KHRREYANMS EREHAGHDIS ADFKGYLLER PKGLYYAIGI PDDVAQAEAA EAEATATAEE IASLKSQWVM GGPGWTVEEY DIKVQWLEGK LGMPPDQFLA KHNYLLVKEV GLYYDFTQKW FRSITHRLLR KVFFGDLILT VRRELISYML HNMDRKYEYM LAPRHQQQVA TLEAAVKYDG PMVAVESGNE EHLDFELVDN CIQPTKYYMC HVCKFIAHHS SVSAHIKQAK HQREEGPEKH VVYGYSVRNV EAKTNYFRIQ HNLASSFHEP TDSTADSQIV DPFLSRYKAH EYTKFRSFLN MSNDKRWPRL VQLCKKYVTK KRLALSIGSS LSALIKNPRF CSTPNSALRY GPMLARFLFV AMKMCELKEH KAMEEFEQVF CANKKQGLRR FFGQMTKFGA FWLLSPVLVL QEDDSNAMED VFMSSLMVCL LDEGGIGFKI KSYRCDFCSG LSFAFKCIAV DVMKKDKSYV RPFVEDCNRA ASIALNFGRF CKFSADVHSL SPRTFGRTMN VLHVGDKIRI NTFEVGLDDI KKWVHTAIQQ YVLEVEELFM DINITPTMAM NEMMAAQKPR VVGMPQTSYN WDQRDIDRCL EINLVDKYDI KSLQYFRRRL SRLNDLFFYM ILLTCGSPYR LTELYLVLIR NEERLGSNVF VADGMLGLFT NNGKNSMKYS RERPILKMLP EEVSECLAHY IYFVRPFEHT LVEYQEQTGS LSHIEFKKLS DKYKLHLFVG EKQIKNSTTL GKSFRKFINK TTPKLRGIMY GEIRQVLSYF VGANVSGMIG LQIKIDNTLA EQAGHSFSRF VESYATNRDG YNHVMLQRMR DCLAAWHTCL EMSTLQIFTP SVFTRNEPPK LTGEELIEAG RQMFGQHFKF KEGQQQATSD VANCFSMMVA IDLGKTTCCI IAMLAERTHS LRQTDRGKKT HCVTIFVVPY ISTLNSTIRQ LTENFNVRTF DGSVGQVNDY DVLLMLLEDG HLRYLNQVVR HFQSTNYQGK TYLRRVVVDD AHILQMKQYV VDGNKNVPVV FLTSFLPQKE DATMSRLFNI NSLVRVSSQE PLLPHKEFTL FKKPNTTSIC QTVLEKVRSF GSAIVIAESS DKVSYLVSQM ADEVPCIGIY GSPVSRTAAL NKMANENIRV VVTTAEAMVG LHMLKPTTVL FAYSINNPIE LLVGSQMSSG KVDMVLLNFY SSEFQRDLLD TCVNLMVTRH MRLTEQTCYE AGKERCHACV KKRRHA
|
| |