Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_86570 |
Symbol | ATP1 |
ID | 4851467 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 1891514 |
End bp | 1893877 |
Gene Length | 2364 bp |
Protein Length | 545 aa |
Translation table | |
GC content | 43% |
IMG OID | 640393175 |
Product | F1F0-ATPase complex, F1 alpha subunit |
Protein accession | XP_001387600 |
Protein GI | 126274595 |
COG category | [C] Energy production and conversion |
COG ID | [COG0056] F0F1-type ATP synthase, alpha subunit |
TIGRFAM ID | [TIGR00962] proton translocating ATP synthase, F1 alpha subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CTTTTAGTAA CACCACAATT TACTAACAAG TTACAATGTT GTCGGCTCGT CCTGTTTTGC GTACTGCTGC CCGTTCTGCC GCCGTTGCTG CCAGAACCTT GAGAGTTGTA AGTATTTGAA TGAATTCTCG GGATGTAAAT CCAATAGTAA AATCAATAGC ATTCAACAAC TTCCGTAAGA AGTCAACAAT AGTCCATGAT TGTCTCAACA ACATCCAACA AGGATTAATC GCCGAACAAT TGTAGATATC TGTACACTGA CGAATACCAT TGTCGTCCGT GGGCATTTTC TGTTTTTGCA GTAGAAAGTT CTGGCAATTT GAATTATTGA TTCAACAAAA GTCGACGTAT GTTGAATCAG AAAGTCAATT CACCACACAA TTGATTCCCT AAGGCTGAAC TGCTATCCTT ATTACCATCA CAGTTATTCC TTCATTGAGT ACCATTAATA TCGGCATTGT TGTCTTCTTG TTATTGTTTC TTACTATTAC TAAGTTTCCT GACATGTACT AACATCTTTA GGCTCGTCCA ACTCTCTTGA CTGCTCAACG TTTTGCCTCC GCCAAGGCTG CTCCAACTGA AGTCTCCTCC ATCTTGGAAG AAAGAATCAG AGGTGTCTCC GATGCTGCCA ACTTGAACGA GACCGGTAGA GTCTTGTCCG TCGGTGATGG TATTGCCCGT ATCTTTGGTT TGAACAACAT TCAAGCTGAA GAATTGGTTG AATTCGCCTC CGGTGTCAAG GGTATGGCCT TGAACTTGGA ACCAGACCAA GTCGGTGTTG TCTTGTTCGG TTCGGACAGA TTAGTCAAGG AAGGTGAAAC CGTTAAGAGA ACCGGTAAGA TTGTCGATGT TCCAACTGGT CCAGAATTGT TGGGTAGAGT TGTCGACGGT TTAGGTAACC CAATCGATGG TAAGGGTCCA TTGAACGCTT CCTCTTCCTC GAGAGCTCAA GTCAAGGCTC CAGGTATCTT GCCAAGAACC TCTGTGTTCG AACCTATGCA AACTGGTTTA AAGTCTGTTG ATGCTCTTGT CCCAGTCGGA AGAGGTCAAA GAGAATTGAT CATTGGTGAT CGTCAAACCG GTAAGACCGC CGTCGCATTG GACACCATCT TGAACCAAAA GAGATGGAAC AACGGTGCTG ACGAATCCAA GAAGTTGTAC TGTGTCTACG TTGCCGTTGG TCAAAAGAGA TCCACTGTAG CCCAATTGGT CCAGACCTTG GAACAAAACG ATGCTCTTAA GTACTCCATC ATTGTCGCTG CCACTGCCTC GGAAGCCGCT CCATTACAAT ACATTGCTCC TTTCACTGCC TGTGCTATTG GTGAATGGTT CAGAGACAAC GGTAAGCACG CTTTGATTGT CTACGATGAC TTGTCCAAGC AAGCTGTTGC CTACCGTCAA TTGTCATTAT TGTTGAGACG TCCTCCTGGA AGAGAAGCTT ACCCTGGTGA TGTTTTCTAC TTGCACTCTA GATTGTTAGA AAGAGCTGCT AAGATGTCTC CAGTCCACGG TGGTGGTTCT TTGACCGCTT TGCCAGTCAT TGAAACCCAA GGTGGTGATG TTTCCGCTTA TATTCCAACC AACGTTATTT CCATTACCGA TGGTCAAATT TTCTTGGAAG CTGAATTGTT CTACAAGGGT ATCAGACCAG CTATTAACGT CGGTTTGTCC GTCTCCCGTG TCGGTTCTGC TGCTCAAGTC AAGGCCATGA AGCAAGTCGC TGGTTCCTTG AAGTTGTTCT TGGCCCAATA CAGAGAAGTC GCTGCCTTCG CCCAATTCGG TTCCGATTTG GATGCCTCTA CCAAGCAAAC CTTGAACAGA GGTGAAAGAT TGACCCAATT ATTAAAGCAA AAGCAATACT CTCCATTGGC TGCTGAAGAA CAAGTCCCAT TGATTTTCGC TGGTGTCAAC GGTTTCTTGG ACGAGATTCC TCTTGAAAGA ATCGGTGAAT TCGAAGAATC CTTCTTGGCT CACTTGAAGG CCAACGAAAC CGAAATCTTG GAAGCCATCC AAGTCAAGGG TGAATTGTCT AAGGAATTGT TGGAAAAGTT GAGATCCACC ACTGAAACTT TTGTTTCTAC TTTCTAAGTC TTTTAGTAAA CTTTATCACC ACCAAAAATG AATAGAAGAA CGTCGCATTG AAATCAATTC ATGGTGTTTT GTTTTCTGGT GTTTAATTCT TGGAAGACAT ATGTAATGTT TTCTCTATTC TCTTTAGATG ACTCCACTTC CGTTTTCAGG CATTTATTTG AACTTTGATT TATTGCTATT TTTTTACTAA GGTGTCGGCT GGTTTCCTTC GGGGAGCTAC TTGCACATCA AAAAAGAAAA TACTCATATA TAAGAAATAC AAGTATTCCG TATT
|
Protein sequence | MLSARPVLRT AARSAAVAAR TLRVARPTLL TAQRFASAKA APTEVSSILE ERIRGVSDAA NLNETGRVLS VGDGIARIFG LNNIQAEELV EFASGVKGMA LNLEPDQVGV VLFGSDRLVK EGETVKRTGK IVDVPTGPEL LGRVVDGLGN PIDGKGPLNA SSSSRAQVKA PGILPRTSVF EPMQTGLKSV DALVPVGRGQ RELIIGDRQT GKTAVALDTI LNQKRWNNGA DESKKLYCVY VAVGQKRSTV AQLVQTLEQN DALKYSIIVA ATASEAAPLQ YIAPFTACAI GEWFRDNGKH ALIVYDDLSK QAVAYRQLSL LLRRPPGREA YPGDVFYLHS RLLERAAKMS PVHGGGSLTA LPVIETQGGD VSAYIPTNVI SITDGQIFLE AELFYKGIRP AINVGLSVSR VGSAAQVKAM KQVAGSLKLF LAQYREVAAF AQFGSDLDAS TKQTLNRGER LTQLLKQKQY SPLAAEEQVP LIFAGVNGFL DEIPLERIGE FEESFLAHLK ANETEILEAI QVKGELSKEL LEKLRSTTET FVSTF
|
| |