Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_52035 |
Symbol | ARH1 |
ID | 4851278 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 1393749 |
End bp | 1395194 |
Gene Length | 1446 bp |
Protein Length | 462 aa |
Translation table | |
GC content | 41% |
IMG OID | 640392986 |
Product | mitochondrial protein |
Protein accession | XP_001387494 |
Protein GI | 126274264 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG0493] NADPH-dependent glutamate synthase beta chain and related oxidoreductases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.176886 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATAGCTGTAG TAGGAACTGG TCCAGCAGGT TTCTATACTG CTCATCATAT TCTCCTGAAA TGTTCTGATA ACATGAGGAT CAACTTGGAT TTCTTTGAGA GGTTGCCGGC TCCATTCGGT TTGAGCCGTT ATGGTGTTGC CCCCGACCAT CCCGAAGTCA AGAACTGCGA AGAGTACTTG GAGAATATAA TGGACAAGTA CTCAGATACA AAAGATAACA ATAGCCATAA GGTTCGATTT TTAGGGAACG TCAATATCGG AAAGGATATC TCGTTAAAGC AATTAGAGTC GTATTACCAC TCCATCGTTC TTTCATATGG TAGCACTTCG GCTGACAACA AGCTTCAAGT AGCTGGTTCA GATTTGCCAG GAGTTATTTC AGCAAGACAG TTTGTTAATT GGTACAATGG GCATCCTGAC TTCTATGCCA CTGGAAAAAA GTTTGTTCCT CCTCCTCTTG ACCAGATTGA GGATGTGACG ATAATTGGAA ATGGTAATGT TGCCTTAGAT GTAGCACGTA TCTTGTTGGC CGACCCAGCA ACCCATTGGT CTAAGACGGA TATATCTGTA GATGCGTTAC AGGTGTTACA AAACAGCACA GTCAAGAATG TCAACATCGT AGCCAGACGT GGGTTATTGG AGTCGGCTTT TTCAAATAAA GAGATCAGAG AATTGTTTGA ACTCTCGAAC GAGAATAGCA TCAAGTTCAT TCCTCTAGAC GACAAGGAAT TCGAAAACAT AGACATAAAG AGTTTAGGAA GAGTTGATAA GAGAAAAGTC TCTATAATAG AAAAGTATAC AAAGGCTTCG CATACAGCTC CGGCAGCCGA CAGGACCTGG TCTCTCCAAT ATTTAAAAAG TCCCAAGGCA TTCCTCCCTC ATGCGTCAAA TGCCAAGTTG TTGTCTGCAA CTGAGGTTGT CAAGAACGAA TTGATACATG ATCCACTTAC AAATACCGCT AAAGTACGCC CTACCTCTGA AAGTGAAACC ATCAAAAACG AGTTAGTTAT TTTGTCTATA GGGTATCAAG GATCACCGTT ACTTGGATTT GAAGAAAATG GGATCCTTTT TGAAAAGAAC CGCTTGTTCA ACAAACATGG TCGTATCTTG TCTATAGAAT CGAAAGAAGA GGAGGAACAT AATTCAGTTT ACAAGAAGGG TTGGTATACT TCTGGATGGA TTAAGAATGG TCCAAAGGGC GTTATTGCAA CAACCATGAT GGATTCGTTT GATACGGCTG ACAAAGTTCT TGAGGACCTT TCCAATGGGA TTCACCTAGA AACATCCGGT GGTGATATTC ACGATTTGTT GAAGGATAAG ACAGTAGTGG GCTGGGATAA TTGGAAAGTT CTTGATGCCT ACGAGAAAAG CAAGGGTGAG AAAGAGGGTA AGTCGAGATA CAAGATCTGT AATTCGGAAG ACATGATCAA AATTGCATGT AATTAG
|
Protein sequence | IAVVGTGPAG FYTAHHILLK CSDNMRINLD FFERLPAPFG LSRYGVAPDH PEVKNCEEYL ENIMDNHKVR FLGNVNIGKD ISLKQLESYY HSIVLSYGST SADNKLQVAG SDLPGVISAR QFVNWYNGHP DFYATGKKFV PPPLDQIEDV TIIGNGNVAL DVARILLADP ATHWSKTDIS VDALQVLQNS TVKNVNIVAR RGLLESAFSN KEIRELFELS NENSIKFIPL DDKEFENIDI KSLGRVDKRK VSIIEKYTKA SHTAPAADRT WSLQYLKSPK AFLPHASNAK LLSATEVVKN ELIHDPLTNT AKVRPTSESE TIKNELVILS IGYQGSPLLG FEENGILFEK NRLFNKHEEE HNSVYKKGWY TSGWIKNGPK GVIATTMMDS FDTADKVLED LSNGIHLETS GGDIHDLLKD KTVVGWDNWK VLDAYEKSKG EKEGKSRYKI CNSEDMIKIA CN
|
| |