Gene PICST_52035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_52035 
SymbolARH1 
ID4851278 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp1393749 
End bp1395194 
Gene Length1446 bp 
Protein Length462 aa 
Translation table 
GC content41% 
IMG OID640392986 
Productmitochondrial protein 
Protein accessionXP_001387494 
Protein GI126274264 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0493] NADPH-dependent glutamate synthase beta chain and related oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.176886 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATAGCTGTAG TAGGAACTGG TCCAGCAGGT TTCTATACTG CTCATCATAT TCTCCTGAAA 
TGTTCTGATA ACATGAGGAT CAACTTGGAT TTCTTTGAGA GGTTGCCGGC TCCATTCGGT
TTGAGCCGTT ATGGTGTTGC CCCCGACCAT CCCGAAGTCA AGAACTGCGA AGAGTACTTG
GAGAATATAA TGGACAAGTA CTCAGATACA AAAGATAACA ATAGCCATAA GGTTCGATTT
TTAGGGAACG TCAATATCGG AAAGGATATC TCGTTAAAGC AATTAGAGTC GTATTACCAC
TCCATCGTTC TTTCATATGG TAGCACTTCG GCTGACAACA AGCTTCAAGT AGCTGGTTCA
GATTTGCCAG GAGTTATTTC AGCAAGACAG TTTGTTAATT GGTACAATGG GCATCCTGAC
TTCTATGCCA CTGGAAAAAA GTTTGTTCCT CCTCCTCTTG ACCAGATTGA GGATGTGACG
ATAATTGGAA ATGGTAATGT TGCCTTAGAT GTAGCACGTA TCTTGTTGGC CGACCCAGCA
ACCCATTGGT CTAAGACGGA TATATCTGTA GATGCGTTAC AGGTGTTACA AAACAGCACA
GTCAAGAATG TCAACATCGT AGCCAGACGT GGGTTATTGG AGTCGGCTTT TTCAAATAAA
GAGATCAGAG AATTGTTTGA ACTCTCGAAC GAGAATAGCA TCAAGTTCAT TCCTCTAGAC
GACAAGGAAT TCGAAAACAT AGACATAAAG AGTTTAGGAA GAGTTGATAA GAGAAAAGTC
TCTATAATAG AAAAGTATAC AAAGGCTTCG CATACAGCTC CGGCAGCCGA CAGGACCTGG
TCTCTCCAAT ATTTAAAAAG TCCCAAGGCA TTCCTCCCTC ATGCGTCAAA TGCCAAGTTG
TTGTCTGCAA CTGAGGTTGT CAAGAACGAA TTGATACATG ATCCACTTAC AAATACCGCT
AAAGTACGCC CTACCTCTGA AAGTGAAACC ATCAAAAACG AGTTAGTTAT TTTGTCTATA
GGGTATCAAG GATCACCGTT ACTTGGATTT GAAGAAAATG GGATCCTTTT TGAAAAGAAC
CGCTTGTTCA ACAAACATGG TCGTATCTTG TCTATAGAAT CGAAAGAAGA GGAGGAACAT
AATTCAGTTT ACAAGAAGGG TTGGTATACT TCTGGATGGA TTAAGAATGG TCCAAAGGGC
GTTATTGCAA CAACCATGAT GGATTCGTTT GATACGGCTG ACAAAGTTCT TGAGGACCTT
TCCAATGGGA TTCACCTAGA AACATCCGGT GGTGATATTC ACGATTTGTT GAAGGATAAG
ACAGTAGTGG GCTGGGATAA TTGGAAAGTT CTTGATGCCT ACGAGAAAAG CAAGGGTGAG
AAAGAGGGTA AGTCGAGATA CAAGATCTGT AATTCGGAAG ACATGATCAA AATTGCATGT
AATTAG
 
Protein sequence
IAVVGTGPAG FYTAHHILLK CSDNMRINLD FFERLPAPFG LSRYGVAPDH PEVKNCEEYL 
ENIMDNHKVR FLGNVNIGKD ISLKQLESYY HSIVLSYGST SADNKLQVAG SDLPGVISAR
QFVNWYNGHP DFYATGKKFV PPPLDQIEDV TIIGNGNVAL DVARILLADP ATHWSKTDIS
VDALQVLQNS TVKNVNIVAR RGLLESAFSN KEIRELFELS NENSIKFIPL DDKEFENIDI
KSLGRVDKRK VSIIEKYTKA SHTAPAADRT WSLQYLKSPK AFLPHASNAK LLSATEVVKN
ELIHDPLTNT AKVRPTSESE TIKNELVILS IGYQGSPLLG FEENGILFEK NRLFNKHEEE
HNSVYKKGWY TSGWIKNGPK GVIATTMMDS FDTADKVLED LSNGIHLETS GGDIHDLLKD
KTVVGWDNWK VLDAYEKSKG EKEGKSRYKI CNSEDMIKIA CN