Gene PICST_39453 details

Gene Information

Locus tag: PICST_39453
Symbol:
ID: 4851668
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 2497845
End bp: 2498966
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table:
GC content: 48%
IMG OID: 640393376
Product: predicted protein
Protein accession: XP_001386814
Protein GI: 126275209
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG1957] Inosine-uridine nucleoside N-ribohydrolase
TIGRFAM ID:
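
The coordinates and lengths listed above are mutually consistent, which can be checked with a short sketch. It assumes 1-based inclusive coordinates and a single terminal stop codon; both are assumptions, not statements from the record, and the function names are hypothetical.

```python
# Hypothetical helpers; the coordinate convention (1-based, inclusive) is assumed.
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature spanning start_bp..end_bp inclusive."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # the stop codon encodes no residue


# Start bp and End bp copied from the record.
length = gene_length(2497845, 2498966)
print(length, protein_length(length))  # prints "1122 373"
```

The result matches the Gene Length (1122 bp) and Protein Length (373 aa) fields.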


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0480355
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGCAAA TTTCAAAGGT TCTTGTAGCA GCTTCAGTTG CTGGTTTGAC CGCTGCTAAA 
AAGGTGTTTA TTGACAACGA TGGATTGGCT CCCTTGCAAG TTTTGTTTCC TTTGTTGGCC
GGTTGGGAAG TCCTCGGTAT CTCTACCTCG TTCGGTTCCT CTTCTACAGT GGATTCTGCA
GGTGCAGCCT ACGACGTGTT AACAGCCTAC AACTTGACTT CTTGTATTCC ACACTATGTA
GGTGCCCAAC AGCCATTGTT GAGAACTCAG GACACCTTTG ACACCTGGCA ATCCCTCTTC
GGAGAATTGG TATGGCAAGG TGCCTTCGCT CCTTCCTACG AAGATCTTTA CTCTTGGGAC
AATATCACCT ACAACGACTC TGTCCCAGGT GCCGTAGCCT TGATTGAAGC CGTCAAGGCC
AACAAGGATA CTGACCCTGT CTATATCTAT GCTGCAGGTA TGATGACAAC CGTTGCTCAG
GCCATTTCCC TCTACCCAGA CCTCGTCAAG GATGCTGCCG GTTTGTACAT TATGGGAGGG
TATTTTGATC AACAATTCGC AGCCGGCACT GGAACTCCTA TTGTCAATGA CATAAACACC
GACATCAACT TGATGCAAGA TCCAGAAGCT GCCCAAATCG TCTTGACTGC CAACTGGACT
GAATTGTACA TCGGTGCCAA CGTCACCAAC TACTTGGTTC CATCCCAAGA ATTGTACGAC
AGACTCATCA CCAAGGCCGG TGGCTACAGT GTGTTGGAAG AAAACTCCTA CTTAGAACCA
GTCTTGAACT TGGTTGCTAC GGGAAACTAC ACTGAAAATA CTTCTGAACA GACCCTTCCA
TTCTGGGACG AAGTAGTCTC TGCCTTCATG GTGTGGCCAG ACATGGTTCA AAGCACAACA
AACTTCTCTG TAGCTGTGGA CACGCAGTTC TACTCTCCAT TCTACGGAAG TTTGAGAATC
TGGGGTTCTG AGTTTGCTCC AAAGGGCCAA ATCACCGGTA ATGCCACCAT CGTCAACAAG
ATCGACGACA GCAGATTCTA CGACTTATTG GTTTCTACAT ACTTCATGGA CTGGAGACAG
TATTGTGAAG TTGGCGGTCC AGTCACTTTA GAAGGCTACT AA
 
Protein sequence
MVQISKVLVA ASVAGLTAAK KVFIDNDGLA PLQVLFPLLA GWEVLGISTS FGSSSTVDSA 
GAAYDVLTAY NLTSCIPHYV GAQQPLLRTQ DTFDTWQSLF GELVWQGAFA PSYEDLYSWD
NITYNDSVPG AVALIEAVKA NKDTDPVYIY AAGMMTTVAQ AISLYPDLVK DAAGLYIMGG
YFDQQFAAGT GTPIVNDINT DINLMQDPEA AQIVLTANWT ELYIGANVTN YLVPSQELYD
RLITKAGGYS VLEENSYLEP VLNLVATGNY TENTSEQTLP FWDEVVSAFM VWPDMVQSTT
NFSVAVDTQF YSPFYGSLRI WGSEFAPKGQ ITGNATIVNK IDDSRFYDLL VSTYFMDWRQ
YCEVGGPVTL EGY
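
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch for turning such text back into a contiguous string and running basic checks follows. The function names are hypothetical, and the stop codon set (TAA, TAG, TGA) is an assumption, since the record leaves the translation table blank.

```python
# Hypothetical helpers for block-formatted sequence text.
STOP_CODONS = {"TAA", "TAG", "TGA"}  # assumed; translation table is not given


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from block-formatted sequence text."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form one of the assumed stop codons."""
    return seq[-3:] in STOP_CODONS


# Last line of the gene sequence as printed in the record.
tail = clean_sequence("TATTGTGAAG TTGGCGGTCC AGTCACTTTA GAAGGCTACT AA")
print(len(tail), ends_with_stop(tail))
```

Applied to the full gene sequence, `len(clean_sequence(...))` can be compared with the Gene Length field and `gc_fraction(...)` with the GC content field.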