Gene PICST_19381 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_19381 
Symbol 
ID4838190 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp1216635 
End bp1217801 
Gene Length1167 bp 
Protein Length379 aa 
Translation table12 
GC content44% 
IMG OID640389505 
Productpredicted protein 
Protein accessionXP_001383862 
Protein GI150864867 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.797995 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TGGCAAGACG ACTTGGCTGT TTCTGCCTGT TTCTTGTGCC ATTCACACTA CACGTTCTTC 
AACCGAAGGC ATCATTGCCG CAAATGTGGC CGTGTAGTAT GTGCATCCTG TTCTGATAGA
CCGGTGAAGT ACTTTCCCAA CACTTATGTG GTTAGTCCTC ATGGTTCTCG AGTTGTGGAT
ACCTCGTTCG AGATGTTTCG TACGTGTGAT GAGTGTGTGG ATGAGATTCG TATGATTAGG
AGAGCGTTGT TTACTACAAA TCTGTCAGTA GACAACAACA GCATCAACAG CTCTTCGTCT
TCGCTAAATG TGATGAATTA CTTCGATCCT CATAGTCATG AACATGAACG TGACCACATC
CATTATCACG ATCATGATAA CGATTCAACC ACTAAATACT CTACCAGAAC TCATACCAGA
ATTGTAGACT CTTCGACAAA CTCTTCCGCC ACTAATCTCG CACATTCTCA CCATCGTCGT
ATTCATGGAA GAGGGGGAAC CCTGGCAGCG GCTGGTTCGG ACGATACTGA GTCTGATCTC
AACTTGTGTC CGGTATGCGC TACTGATTTG CTCAAGCTTT ATATCAATGC ACATAAACGT
AGAATCGATG AGATTTCTCA TGAAGACTTT GACGCTTTTA AAGAAACTCA CATCAACGAC
TGTTTGACTC ATTTCGATTT CAATACAGAA AACCAACGCT TCAACTCACC AGAGTCAAAC
CATCATTCTC ATCCTAGGAA CAAAATGTTG GTCTACAACA TTCCCCCTAT TCCCAAACCT
AAGTATGAGA CAATCCCTAT TATTGACGAA GCCCCCATTT CTGATGGAAC GTCAGAAGAA
GCCACGGTTC ATGATTCAGT TCACAACTCT CAATCAGGAG GCATTTCGCC TTCTTCGCTG
ACTGGTCAGG AAGGTGAGCA AGTCGAATTT TCTCAGTTGG ATACGATTAT AGGATCCGTG
ACTTCAACTT CCACCATCCA GCCTTCTGCT GAAAAGATTT CGTACGATGA TGTCATTGAT
AATGAATGTG TCATATGTTT GGAAGACCTA AAGCCAGGAG ATAAGGTTGG TCGCTTGGAA
TGCTTGTGCG TGTTCCACTA TAAGTGTATC AAAGATTGGT TCAACAAGAA GGGCTACGGT
GAATGTCCTG TTCATTTCTT GCACAAG
 
Protein sequence
WQDDLAVSAC FLCHSHYTFF NRRHHCRKCG RVVCASCSDR PVKYFPNTYV VSPHGSRVVD 
TSFEMFRTCD ECVDEIRMIR RALFTTNSSV DNNSINSSSS SLNVMNYFDP HSHEHERDHI
HYHDHDNDST TKYSTRTHTR IVDSSTNSSA TNLAHSHHRP AGSDDTESDL NLCPVCATDL
LKLYINAHKR RIDEISHEDF DAFKETHIND CLTHFDFNTE NQRFNSPESN HHSHPRNKML
VYNIPPIPKP KYETIPIIDE APISDGTSEE ATVHDSVHNS QSGGISPSSS TGQEGEQVEF
SQLDTIIGSV TSTSTIQPSA EKISYDDVID NECVICLEDL KPGDKVGRLE CLCVFHYKCI
KDWFNKKGYG ECPVHFLHK