Gene PICST_30449 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_30449 
Symbol 
ID4837925 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp82260 
End bp83477 
Gene Length1218 bp 
Protein Length405 aa 
Translation table12 
GC content43% 
IMG OID640389240 
Productpredicted protein 
Protein accessionXP_001383651 
Protein GI150864711 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.903908 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.469416 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGGCTA CACTTTTGCC CAACCCCAAG GATACTAATG CCACAATTTT GTTGATGGGA 
TTAAGACGAG GTGGCAAGTC GTCTATTTGC AAAGTTGTAT TCCACAACAT GCAGCCTTTG
GATACGTTGT ATTTGGAAAG TACATCCAAG CCCACAACAG AGCAGTTCAG CTCGCTCATA
GATCTTTCGG TGATGGAATT ACCGGGTCAG TTGAACTACT TCGAGCCCAA TTACGACTCA
GAAAGGCTCT TTTCGTCTAT CGGAGCTTTG GTTTATGTGA TTGACTCACA GGACGAATAC
TTGAATGCTT TGACCAATTT ATCGATGATC ATAGAATTTG CATACAAAGT CAACCCCAAG
ATCAACATTG AGGTATTAAT CCACAAAATC GATGGGTTGT CGGAAGACTA CCGTATCGAT
GCCCAGAGAG ATATCATGCA AAGAACCGGC GACGAATTGT TGGACTTGGG GCTTGAAGGC
GTGCAAGTGT CTTTCTACTT GACTTCTATT TTTGACCATT CTATCTACGA AGCATTTTCG
AGAATCGTCC AGAAGTTAAT TCCCGAGTTG CCTTCCTTAG AAAATATGCT TGACAACTTG
GTACAGCACT CATCTATCGA CAAGGTCTTT TTGTTTGACG TCAACTCCAA GATTTACGTA
GCCACAGATT CGTCACCAGT AGACATTCAG ACTTACGAAG TTTGTGCCGA ATTCATTGAC
ATTACCATCG ACCTTGATGA TTTGTATGTG GAAAACGAGT CTGGAACCAG AAAACAAAAT
TCAACAAGTC AACAAAAGGA GCTCAAGTCT GTAAGTCATC TTTCAAACGG TTCCATATTG
TACTTGAAGC AGATGATCAG AGGTCTAGCT CTTGTAGCCC TAATCAGAAA CGACGAAGTC
CGAAACTCTG CCAACAATAC TACAAATACG AACAAGATAA ATAGCGACTT CAGTGACGAC
AATGTCGACG TATTGGAGTC ATCTAGAACA AATAACCAGG ACAGTTCTTT GGCCATCATC
GACTACAATG TAAATCTTTT CAAACAGGCC ATGATGCGGA TGTGGGAAAA CTCCAGATTC
ATCAACCCCA ACGAGCCGCT AGAGCGTGGC TCTCTGGCGG AGTCCCATCT CTACGTTTCA
GACAGCAATG GAGCAGGCAG TGGTTTATAT AAGGGTATCA ACAATAACGG CCTGACGACT
CAAGATCACT TCAACTAA
 
Protein sequence
MEATLLPNPK DTNATILLMG LRRGGKSSIC KVVFHNMQPL DTLYLESTSK PTTEQFSSLI 
DLSVMELPGQ LNYFEPNYDS ERLFSSIGAL VYVIDSQDEY LNALTNLSMI IEFAYKVNPK
INIEVLIHKI DGLSEDYRID AQRDIMQRTG DELLDLGLEG VQVSFYLTSI FDHSIYEAFS
RIVQKLIPEL PSLENMLDNL VQHSSIDKVF LFDVNSKIYV ATDSSPVDIQ TYEVCAEFID
ITIDLDDLYV ENESGTRKQN STSQQKELKS VSHLSNGSIL YLKQMIRGLA LVALIRNDEV
RNSANNTTNT NKINSDFSDD NVDVLESSRT NNQDSSLAII DYNVNLFKQA MMRMWENSRF
INPNEPLERG SSAESHLYVS DSNGAGSGLY KGINNNGSTT QDHFN