Gene PICST_30250 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_30250 
SymbolNTG2 
ID4837588 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp2358789 
End bp2359937 
Gene Length1149 bp 
Protein Length382 aa 
Translation table12 
GC content42% 
IMG OID640388903 
ProductEndonuclease III 
Protein accessionXP_001382677 
Protein GI150864007 
COG category[L] Replication, recombination and repair 
COG ID[COG0177] Predicted EndoIII-related endonuclease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00854867 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.477388 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCGGC TAAGAACCGC AAGTTCCACG GCAGTACAAG CCACGAAGAG AAAGAGAGTA 
GAAAAGGTCA AGATTGAATC AAGTGAAGAA AATGAGATTG AAATAAAAAG GAAAAGTTCT
CGGCCCGCGA TCAAGACTGA AACAGCGATT GAATTGAACA ACGAGGTTAC TGGTGTTCTA
TCTGACATAA AGATTGAGAT CAAACAAGAA CCAGCAGATA TAGCCGATAA AAAGTCTGAC
CTGAAATATA TTCCCCATAT CAAGGTAGAA ATTGAAAAAG ACACTCCGCC TAACATCTTT
CCATACATAG AAAAGCAACA TCATGATCTA CCCAAGGCTC CCAAGAATTG GTACAAGATA
TACAACGAGA TCGTTACCAT GAGGAGTAAG ATATCAACCC CCGTAGACAC ACAGGGCTGT
GAACGAATGC CCAACTCCAT TAACCCTAAC GTAAGAACCA GAAACCCACG AATTTACCGA
TTTCAACTCT TGATTTCACT CATGCTCTCT TCACAGACTA AAGATGAAGT CAACTACTTG
GCCATGAAAA CCATGCACGA AGGCTTATTG GCGAATGGAT ACAAAGATGG ATTATGTATT
GAAGCATTAT TAGAGCTCAC AGAAAAAGAG TTAGACGATT ACATTTGCAA AGTAGGTTTT
CACAACCGTA AAGCTGGCTA CATCAAGAGA GCCTGTGAGA TGCTCAGAGA CAATTTTCAA
TCAGATATCC CCAGTACCAT CGAAGATGTT GTCACCTTAC CTGGGGTAGG TCCAAAAATG
GGCTATTTGC TTTTGCAGAA TGCCTGGGGA ATCAATAGTG GCATTGGAGT GGATGTTCAT
CTCCACCGTT TAGCCCAGAT GTGGTCTTGG ACTTCCAAGA ATGCGAAGAC TCCTGAACAT
ACGAGAGTGG AATTGGAAGA CTGGTTACCA CCTAAGTATT GGGCCGATAT AAATCCACTA
TTGGTAGGTT TTGGCCAGAC AATATGTGTT CCAAGAGCGC CCAACTGTGA TATCTGCACG
CTAGCTACAA CTGGTTTGTG CAAGGCGTCC AAAAAAAGTC TTCTCAAGAC TCCCATCACC
GATGAAAGAT TAAAAAAACT CAACAAGCAA CGGGGTGATT TGTCGAAGCT CATTGCTGAA
TTCGTCTAA
 
Protein sequence
MSRLRTASST AVQATKRKRV EKVKIESSEE NEIEIKRKSS RPAIKTETAI ELNNEVTGVL 
SDIKIEIKQE PADIADKKSD SKYIPHIKVE IEKDTPPNIF PYIEKQHHDL PKAPKNWYKI
YNEIVTMRSK ISTPVDTQGC ERMPNSINPN VRTRNPRIYR FQLLISLMLS SQTKDEVNYL
AMKTMHEGLL ANGYKDGLCI EALLELTEKE LDDYICKVGF HNRKAGYIKR ACEMLRDNFQ
SDIPSTIEDV VTLPGVGPKM GYLLLQNAWG INSGIGVDVH LHRLAQMWSW TSKNAKTPEH
TRVELEDWLP PKYWADINPL LVGFGQTICV PRAPNCDICT LATTGLCKAS KKSLLKTPIT
DERLKKLNKQ RGDLSKLIAE FV