Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_30250 |
Symbol | NTG2 |
ID | 4837588 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | + |
Start bp | 2358789 |
End bp | 2359937 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640388903 |
Product | Endonuclease III |
Protein accession | XP_001382677 |
Protein GI | 150864007 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0177] Predicted EndoIII-related endonuclease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00854867 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.477388 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCGGC TAAGAACCGC AAGTTCCACG GCAGTACAAG CCACGAAGAG AAAGAGAGTA GAAAAGGTCA AGATTGAATC AAGTGAAGAA AATGAGATTG AAATAAAAAG GAAAAGTTCT CGGCCCGCGA TCAAGACTGA AACAGCGATT GAATTGAACA ACGAGGTTAC TGGTGTTCTA TCTGACATAA AGATTGAGAT CAAACAAGAA CCAGCAGATA TAGCCGATAA AAAGTCTGAC CTGAAATATA TTCCCCATAT CAAGGTAGAA ATTGAAAAAG ACACTCCGCC TAACATCTTT CCATACATAG AAAAGCAACA TCATGATCTA CCCAAGGCTC CCAAGAATTG GTACAAGATA TACAACGAGA TCGTTACCAT GAGGAGTAAG ATATCAACCC CCGTAGACAC ACAGGGCTGT GAACGAATGC CCAACTCCAT TAACCCTAAC GTAAGAACCA GAAACCCACG AATTTACCGA TTTCAACTCT TGATTTCACT CATGCTCTCT TCACAGACTA AAGATGAAGT CAACTACTTG GCCATGAAAA CCATGCACGA AGGCTTATTG GCGAATGGAT ACAAAGATGG ATTATGTATT GAAGCATTAT TAGAGCTCAC AGAAAAAGAG TTAGACGATT ACATTTGCAA AGTAGGTTTT CACAACCGTA AAGCTGGCTA CATCAAGAGA GCCTGTGAGA TGCTCAGAGA CAATTTTCAA TCAGATATCC CCAGTACCAT CGAAGATGTT GTCACCTTAC CTGGGGTAGG TCCAAAAATG GGCTATTTGC TTTTGCAGAA TGCCTGGGGA ATCAATAGTG GCATTGGAGT GGATGTTCAT CTCCACCGTT TAGCCCAGAT GTGGTCTTGG ACTTCCAAGA ATGCGAAGAC TCCTGAACAT ACGAGAGTGG AATTGGAAGA CTGGTTACCA CCTAAGTATT GGGCCGATAT AAATCCACTA TTGGTAGGTT TTGGCCAGAC AATATGTGTT CCAAGAGCGC CCAACTGTGA TATCTGCACG CTAGCTACAA CTGGTTTGTG CAAGGCGTCC AAAAAAAGTC TTCTCAAGAC TCCCATCACC GATGAAAGAT TAAAAAAACT CAACAAGCAA CGGGGTGATT TGTCGAAGCT CATTGCTGAA TTCGTCTAA
|
Protein sequence | MSRLRTASST AVQATKRKRV EKVKIESSEE NEIEIKRKSS RPAIKTETAI ELNNEVTGVL SDIKIEIKQE PADIADKKSD SKYIPHIKVE IEKDTPPNIF PYIEKQHHDL PKAPKNWYKI YNEIVTMRSK ISTPVDTQGC ERMPNSINPN VRTRNPRIYR FQLLISLMLS SQTKDEVNYL AMKTMHEGLL ANGYKDGLCI EALLELTEKE LDDYICKVGF HNRKAGYIKR ACEMLRDNFQ SDIPSTIEDV VTLPGVGPKM GYLLLQNAWG INSGIGVDVH LHRLAQMWSW TSKNAKTPEH TRVELEDWLP PKYWADINPL LVGFGQTICV PRAPNCDICT LATTGLCKAS KKSLLKTPIT DERLKKLNKQ RGDLSKLIAE FV
|
| |