Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_52420 |
Symbol | TRF4 |
ID | 4851161 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | - |
Start bp | 1083392 |
End bp | 1085290 |
Gene Length | 1899 bp |
Protein Length | 605 aa |
Translation table | |
GC content | 41% |
IMG OID | 640392869 |
Product | topoisomerase I-related protein |
Protein accession | XP_001387852 |
Protein GI | 126274147 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG5260] DNA polymerase sigma |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGGCA AAAGGAAAAG AGATATCACG CCTAAACTCA AGTCCAAGAA GCAGAGAAAG AGACAAAAGA CGGGAAAAGG TGATGTTTTT ATCCCTGCAC AGAACAACTT CCACCATCTC CCAGAGGAAG ACTCTGATTC CGATTCTAGC GTGATCTTTG ACTACGACAG TCGGGAAAAG AGTACTAAGG GAGAGATCGT CCTTGTAGAG CTGGATGATG AGAGTGAGAA ATCTGACGTA AAAGTGACAA ATCAAGATGA ATCGGTAATT GTAGCTATAT CTGATGATGA TGATGATAAT AATGATGATG AAAATGAAAA TGAACAAGAC GGCAATGTAA AGGTACTGAA AACCGAGAAA AACGAGCTTG CAAAGAATGA AGATTTCATA GGTTTCGGCT TCCTGTCTTC AGAAGAAGAG AACAATATAG AAGATGAAGA TTTCTCCGAC GATGGAGTGA TGAGTGCTGA TGAACATGGT AAAGTAACTG CAAAAAAGGC CTCCTCTTCG TTTCCGTGGT TAAAGGACCA CGATCATTCT GAGCAGAAGG AGATGGCAGA TTGGTTGACT ATGGAAATGA AGGATTTTGT CAACTATATA TCACCTTCCA GTGAGGAAAT CAGAACGAGA AATAGACTCA TCAACAAGTT GAAGTCCAGC ATCAGTCTGT ACTGGCCCGA AACGGAAACT CACGTATTTG GGTCTTCAGC TACAGACTTG TACTTACCAG GTTCTGATAT CGACATAGTA ATTGTTTCTC GGACCGGAGA CTATGAAAAC AGATCTAGAT TGTATCAATT GTCTTCTTAC TTGAGACATA AGGGCCTTGC CAAAAACATG GAAGTCATCG CTAAAGCAAA AGTGCCTATT ATCAAATTTG TAGATCCTGA GTCCAACGTC AATATTGATG TGTCATTTGA AAGAAGAAAT GGAATAGAAG CAGCGAAAAA GATACGAAGA TGGATGACTA CTACTCCAGG ATTACGTGAG TTGGTCCTTA TTATAAAACA GTTTTTGAGT TCAAGAAGGC TTAACAATGT GCACAGTGGT GGATTGGGAG GCTATGCTAC TATTATCCTC TGCTATCACT TTTTGATGAT GCATCCTCGT GTGTCCACAA ATCTGATCAA CATAACTGAC AATCTTGGGG CTTTACTAAT CGAATTTTTT GAATTGTACG GCAGAAACTT TTCGTATGAT AACTTGATCA TCTCGTTGGA TCCACGGTCT GACCTTCCCA GATATTTGTT GAAAAGAGAC TACCCCCATT TGAACACCAA TAAGAACCCA TTCACTATAG TGGTACAAGA TCCTTCTGAT GAAGACAACA ACATTACCAG ATCGTCATAC AACCTTCGAG ACTTGAAAAA AGCATTTGGA GGAGCCTACC AGTTGCTAGT AGAAAAGTGT TACGTGCTTA ATGGCACATC CTACAAGAAT AGACTAGGGG AGCTGATATT GGGAGACATC ATTAAGTACA GAGGCAAGGA AAGAGATTTC AACGATGACA GACACAAAGT CATCAACGAT GCATTAATAA ATCATGAAAG TTCTGAAGAC GAAAGTGATG GAGCGGACGG GAATGACAAA TACTACTATT CAGATATAAC CGTAGAAAGC GACGATGAGC TTCAATTCGA AGTCGACGTA CCAAAGAAAG AATCGGCCAA GAAAGAGGCA GAATCCGCTG TGCATACCAA AAAGCAAGTT GAAGAGATTC TTGGCTTGAA AGGTGATAAC CCTGAAGAAG ATGAAGAAGA AGGAGAGGGT GATTATAGTC CTCCAGATAC CACTAAGGAA GACGAAGAAG AAGATGCTAC GGATATGAAA AGGTCTAAGA GCAATCTTGA CAAGGATGTT CGTCGTGATT ATTGGAGGCA GAAGGGATTA GAATTGTAG
|
Protein sequence | MTGKRKRDIT PKLKSKKQRK RQKTGKGDVF IPAQNNFHHL PEEDSDSDSS VIFDYDSREK STKGEIVLVE LDDENESVIV AISDDDDDNN DDENENEQDG NVKVLKTEKN ELAKNEDFIG FGFLSSEEEN NIEDEDFSDD GVMSADEHGK VTAKKASSSF PWLKDHDHSE QKEMADWLTM EMKDFVNYIS PSSEEIRTRN RLINKLKSSI SLYWPETETH VFGSSATDLY LPGSDIDIVI VSRTGDYENR SRLYQLSSYL RHKGLAKNME VIAKAKVPII KFVDPESNVN IDVSFERRNG IEAAKKIRRW MTTTPGLREL VLIIKQFLSS RRLNNVHSGG LGGYATIILC YHFLMMHPRV STNLINITDN LGALLIEFFE LYGRNFSYDN LIISLDPRSD LPRYLLKRDY PHLNTNKNPF TIVVQDPSDE DNNITRSSYN LRDLKKAFGG AYQLLVEKCY VLNGTSYKNR LGELILGDII KYRGKERDFN DDRHKVINDA LINHESSEDE SDGADGNDKY YYSDITVESD DELQFEKEAE SAVHTKKQVE EILGLKGDNP EEDEEEGEDT TKEDEEEDAT DMKRSKSNLD KDVRRDYWRQ KGLEL
|
| |