Gene PICST_52420 details


Gene Information

Locus tag: PICST_52420
Symbol: TRF4
ID: 4851161
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 1083392
End bp: 1085290
Gene Length: 1899 bp
Protein Length: 605 aa
Translation table:
GC content: 41%
IMG OID: 640392869
Product: topoisomerase I-related protein
Protein accession: XP_001387852
Protein GI: 126274147
COG category: [L] Replication, recombination and repair
COG ID: [COG5260] DNA polymerase sigma
TIGRFAM ID:
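
The gene length listed above is consistent with the start and end positions if the coordinates are read as 1-based and inclusive. That reading is an assumption, since the record does not state its coordinate convention. A minimal sketch of the arithmetic, using a hypothetical helper named gene_length:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Return the span in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    # Both endpoints count as part of the gene, hence the +1.
    return end_bp - start_bp + 1


# Start bp and End bp as listed in this record.
print(gene_length(1083392, 1085290))  # 1899
```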


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGGCA AAAGGAAAAG AGATATCACG CCTAAACTCA AGTCCAAGAA GCAGAGAAAG 
AGACAAAAGA CGGGAAAAGG TGATGTTTTT ATCCCTGCAC AGAACAACTT CCACCATCTC
CCAGAGGAAG ACTCTGATTC CGATTCTAGC GTGATCTTTG ACTACGACAG TCGGGAAAAG
AGTACTAAGG GAGAGATCGT CCTTGTAGAG CTGGATGATG AGAGTGAGAA ATCTGACGTA
AAAGTGACAA ATCAAGATGA ATCGGTAATT GTAGCTATAT CTGATGATGA TGATGATAAT
AATGATGATG AAAATGAAAA TGAACAAGAC GGCAATGTAA AGGTACTGAA AACCGAGAAA
AACGAGCTTG CAAAGAATGA AGATTTCATA GGTTTCGGCT TCCTGTCTTC AGAAGAAGAG
AACAATATAG AAGATGAAGA TTTCTCCGAC GATGGAGTGA TGAGTGCTGA TGAACATGGT
AAAGTAACTG CAAAAAAGGC CTCCTCTTCG TTTCCGTGGT TAAAGGACCA CGATCATTCT
GAGCAGAAGG AGATGGCAGA TTGGTTGACT ATGGAAATGA AGGATTTTGT CAACTATATA
TCACCTTCCA GTGAGGAAAT CAGAACGAGA AATAGACTCA TCAACAAGTT GAAGTCCAGC
ATCAGTCTGT ACTGGCCCGA AACGGAAACT CACGTATTTG GGTCTTCAGC TACAGACTTG
TACTTACCAG GTTCTGATAT CGACATAGTA ATTGTTTCTC GGACCGGAGA CTATGAAAAC
AGATCTAGAT TGTATCAATT GTCTTCTTAC TTGAGACATA AGGGCCTTGC CAAAAACATG
GAAGTCATCG CTAAAGCAAA AGTGCCTATT ATCAAATTTG TAGATCCTGA GTCCAACGTC
AATATTGATG TGTCATTTGA AAGAAGAAAT GGAATAGAAG CAGCGAAAAA GATACGAAGA
TGGATGACTA CTACTCCAGG ATTACGTGAG TTGGTCCTTA TTATAAAACA GTTTTTGAGT
TCAAGAAGGC TTAACAATGT GCACAGTGGT GGATTGGGAG GCTATGCTAC TATTATCCTC
TGCTATCACT TTTTGATGAT GCATCCTCGT GTGTCCACAA ATCTGATCAA CATAACTGAC
AATCTTGGGG CTTTACTAAT CGAATTTTTT GAATTGTACG GCAGAAACTT TTCGTATGAT
AACTTGATCA TCTCGTTGGA TCCACGGTCT GACCTTCCCA GATATTTGTT GAAAAGAGAC
TACCCCCATT TGAACACCAA TAAGAACCCA TTCACTATAG TGGTACAAGA TCCTTCTGAT
GAAGACAACA ACATTACCAG ATCGTCATAC AACCTTCGAG ACTTGAAAAA AGCATTTGGA
GGAGCCTACC AGTTGCTAGT AGAAAAGTGT TACGTGCTTA ATGGCACATC CTACAAGAAT
AGACTAGGGG AGCTGATATT GGGAGACATC ATTAAGTACA GAGGCAAGGA AAGAGATTTC
AACGATGACA GACACAAAGT CATCAACGAT GCATTAATAA ATCATGAAAG TTCTGAAGAC
GAAAGTGATG GAGCGGACGG GAATGACAAA TACTACTATT CAGATATAAC CGTAGAAAGC
GACGATGAGC TTCAATTCGA AGTCGACGTA CCAAAGAAAG AATCGGCCAA GAAAGAGGCA
GAATCCGCTG TGCATACCAA AAAGCAAGTT GAAGAGATTC TTGGCTTGAA AGGTGATAAC
CCTGAAGAAG ATGAAGAAGA AGGAGAGGGT GATTATAGTC CTCCAGATAC CACTAAGGAA
GACGAAGAAG AAGATGCTAC GGATATGAAA AGGTCTAAGA GCAATCTTGA CAAGGATGTT
CGTCGTGATT ATTGGAGGCA GAAGGGATTA GAATTGTAG
 
Protein sequence
MTGKRKRDIT PKLKSKKQRK RQKTGKGDVF IPAQNNFHHL PEEDSDSDSS VIFDYDSREK 
STKGEIVLVE LDDENESVIV AISDDDDDNN DDENENEQDG NVKVLKTEKN ELAKNEDFIG
FGFLSSEEEN NIEDEDFSDD GVMSADEHGK VTAKKASSSF PWLKDHDHSE QKEMADWLTM
EMKDFVNYIS PSSEEIRTRN RLINKLKSSI SLYWPETETH VFGSSATDLY LPGSDIDIVI
VSRTGDYENR SRLYQLSSYL RHKGLAKNME VIAKAKVPII KFVDPESNVN IDVSFERRNG
IEAAKKIRRW MTTTPGLREL VLIIKQFLSS RRLNNVHSGG LGGYATIILC YHFLMMHPRV
STNLINITDN LGALLIEFFE LYGRNFSYDN LIISLDPRSD LPRYLLKRDY PHLNTNKNPF
TIVVQDPSDE DNNITRSSYN LRDLKKAFGG AYQLLVEKCY VLNGTSYKNR LGELILGDII
KYRGKERDFN DDRHKVINDA LINHESSEDE SDGADGNDKY YYSDITVESD DELQFEKEAE
SAVHTKKQVE EILGLKGDNP EEDEEEGEDT TKEDEEEDAT DMKRSKSNLD KDVRRDYWRQ
KGLEL
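
Both sequences are printed in space-separated blocks of ten residues, so the spacing has to be removed before the text can be measured or compared against the listed lengths and GC content. The sketch below shows one way to do that; clean_sequence and gc_fraction are hypothetical helper names, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a formatted sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence from this record, as sample input.
first_line = "ATGACTGGCA AAAGGAAAAG AGATATCACG CCTAAACTCA AGTCCAAGAA GCAGAGAAAG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"GC fraction of this line: {gc_fraction(seq):.0%}")
```

Passing the full gene sequence block through the same two functions gives values that can be compared with the Gene Length and GC content fields of the record.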