Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_30908 |
Symbol | XPF |
ID | 7198823 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011695 |
Strand | + |
Start bp | 64267 |
End bp | 67373 |
Gene Length | 3107 bp |
Protein Length | 975 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | excision repair cross-complementing rodent repair deficiency, complementation group 4 |
Protein accession | XP_002184960 |
Protein GI | 219129574 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGTTCTTC CATTCTGGTA ATTCGCTTGC ATAGTGAGAG GCTACTACTC AGGCACCGTG TCCAGCGCAC AGGGAAGTAC AGCATGAATG AGGCATCTAT CACCAGTTCA CGGTCTACAG CTGGCAAAAG ACAGCGAGAG AACGCCGGGG TCGAAAACCA TAGCAATGCC ATGGATTCGT CGTATACGGC AGGCAATCCA CTTATACCAG AGGGTCTCTT GCCTTGTTTC TTGGCAGACG CATTCAGCGA ACTTTATGAA GAAGACGGCT TGATGGTTTT AGGCAAAGGA TTGGGATGTT TGAGTCTCTT GGCCGCCTTT TGCCGATTCT ACGCCGATAT CGAGGAAGGC CATGTTTCAA TCGTACGTGA ATCTGTTGCC AGTAATACCG CCTCCTCCAA CAATGAACCG TCTGTAGCTC CCTTGGTAAT CGTTCTGGGA CTCAAGGACG GCGAGCGCCA AGCGCTCGTG GATATATTGG AGAGCTGGGG CACGCCTCCG GAACTTCTGC CAACCATGGT TACGAATGAA GCTGGACAAG GCAAGGATCG CGCCGCTCTT TACGATCGGG GTGGAATATT TTGCATCACA TCTCGTATTT TCATCGTTGA CTTGCTTACC AATATAGCGT CTCCTAACAA AATTGACGGA TTACTGGTGG CACATGGAGA GAATGTGACG GAGCAATCCA CGGAGGCATT TATTTTGAGA ATATTTCAAG GTCAGAAGCA GCCTTTCGGA TCCGGTTTTA TCAAGGCTTT CACGGACGCT CCGGATCAGC TTATGTCAGG CTTTGCCAAA GTCGATAAAA TTCTGAAATC GCTGCACGTT AGGAGGCTCT ATTTGTACCC CCGCTTTCAT GAAAGTATTC GACAGGAACT CGAGTCGCAT CCACCGTCCG TTACGGAACT TCACCAGGAA CTGTCGCCAC TACAAAAAGA AATGCAAAAT GCAATCGCAG CTGCCGTTTC AGCATGTATA CGGGAGCTCA AGTCGTCAAC CACGTTATTG GAATGGAATG ACTCGGAGCT TTCGATCGAG AATTGTGTGA CAACAAATTT CGATCGGGCC ATTTCCCGTC AACTGGAACA CGATTGGCAT CGACTCAAGC CACAAACCAA ACAACTTGTC CAAGACTTGC GTACTTTACG TACACTCTTT CAGTCCTTAA TTCAGTATGA TTGTGTCACG TTCTGGAAAT TGATAAATTC CATTAAGACC ATGAGCGCAG CTTCTCGTTA CCCGTCGTTA TGGTTGCTGA CTCCTGCGGC TGATGTACTC TTCCGCAAGG CCAAAGCTCG GGTGTACAAC ATCTCGCGAC CGCGGCCCAC GTCCCAACTA TCCCATCCCG TCGCTCACTT AAAGGCTATT CTGGAAGAGA ATCCTAAGTG GAAGTTGTTA AAACAAATCT TAGATGAAAT CCGACTAGAT GATGCCCAGC GAGTGCGAAA CGTGGACTGT GATGGACCAA GAAATGTATT GGTCATGGTA AAGGACGACA AGACTGTTGA TACGCTACGC GAATATCTCA CCGACGGTAA AGATCGGACA TTGACGCTTC GCTGGCTTCG ATTCTTAGAC CAGTACAACG ATCGATCGAG GTCCATTACC AATTGCAAAG GCGGAATATC GGCCATTTCG GAAGAGTCAC GTTTGCTACT TGAAGAAGAA TCCCGTGTTC GCAATGCGCT GTTCGGGAAA AGGCGAAATA GAGGTCACCG AGATACTGTT ACAAAACCAA AAAGCCAACT TAACCAAATC CCTGACTTTC TACGAAAACG CCGGCGGATC GCCGTGGAAA AGGGACGAGG GCAGCTCACC CACCAGGCCG ATGATTTAGA TCGTGAATTT GTGTTGGATG ATGCTTTAGA AGCCACAGAA AAGGCTCTGA ATGATGCCAG CTTTTCCAAA ACTATATTGG CCAGAATTAG AGCGGATCTT AATGCTGAAG AAGACGCTAT GCTTCGTATT TCCAACCCCA GTGAGCTCCG TATTATTCTC AAGAGCTATT CCAGTATCGA CGGCGACCAA TCCTCCCTTT TCCTTCAAGA CATGGAACCT CAATATGTAG TATTGTATGA TACTGACGTG GCTTTCATTC GTTCTGTGGA AATGTACGAA GCCTTGTCGA CTCACTCAGA TCCCGTCAGG GTATTCTTCC TGATGTTTGA AGCGAGTTCC GAACAAAAGA CATTTATGAA GACCTTGGAG CGAGAACAGA ACGCTTTCGA ACGCATGATT GATCACAAAA AGACGATGCC TCCTCCAGCG CTGCAAGTGG TTGGTACCCA GGAAATGCAG CAGGCCATGC ATGTTGGTAG TGCTGGCGGT AGTTACATGG ACGGGTCTTT ACCGCTGGCA TTTGATAGTC GCCGAGGCCG CGGAAAAGAG GACAGGTCCA AAGAACGACG AGATATTGCT GTCGACGTTC GTGAATTCAG GTCGGCGTTG CCTTCGATTC TTCATCAAGG CGGAATGCGC TTAGCACCTG TGACGCTGAC GGTCGGAGAT TTCGTACTCA GCAACGTCCA TTGCGTTGAA CGAAAGAGTA TAAGCGATCT ATTTGGGAGC TTCGCGAGCG GTCGCCTCTA TACTCAAGCT GAGGCGATGT CCAAGCACTA CAAGTGCCCA TGTTTGCTGA TTGAATTTGA TCCCACGAAG TCGTTTTGCT TGCAAAATTC GAACGAGCTG GGAGTCGAAA TCCGAACCGA GTCTGTATGC AGCAAAATTG CCTTACTAAC TATGCACTTT CCTCAATTAC GCATACTTTG GTCGCGCAGT CCCCATGAGA CTCTCCGAAT ATTTCGAGAG CTGAAGACGA ACCACGACGA AGTTGATGTG GAGAAGGCAA TCGACATTGG ACGGAACGAG TCACCGGACG CTTTGCTGCA ACTTCCAGCC GGGCTTGCCG AAGGTGAAGA TGAGATCAAT GAAATGGCTC GTGACATGTT GCTGCGACTT CCAGGTGTCA ACGTCCATTC AGCCAGGCGC ATCATGCAGG AGTGCGATAG TTTGGCGGAG CTTGCTGAAA TGTCCCGGGA TGAGCTTCGA CGAATCGCTG GTCCGGTGAC TGGCCAAAAA CTGTTCGCCT TCTTTCGACA AAAGATC
|
Protein sequence | MNEASITSSR STAGKRQREN AGVENHSNAM DSSYTAGNPL IPEGLLPCFL ADAFSELYEE DGLMVLGKGL GCLSLLAAFC RFYADIEEGH VSIVRESVAS NTASSNNEPS VAPLVIVLGL KDGERQALVD ILESWGTPPE LLPTMVTNEA GQGKDRAALY DRGGIFCITS RIFIVDLLTN IASPNKIDGL LVAHGENVTE QSTEAFILRI FQGQKQPFGS GFIKAFTDAP DQLMSGFAKV DKILKSLHVR RLYLYPRFHE SIRQELESHP PSVTELHQEL SPLQKEMQNA IAAAVSACIR ELKSSTTLLE WNDSELSIEN CVTTNFDRAI SRQLEHDWHR LKPQTKQLVQ DLRTLRTLFQ SLIQYDCVTF WKLINSIKTM SAASRYPSLW LLTPAADVLF RKAKARVYNI SRPRPTSQLS HPVAHLKAIL EENPKWKLLK QILDEIRLDD AQRVRNVDCD GPRNVLVMVK DDKTVDTLRE YLTDGKDRTL TLRWLRFLDQ YNDRSRSITN CKGGISAISE ESRLLLEEES RVRNALFGKR RNRGHRDTAD DLDREFVLDD ALEATEKALN DASFSKTILA RIRADLNAEE DAMLRISNPS ELRIILKSYS SIDGDQSSLF LQDMEPQYVV LYDTDVAFIR SVEMYEALST HSDPVRVFFL MFEASSEQKT FMKTLEREQN AFERMIDHKK TMPPPALQVV GTQEMQQAMH VGSAGGSYMD GSLPLAFDSR RGRGKEDRSK ERRDIAVDVR EFRSALPSIL HQGGMRLAPV TLTVGDFVLS NVHCVERKSI SDLFGSFASG RLYTQAEAMS KHYKCPCLLI EFDPTKSFCL QNSNELGVEI RTESVCSKIA LLTMHFPQLR ILWSRSPHET LRIFRELKTN HDEVDVEKAI DIGRNESPDA LLQLPAGLAE GEDEINEMAR DMLLRLPGVN VHSARRIMQE CDSLAELAEM SRDELRRIAG PVTGQKLFAF FRQKI
|
| |