Gene PHATRDRAFT_30908 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_30908 
SymbolXPF 
ID7198823 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011695 
Strand
Start bp64267 
End bp67373 
Gene Length3107 bp 
Protein Length975 aa 
Translation table 
GC content49% 
IMG OID 
Productexcision repair cross-complementing rodent repair deficiency, complementation group 4 
Protein accessionXP_002184960 
Protein GI219129574 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGTTCTTC CATTCTGGTA ATTCGCTTGC ATAGTGAGAG GCTACTACTC AGGCACCGTG 
TCCAGCGCAC AGGGAAGTAC AGCATGAATG AGGCATCTAT CACCAGTTCA CGGTCTACAG
CTGGCAAAAG ACAGCGAGAG AACGCCGGGG TCGAAAACCA TAGCAATGCC ATGGATTCGT
CGTATACGGC AGGCAATCCA CTTATACCAG AGGGTCTCTT GCCTTGTTTC TTGGCAGACG
CATTCAGCGA ACTTTATGAA GAAGACGGCT TGATGGTTTT AGGCAAAGGA TTGGGATGTT
TGAGTCTCTT GGCCGCCTTT TGCCGATTCT ACGCCGATAT CGAGGAAGGC CATGTTTCAA
TCGTACGTGA ATCTGTTGCC AGTAATACCG CCTCCTCCAA CAATGAACCG TCTGTAGCTC
CCTTGGTAAT CGTTCTGGGA CTCAAGGACG GCGAGCGCCA AGCGCTCGTG GATATATTGG
AGAGCTGGGG CACGCCTCCG GAACTTCTGC CAACCATGGT TACGAATGAA GCTGGACAAG
GCAAGGATCG CGCCGCTCTT TACGATCGGG GTGGAATATT TTGCATCACA TCTCGTATTT
TCATCGTTGA CTTGCTTACC AATATAGCGT CTCCTAACAA AATTGACGGA TTACTGGTGG
CACATGGAGA GAATGTGACG GAGCAATCCA CGGAGGCATT TATTTTGAGA ATATTTCAAG
GTCAGAAGCA GCCTTTCGGA TCCGGTTTTA TCAAGGCTTT CACGGACGCT CCGGATCAGC
TTATGTCAGG CTTTGCCAAA GTCGATAAAA TTCTGAAATC GCTGCACGTT AGGAGGCTCT
ATTTGTACCC CCGCTTTCAT GAAAGTATTC GACAGGAACT CGAGTCGCAT CCACCGTCCG
TTACGGAACT TCACCAGGAA CTGTCGCCAC TACAAAAAGA AATGCAAAAT GCAATCGCAG
CTGCCGTTTC AGCATGTATA CGGGAGCTCA AGTCGTCAAC CACGTTATTG GAATGGAATG
ACTCGGAGCT TTCGATCGAG AATTGTGTGA CAACAAATTT CGATCGGGCC ATTTCCCGTC
AACTGGAACA CGATTGGCAT CGACTCAAGC CACAAACCAA ACAACTTGTC CAAGACTTGC
GTACTTTACG TACACTCTTT CAGTCCTTAA TTCAGTATGA TTGTGTCACG TTCTGGAAAT
TGATAAATTC CATTAAGACC ATGAGCGCAG CTTCTCGTTA CCCGTCGTTA TGGTTGCTGA
CTCCTGCGGC TGATGTACTC TTCCGCAAGG CCAAAGCTCG GGTGTACAAC ATCTCGCGAC
CGCGGCCCAC GTCCCAACTA TCCCATCCCG TCGCTCACTT AAAGGCTATT CTGGAAGAGA
ATCCTAAGTG GAAGTTGTTA AAACAAATCT TAGATGAAAT CCGACTAGAT GATGCCCAGC
GAGTGCGAAA CGTGGACTGT GATGGACCAA GAAATGTATT GGTCATGGTA AAGGACGACA
AGACTGTTGA TACGCTACGC GAATATCTCA CCGACGGTAA AGATCGGACA TTGACGCTTC
GCTGGCTTCG ATTCTTAGAC CAGTACAACG ATCGATCGAG GTCCATTACC AATTGCAAAG
GCGGAATATC GGCCATTTCG GAAGAGTCAC GTTTGCTACT TGAAGAAGAA TCCCGTGTTC
GCAATGCGCT GTTCGGGAAA AGGCGAAATA GAGGTCACCG AGATACTGTT ACAAAACCAA
AAAGCCAACT TAACCAAATC CCTGACTTTC TACGAAAACG CCGGCGGATC GCCGTGGAAA
AGGGACGAGG GCAGCTCACC CACCAGGCCG ATGATTTAGA TCGTGAATTT GTGTTGGATG
ATGCTTTAGA AGCCACAGAA AAGGCTCTGA ATGATGCCAG CTTTTCCAAA ACTATATTGG
CCAGAATTAG AGCGGATCTT AATGCTGAAG AAGACGCTAT GCTTCGTATT TCCAACCCCA
GTGAGCTCCG TATTATTCTC AAGAGCTATT CCAGTATCGA CGGCGACCAA TCCTCCCTTT
TCCTTCAAGA CATGGAACCT CAATATGTAG TATTGTATGA TACTGACGTG GCTTTCATTC
GTTCTGTGGA AATGTACGAA GCCTTGTCGA CTCACTCAGA TCCCGTCAGG GTATTCTTCC
TGATGTTTGA AGCGAGTTCC GAACAAAAGA CATTTATGAA GACCTTGGAG CGAGAACAGA
ACGCTTTCGA ACGCATGATT GATCACAAAA AGACGATGCC TCCTCCAGCG CTGCAAGTGG
TTGGTACCCA GGAAATGCAG CAGGCCATGC ATGTTGGTAG TGCTGGCGGT AGTTACATGG
ACGGGTCTTT ACCGCTGGCA TTTGATAGTC GCCGAGGCCG CGGAAAAGAG GACAGGTCCA
AAGAACGACG AGATATTGCT GTCGACGTTC GTGAATTCAG GTCGGCGTTG CCTTCGATTC
TTCATCAAGG CGGAATGCGC TTAGCACCTG TGACGCTGAC GGTCGGAGAT TTCGTACTCA
GCAACGTCCA TTGCGTTGAA CGAAAGAGTA TAAGCGATCT ATTTGGGAGC TTCGCGAGCG
GTCGCCTCTA TACTCAAGCT GAGGCGATGT CCAAGCACTA CAAGTGCCCA TGTTTGCTGA
TTGAATTTGA TCCCACGAAG TCGTTTTGCT TGCAAAATTC GAACGAGCTG GGAGTCGAAA
TCCGAACCGA GTCTGTATGC AGCAAAATTG CCTTACTAAC TATGCACTTT CCTCAATTAC
GCATACTTTG GTCGCGCAGT CCCCATGAGA CTCTCCGAAT ATTTCGAGAG CTGAAGACGA
ACCACGACGA AGTTGATGTG GAGAAGGCAA TCGACATTGG ACGGAACGAG TCACCGGACG
CTTTGCTGCA ACTTCCAGCC GGGCTTGCCG AAGGTGAAGA TGAGATCAAT GAAATGGCTC
GTGACATGTT GCTGCGACTT CCAGGTGTCA ACGTCCATTC AGCCAGGCGC ATCATGCAGG
AGTGCGATAG TTTGGCGGAG CTTGCTGAAA TGTCCCGGGA TGAGCTTCGA CGAATCGCTG
GTCCGGTGAC TGGCCAAAAA CTGTTCGCCT TCTTTCGACA AAAGATC
 
Protein sequence
MNEASITSSR STAGKRQREN AGVENHSNAM DSSYTAGNPL IPEGLLPCFL ADAFSELYEE 
DGLMVLGKGL GCLSLLAAFC RFYADIEEGH VSIVRESVAS NTASSNNEPS VAPLVIVLGL
KDGERQALVD ILESWGTPPE LLPTMVTNEA GQGKDRAALY DRGGIFCITS RIFIVDLLTN
IASPNKIDGL LVAHGENVTE QSTEAFILRI FQGQKQPFGS GFIKAFTDAP DQLMSGFAKV
DKILKSLHVR RLYLYPRFHE SIRQELESHP PSVTELHQEL SPLQKEMQNA IAAAVSACIR
ELKSSTTLLE WNDSELSIEN CVTTNFDRAI SRQLEHDWHR LKPQTKQLVQ DLRTLRTLFQ
SLIQYDCVTF WKLINSIKTM SAASRYPSLW LLTPAADVLF RKAKARVYNI SRPRPTSQLS
HPVAHLKAIL EENPKWKLLK QILDEIRLDD AQRVRNVDCD GPRNVLVMVK DDKTVDTLRE
YLTDGKDRTL TLRWLRFLDQ YNDRSRSITN CKGGISAISE ESRLLLEEES RVRNALFGKR
RNRGHRDTAD DLDREFVLDD ALEATEKALN DASFSKTILA RIRADLNAEE DAMLRISNPS
ELRIILKSYS SIDGDQSSLF LQDMEPQYVV LYDTDVAFIR SVEMYEALST HSDPVRVFFL
MFEASSEQKT FMKTLEREQN AFERMIDHKK TMPPPALQVV GTQEMQQAMH VGSAGGSYMD
GSLPLAFDSR RGRGKEDRSK ERRDIAVDVR EFRSALPSIL HQGGMRLAPV TLTVGDFVLS
NVHCVERKSI SDLFGSFASG RLYTQAEAMS KHYKCPCLLI EFDPTKSFCL QNSNELGVEI
RTESVCSKIA LLTMHFPQLR ILWSRSPHET LRIFRELKTN HDEVDVEKAI DIGRNESPDA
LLQLPAGLAE GEDEINEMAR DMLLRLPGVN VHSARRIMQE CDSLAELAEM SRDELRRIAG
PVTGQKLFAF FRQKI