Gene PHATRDRAFT_23798 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_23798 
Symbol 
ID7198925 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011695 
Strand
Start bp112312 
End bp114807 
Gene Length2496 bp 
Protein Length710 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184969 
Protein GI219129593 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
AAGTTTCATT TCTTCTGGAA TAGTCGACCT TTCCTCTCCG CCAAATTTGC TTTTGTACAC 
AGGCTGGCGT CTGCTTTACA GCATACATAG CGTGCGGATA GAATGCCCAA GAAAAAGGGC
AAAAAGGCGG ATCCGGAGAA AAAAGCGGCG TTGCAGGCGA AAAAGGAAGC CAAGGCCGAC
AAGAAAGCGA CCAAGCGCCT GCACAAGGAC GGCAGTCTCG ATCCCGCCAC CGTCGGTAGT
GTGGACGATG TGGATTCGTT GCTGGAGCAG TACCAGCAAC AGGATGTGGC CGGCACCGTC
GAATCGCGTA ACAGCCAAGC ACTCGCCATG GAAGGCTTTC CCGCGGCACG GGCCAACGCT
TCCTGGACGC TTTACGAGGA TACCAAAAAG AGTCACGCGG AAGCGTACCT CTTTGGCGGG
GAATACTACG ACGGCGTCGA AAATATAGTC CTCGATCATC TTTACAAGAT TGATTTGACC
CGTAACGAGT GGAAGCAGAT CGTGCCGGCG GGGCCAGCCC CACCGCCGCG CTGTGCCCAC
TCGGCTGCCT ACGCCAATCA CCATATTTAC GTCTTTGGGG GAGAACTCGC CAGCGCCGAT
CAGTATCATC ACTACCGAGA CTTGTGGAAG TACAGCATTA AGGATAGTCA GTGGGCCGAA
TTGAAGCCGT CCAAAGCGGT AGGGAGTCAT CCGACGGCAA GGTCTGGACA TCAGGCCGTC
ACGTGGAAAC ATTTTATGAT ACTCTTTGGG GGCTTTTACG AAGCGCTCCG AGATACCCCG
CGATGGTACA ATGATGTATA CGTCTTTAAT CTACAAACGG AATCGTGGAT GGATGTCCCG
CATTCCAAAT TAACCGCACG ACCGGAGCCC CGGTCGGCTT GCAACGCCGC TGTAATTGGG
GATCAAATGA TTGTGCACGG TGGATTCTCT AAATTGTCCA AGAGTATCCT GGCAAGATCC
AACAACCATA ACAACAGCCA AATCAACCAA AACAATCCAG AGGAGGAAAC GTCAGAAACC
AAAACTCATT CCGACGCTTG GGTCTTGCAG CTCAAGCCCC TCTTGACCGG CCAGCCTCCG
ATTTGGGAAC GGCTTCTGTC CAGTACCCAA CGCGGCCTGG TCGCGGCCAA GAATCCCAAC
AATCGGGCGG GGACGGCTTC GGTTGCTTAC AAATCCCGAT TGCTCGCGTA CGGGGGAGTG
GTCGACCAAG AATCGCACAA TCACAAGATT CAGTCCATCT TTTACAACGA CCTCTTTGCC
CTGGATGTAG CCCGCCGCAA GTGGTTTCCG GTACACGTCA AGAAGATGCC CAGTAACGGC
ACTGGCAGTA AGCGCCGACG CCGGAAAGAG GACTCCACTC CGGAACAATC AGAACTTCCG
GAATCGAAGG AAACATTCGG CGACGATATC GAAGAGTCTG AGAATGATAG TGACTTGGAA
GAAGACGAAC ATGATGACGA CAACGGTGAA ACTCACGCTT GGGACTTGGA CAAGCTGCGT
TCAAACATGT TTGCGTTTGT CGATGGCAAC GGAAACACAA TTTACGAAAA AATTGAGGAA
GAATCCGACG ACGAGGTCGG CTATAGAAAG CAAGACTCTG AAGAAGAAAA GGAAGAGACC
AAGCAGGAGG AGCTGGAACA ACCGAAACCC GTGGCCGACC ATTCAGAATC ATCGTCCGGA
GAAGAAAAGG AGCAGAAGAA GGAGCCCGTC AGTAAAGCAA AAACTTCTCC CCTTGAACGA
AAAGTCATTG CATCTTCGTC CGTAATGGTA GTCGATCCGG AAACGAAGAT TCCCGAGGCG
GTTGCCCGCA CCGAACCCTT GCCGCGTATC AATGCTTCTT TATTGGTAAG CGGTCATACA
TTATTCGTGT ACGGGGGTCT ACTTGAAGTA GGCGATAGAG AAGTCACCTT GGACGATCTT
TGGTCGTTTG ATTTGCGCAA GCGTGAAAAG TGGGAATGTC ACTGGCCAGG GACCATGCAC
AAGCAAGTAT GGCGGGGTGC TATTCATGAT GATGATGACA GTTACTACAG TTCGACTGCT
GCTGCGTTGG ACGATGATGA AGAAGAAAGA GAAAGCGATT TGAGCGATGA CGAAAGGCTA
GAAGAGAAAG GCACAACAAG GAAGCCAAAA AAGAAAAGTT CTGGGCTGCG GCAAGAAATC
GCCGAGCTCA ACGAAAAATA TCATCTAGGT GACGGAAATC GGACTCCGCA ACCGGGTGAA
GCTTTGTCAG ACTTTTACGC GCGGACCTCC GACTATTGGA ATCAACAAGC GGCCGATCGG
ATGCCGGGGA CAACAAGTGG TGTCGTCAAA AACCCGTCGG AGAGACTGTC CAACAAAGAA
CTCAAGCGCG AAGGCTTCGG CCTCGCCAAC GCACGGTTTG TGGAGTTGGA ACCAGTCATT
GAGCGCCTCC ATGAGTTAGA CTTGGAAAGA GAAGAGCGCA AAGGGCTCAA AAAGGAAAAG
AAGGACAAAT CCAAAAAGAA AGACCGGCGG CATTAG
 
Protein sequence
MPKKKGKKAD PEKKAALQAK KEAKADKKAT KRLHKDGSLD PATVGSVDDV DSLLEQYQQQ 
DVAGTVESRN SQALAMEGFP AARANASWTL YEDTKKSHAE AYLFGGEYYD GVENIVLDHL
YKIDLTRNEW KQIVPAGPAP PPRCAHSAAY ANHHIYVFGG ELASADQYHH YRDLWKYSIK
DSQWAELKPS KAVGSHPTAR SGHQAVTWKH FMILFGGFYE ALRDTPRWYN DVYVFNLQTE
SWMDVPHSKL TARPEPRSAC NAAVIGDQMI VHGGFSKLSK KEETSETKTH SDAWVLQLKP
LLTGQPPIWE RLLSSTQRGL VAAKNPNNRA GTASVAYKSR LLAYGGVVDQ ESHNHKIQSI
FYNDLFALDV ARRKWFPVHV KKMPSNGTGS KRRRRKEDST PEQSELPESK ETFGDDIEES
ENDSDLEEDE HDDDNGETHA WDLDKLRSNI KAKTSPLERK VIASSSVMVV DPETKIPEAV
ARTEPLPRIN ASLLVSGHTL FVYGGLLEVG DREVTLDDLW SFDLRKREKW ECHWPGTMHK
QVWRGAIHDD DDSYYSSTAA ALDDDEEERE SDLSDDERLE EKGTTRKPKK KSSGLRQEIA
ELNEKYHLGD GNRTPQPGEA LSDFYARTSD YWNQQAADRM PGTTSGVVKN PSERLSNKEL
KREGFGLANA RFVELEPVIE RLHELDLERE ERKGLKKEKK DKSKKKDRRH