Gene PHATRDRAFT_51018 details


Gene Information

Locus tag: PHATRDRAFT_51018
Symbol:
ID: 7202210
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011680
Strand:
Start bp: 8458
End bp: 13746
Gene Length: 5289 bp
Protein Length: 397 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002181116
Protein GI: 219121527
COG category:
COG ID:
TIGRFAM ID:
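
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, which is consistent with the numbers in this record but is not stated on the page. The function name `span_length` is a hypothetical helper introduced only for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive positions."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the record above.
gene_length = span_length(8458, 13746)
print(gene_length)  # 5289, matching the "Gene Length" field

# Nucleotides needed to encode 397 residues plus one stop codon.
coding_nt = 3 * (397 + 1)
print(coding_nt)  # 1194
```

The coding requirement (1194 nt) is much shorter than the 5289 bp span, which fits the record flagging the gene as spliced, although this check alone does not show where any introns lie.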


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.322976
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GAAGTGTGGT GTATGGTATG CCGTCTCACG CCCCGTTTTA CATCGAGCAT CTGCTATATG 
CGCCTTCACA ACAATCTAGC AAACCAGAAA CCATGGCCGC CCGTCCTCTC GTCAGCGTCT
TTTCTCTCTC CGGTGACAAG TCCGGAGATG TGAGTCTTCC TGCTGTAATG ACGGCTCCTC
TGCGCCCGGA TATCGTTCAG TTTGTGCACA CCAACATGAA CAAGAACCAT CGTCAGGCGT
ACGCTGTTAA CATTCGCGCC GGAAAGCAAG TTGTCGCGTC GTCTTGGGGT ACTGGACGTG
CTGTCGCCCG TATTCCTCGT GTTGGTGGAG GTGGTACTTC CCGTTCTGGA CAGGGTGCCT
TTGGTAACAT GTGCCGTGGT GGACGCATGT TCGACCCCAC CAAGACCTGG CGCAAGTGGA
ATAAGAAGAT CAACATTTCG CAGAAGCGTT ATGCCGTTGC TTCTGCCCTA GCTGCCACCG
CCGTCCCGGC TCTTGTGATG TCCCGTGGGC ATGTCGTCGA CAACGTTCCG GAAATTCCCC
TCGTTGTGGA AAACGCCGTT GAGTCTGCCA AGAAGACCTC GGCCGCGAAA GACATCCTCT
CCGCCATTGG TGCCTTGGAT GATGTCGAAA AAGCGGGAGA ATCGAAGCAA ATCCGTGCCG
GTAAGGGTAA GATGCGTAAT CGTCGTTACA CCCTCCGCCG TGGACCTCTC GTTATCTACA
AATCGAACGA TGGCGTGGAA CAGGCCTTCC GCAATCTTCC CGGCGTAGAG CTATGCTGTG
TTGACCGACT GAACCTTTTG CAGCTAGCCC CGGGTGGTCA CATGGGACGA TTCTGCATCT
GGTCTCAGGC TGCTTTGGAA GAGCTGGACA TTATTTACGG AGAAAATGGC AAGCGCATCC
CCCAAGCAGC CATGACCAAC GCCGACTTGG CCCGTATCAT CAACTCCGAT GAAGTACAGA
GTGTCGTCAA CCCCGCCAAA CCTGGACAGA AGGACAACGC TCCCAAGCAA AATGCTATTC
GTAACGTTGA GGCGTTGGAG AAGCTCGATC CATTTGCCGC CGAAAAGCGC CGTGCCCAAG
CTCGCAATGA CGAGGCTCGT GCTTCGAAGA AGGCCGAAAC TTTGGCGAAA AAGCGTGATA
GCCGCACGGC TAAGAAAGCT TTCAAGGAGC AAGGAAAATC GTTCTATGCA AAGGTTTCCC
AGCAAGGAAC AGTCTGTGAG AATGGCTTTG CTCTAGAGTA AAGACCAAGC AAACAATTAA
AGATATGGCG CCATGGATTT TTGCGATCTG AATATACCAC CTAGCATTTG AGGAGTCGTT
CCTTCGTCAT GAAAGCTGTC TTTCCCGTAG TATTAGCCTC AACCAACTAG GCCACCATGT
AGTTCACAAT CAGTGAAATA ACAAGCGCGA ACCACATCAA TTCGAAAGAA CTAATGTGAG
AGGTTACACT AGTCTATTTA TGGACTAGTC AAGCCGTTGT ATCCATCTCG TGACTGGATT
CGAACATACG TCATTACATT GTTTGCCGCG ACTCTGACAC ATTTCGGAGA GGTTGGGACC
CGATTGCCAT CGCAAGATCG ACATCGAACA AACTTCACTG CTAGGACGGT AAATTACTAT
TCACTGTAAT AACATTGCGA CGTGATTCAG TATGGCCATA CGGACCCGGA TAACTTGTGA
CGTAGTCATT GGGGTTATCC TTCGGTCATG CAACTAGTCT AAACCTTGGG CACATCGGAT
GATGTGCTCA CTTATTTTAT TATAAAGCAC TTTGCAAGAT GTACTTCTAG TCCCCTTGTG
TGCTTGGTTG ACCAACATGC AAATTGATTG TCTCCGTGGA TGCGGGACAA TGGTCAACGA
ACTAATTCAA TATAGTCCGG ATATCGTGTT GACCAGAAAT GTACCTACAC GTGGATATCA
AATCAGACGA CATTGAGAGG ATCTGCGATT TTACCTAAAT CTGTACATCC CACAGATCTG
AAAATCCACT ATCGGATCGA GTCTCTACAA CTACTGCCAT GAAGTACGTC GCCACTGCTC
TCTTCCTCTC CCAGGCTACT GCCTTCACCA TCGTTGGCGC TCCTCGTCTC CAGACACGCC
TCGCTGCTGC TGAGTACGAA GCGATGGATG GTGAAGGAAA AATTAATCTC AAGGTCTGTA
CTTGCAAAAA AATCAGCATG CTTTCAAAGT TCCTTCCTCG CCTAAGCTTT GTTTTTTACT
TTCTAAAGAT TGATTTGGAC TCACCGAAGG TTGCGACGAT GGATGACATT GAAAAAGGCA
AGAAAGTCTA TTGCCGCTGC TGGTTGTCAG GAACCTTTCC CCTTTGCGAC GGTACCCATC
AGAAGCACAA CGATGCTACG GGCGACAATG TTGGCCCACT AATCGTATCC GTGAAGAAGG
AATAGGCAGC CTCGTCGAAG GAAGCCCCGC TTTTTGAATA AAGGACCATT GACTGTAATT
CAATCAAAAG CCTGTCTTGA TGCCAAAAAA AGAACATACC TAAATCCAAT TATTCACGAT
ATCGCCCACA CATATAGCAT GATAATAGTT CTTTTGGAAG TATTCAGACA TCAAGCATGC
TTTGGACGAC TTGCTCTTGT ACGCTTGCCG AGTGTTTTTT TAGTTCAACG GATCGAATGC
GCTTGATAGA TTCCGTCTTC CACATTGGAA CTCCTTCGCT TTCTTTCAGC ATCGCGTTTT
CCGATATAAA GGCCTGGGCA AGCGCTGTCG CCAGCAATTG GCGCAATTGG ATAGTCTCGT
CTCCCGGTGA TTGTCTCGAA CTCAGCGCAC ACAGTCTTGC AAGACATTGC ATTAAGATGG
CTACACCTGT CTTGTCAAAG GCCGACCGTC GGGCAAACTC AGCTAGACAT GATGTGCTCT
TCTCCTGCAA TTCCAATACT TTCTTGGCTA CCTGAAGCAC AGACCAGTGT AGCGAATAAG
GAAGTGCGCA GGAAAGATCA CCAGTGTCGA CCTTCTCGTC TCGCAGACAT GCGACAATGT
GCTCTGCTGT CAAGACAAGG TCATTGGACA GCATGGCGTC AAACAGTAAC ACAACACACT
TGAGTCCCTT TGCTTCTGGT ATTGCGATTA AATCGCTATC CAGCAAATCG ACTGCTTCAC
GAGCACTAAC CGCTACCTTG GAATTCGCGG GTGAATCGCC TACCAAGATC GTAAGCAGTA
ATCTGGCTGT TTCAAGGCAA ACAGATGTCA ACTTTCGATA ACAGGATCCT TGATGAACAA
CAGGCAAAAG ACTGGCTTCC GATTCAATAC CGACTATTAG CCATTGCATA GCCTCCCGAA
GGCAATCGTT CTCGATGTAG TGTCGATAGC GTGACAAGAT TATCGCAACG TAGGCGTCCA
AACCAAGCAA ACTCTTTCGC TCAACAAATG GACGGCAAAG GGATAGAAGC TCCCGCGCCT
CAGCATCAGT GGTAACCAGA AGGTGCTCAA AAATAGAGGC AATGCTTGCG ACTGTCTCAG
GGCAGCCAGC GTCGAGAGCT ACCGGTGAAA GCAAGCGAGT TGCTAAACGT GCGCCAGCGT
CGGAATTGTT ATCTTGAAGA GAAGAAATAA TCGCTGAAGC TGCCTCGCAA ATGAACTCAG
TCTTTCGAAA AGCTGATTCA GAAGGAAGGA TGTTGACTTC GATGTAAGAA TTATAGAGGA
TAGAAAGCTT CATAAATACA GCAAGAGAAA GGTTTTCGCG TGAGGAAGTA GCATCTAGAT
GTGCCGAAGT CCAGCGCTTT TGAGCTCGCT TACCAAGGTT GTCGGCAAAG CAGCTCCACT
CGCTCCTCCG TAAATATACA TTATTTCAAT TGCCCGACCT GCGTTCCCAT TCATAATACT
GAGTAGTTCT TCCATTGGCC CATTGTCAAT TCCCCGACGT TCCATCTCTT TCATAAACCG
GCTGGCACTC GCACTCATAT TCCTCGGTCG TATATCTGGT CGAACGAAGA GGAGCTCAGC
GCACATCTGC TCGGCCCACG AATCGAAGGA AAGCTGCTTC AGTTGTCCAC AGAGAATAGC
CATGATGGAA TCGAGTTCAG GAATACGGCG CGAAAGCTTG AAAGACCCCC GACACTCGGA
TACATACACC TGCCATGATC GATGTTTCTG CATCGCCGCG TCAGAGTTGT ACACCATTGA
GAAATTCGTT TGACCGCTAG TCGCGCCTTC CCAGAACTTG TAATCCGAGC GTTCTACAGT
CAATCCCTCC AAGAAGTATT CTGTCTGTTC AATATCGTCG TCGTGCCAGT CAGGTGTGTC
GAGGCAATCA TCATAATAGT CGTTGCGTCC ACCCGGAAGG GGAGCTCTCA GCATCAACTC
TTGAAGAATC ATGAATCCCT CACGCGTTTC GTCGAGAGTC GATACCAAGT AAGGGTCATC
CGAATACGAA TTGAAATGCT TGTTTGAGCG AACAAACATA GAATGCGTAG ACAAAACATC
CCAGGCCTCT TCCAAACATC CACGTAGCGT TAGGGTACGA ATACACGCCC AAAACAATTC
GCCGTCCCCG TATTGCTCTG GCTGGGCGGA AGCACTCATT TCCTGTATCG ATGGATAGAT
TTTTTCAACC TCGAGCATGT GTTGACATCG AAGATAGCGT ACCATATCCG CTGTGGCCAC
TCCTGGCTTA TCGAAGGGGT CCTGCTGAAA GTTAATATCG TTGTGAGACG GCAACAGTGG
TAAAAATACG TCAGAGAGAT GCATCACGGT GTAAATCGCT TTGAGAAACT CGAGACTGTT
GCAATCGTCC GAAGACAAAG CATCTGGACC CCCCTGCTGT TGCTCGTCAT CCCATCCTTT
CACACAATTT TGAACAGCCG AGCGACATTG CAGTGAAAGC TGTTGAAAGG ATTTGAAAGA
TTCTCGCTGG TCGTCTTCGT AAAAATCTTC GTGCGCTCGG ATAATCGCGT ACAATGCGGA
CAACTCGTGC TTGTTGCAGC CTGCACCTGG ATAGGCAAGT TGCTGAAGTA GAGCTGGATT
TCCATCTTCT GAAGAAACCA AGTAAAGGGA AGCGGGTGAA GATGTGTCCC ACAAGAAAGG
GCTACTCTCA CGAATATATG AAGCCATCTT TGAAGTGATA TACTTGCAAT CTAAACGGGT
AAGAAAGAAG CTGCCTGTCG ACGTAGTAAT TGTATTTGTC TTTGATCGGG ATTGCCGGCA
AACATTGCCG TGTGCCGGTT GCGTGCTACA GGATGCATTG CGCATGTCCA AAGCATGCTG
CAAGTTTTCA TACGCAGGAC AGACGTGATA TTTTACAGTT GGCTTGTTAA GGTGAATGAA
AAGCGATAA
 
Protein sequence
MPSHAPFYIE HLLYAPSQQS SKPETMAARP LVSVFSLSGD KSGDVSLPAV MTAPLRPDIV 
QFVHTNMNKN HRQAYAVNIR AGKQVVASSW GTGRAVARIP RVGGGGTSRS GQGAFGNMCR
GGRMFDPTKT WRKWNKKINI SQKRYAVASA LAATAVPALV MSRGHVVDNV PEIPLVVENA
VESAKKTSAA KDILSAIGAL DDVEKAGESK QIRAGKGKMR NRRYTLRRGP LVIYKSNDGV
EQAFRNLPGV ELCCVDRLNL LQLAPGGHMG RFCIWSQAAL EELDIIYGEN GKRIPQAAMT
NADLARIINS DEVQSVVNPA KPGQKDNAPK QNAIRNVEAL EKLDPFAAEK RRAQARNDEA
RASKKAETLA KKRDSRTAKK AFKEQGKSFY AKVNEKR
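
Both sequences above are laid out in space-separated blocks of 10 residues, so they need to be joined before any length or composition check. The following is a minimal sketch of that step. The helper names `join_blocks` and `gc_percent` are hypothetical, and the GC calculation counts only unambiguous A, C, G and T bases, which may differ from how the database derived its "GC content" field.

```python
def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used to lay out 10-residue blocks."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C among the unambiguous bases A, C, G, T."""
    counted = [base for base in dna if base in "ACGT"]
    if not counted:
        raise ValueError("no A/C/G/T bases found")
    gc = sum(base in "GC" for base in counted)
    return 100.0 * gc / len(counted)


# First two lines of the gene sequence above, used as a short sample.
sample = """
GAAGTGTGGT GTATGGTATG CCGTCTCACG CCCCGTTTTA CATCGAGCAT CTGCTATATG
CGCCTTCACA ACAATCTAGC AAACCAGAAA CCATGGCCGC CCGTCCTCTC GTCAGCGTCT
"""
dna = join_blocks(sample)
print(len(dna))  # 120 (12 blocks of 10 bases)
print(round(gc_percent(dna), 1))
```

Running the same two functions over the full gene sequence should give a length that can be compared with the "Gene Length" field and a percentage that can be compared with the "GC content" field.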