Gene PHATRDRAFT_50617 details

Gene Information

Locus tag: PHATRDRAFT_50617
Symbol:
ID: 7199451
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011700
Strand:
Start bp: 129222
End bp: 132392
Gene Length: 3171 bp
Protein Length: 859 aa
Translation table:
GC content: 47%
IMG OID:
Product: predicted protein
Protein accession: XP_002185586
Protein GI: 219130889
COG category:
COG ID:
TIGRFAM ID:
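
The coordinates and lengths above can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes the start and end positions are 1-based and inclusive (which matches the reported 3171 bp), and that the gene span consists solely of coding sequence, one stop codon, and introns. The function names are hypothetical.

```python
def inclusive_length(start_bp: int, end_bp: int) -> int:
    """Length of a span given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values taken from the Gene Information fields above.
gene_len = inclusive_length(129222, 132392)   # span on the replicon
cds_len = coding_length(859)                  # 859 codons plus one stop codon
non_coding = gene_len - cds_len               # remainder, presumably intronic

print(gene_len, cds_len, non_coding)
```

Under these assumptions the span is 3171 bp, of which 2580 bp encode the 859 aa protein plus its stop codon, leaving 591 bp. That remainder is consistent with the record marking the gene as spliced.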


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00889588
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGATTC CAACGATTCT CACCGTGGAA GAACCAGAAC GGGATCTGTA CGGCCGCGTG 
AAGGAAAAAG CCGCAATCTC CGTCGCTAAC GATCAGTGCA TGGATTTGGC CACAGCAATG
CGAGGAAGCG AGAAGTCTCA CGTAATTCTC GTACATGGTC CATCAGGTTT GGGCAAAACT
GCTTTGTTAC AGCACGTCTT CGAGGAACGG ATAAGGCAAG AGGGCGGCTT CTTCATATAC
GGAAAGTTCG ACTCAGTCCC GTCGGCGAGC CCTGGCCAGC CATTGTGGAC GCTTTCAGCG
ATTTGTGTGT ACAGTGTCTT GCTTCCGAAA GACGATCGAA AATACGGAAG GCGCTTTGCG
AGGAACTAGA ATCAGACGTC AATCAACTTT CACATTTAAT TCCCAACTTC AAGGCGCTCG
TTGACGGCTT TCTTGAATCG GATGACGACA CTGAGGGAAA TGGAGATTGG AATGACATCC
GAGCATCATT CAGGCGAAAC GAATTTGGGT TTACCCGCAT AAAACTCCTC TTTCAACGAT
TCCTCAGAGC AATTTGTCAC CCCATCTCCT CCCGAATAGT TTTGTTGCTT GACGACGTGT
AATGGGCAGA CCAATTGAGT TTGCTTCTGA TAGAGGCGCT CTTATCCGAA GGCGAGCCTG
TAAGCGGGTT TCTGTTAGCT ATCACATCTG ATGAACCAGG ACTTTTTCGA CGAGCTCAGG
TCAATAGAGA TCTCGTAAAT GCTACCAAAA TTCGTATTCG CAACTTTTCC AAGAATGATT
GGATCGAGAT GACAAAGCAA ACCTTGAAGA AAGCGAAAAG CTACATTGCA ACCGAAGAAT
TGGATATTCT CTTCAAAAAG ACTATGGGCA ACCCTTTCTT GACCGTTCTT TGTGTAAAGG
CAATAGACGA GGGAGGCTTG CTTAAGGAAT TAGAAATTGA AGCTGACAAC GGTGTAAAAG
CTGGAATTTT GTCAGTGATC AAGGGCCGCC TGAGCCGATT GGCAAAGCCA GCTCAAGATG
TTTTGTTTTT GGGAGCCTGC TTTGGACTGA GGTTTTGCAT GGATTGGGTG GCTCCTTTTG
TCACGACATA TGGATCAACA CGAAACGCTT CTCTGCAACC GGACGTCATG CCTTTTAATT
GGGGATCGGG ACGTTTGGAC GACTCAGAGC ATTTGTCTGA AAGTGAACTC AAGGAGACAT
TGAACGAAGC TATCGTTAAC GGCCTGGTCA CTAAACGATG CGGTATCCCT TGGTTCGAAT
TCACTCATCA TTCGATCCGC GATGCGGCCT ATGACCTTTT CAGGAATAGT TCGTGTAAGA
GAGAACGAAT ACATTTGCCA ATAGGCGAAC ACACGCTTCG AAAGGTGAGC TTTTCCTCTA
GCTCCGAGGA CGATACTATT CTATGGACAG CAGTGTCTGA ATCTAGGTTC TTCTCAGCTG
CATGAGGAGC GCAAGTTGAT TGAACTTGCT CGTCTCAATG GAAGGGCAGC AGAAAAATCA
ATGCTGAAGT TAGCGTTCTT CTCGGCTGCC CAGTATGCGT CAGCTGGTTT GGAAAAGATA
GGACGAGTTG GAGGATGGCA AACAGACTTT GCTGTGACCT GGGAGTTGAG CACGCATTTA
TGTCGAATGT ACAGTTGCCT CGGTGAGCAT GAAGCTTGTA AGAGAGTCGC CAACGATGTG
GTGTCCCGCA GTTCGTCCAT TTTTGAAAAG CTTGGTGCTT TCGAAGCTGT CATGGAGTCT
TCTCGAGTTG AAGGGAATGT AGAGGAAGCA TTCAACATAG GATTTGACGT CCTCAGAAGC
TTGAATGAAC CTTTGCCCAA CAGAGTTAGC AAAGCGCTTT TAATTTGGGA AATTGTCAAG
ACAAAACGCA TCTTGACCCG GAACCCTTTA AGGATTTGTC AGGCCTACCA AAAATGGCTG
ACGAAAGAGC ATGTGCTACG GTTCGGTTTC TAGGATTACT ATCCCTCAGC TTCTTTGCCA
TGGGAAACTA TTTCAGCTAC TTTGTGGCAT CGTTGAGAAT CGTCAGACTG AGCACAAAGT
ACGGTGTTGC TCGGGAATCT CCAAAAGCAT TTGTGGTCTT CGGGAATATT CTATCCCAAA
CAAGTCGACA CTTTAACGAA GCTAGCCAAT ACTTGTCCGT TGCGCTGTCT TTAGGTGAAA
AGGCGGGCAA ATCTGGAAGA GCGCAATCTC TTGCCGTTGG AAGTTGGATC TTGACGCCTT
TGCAAGGCAC CGTCTCGGAA GCAGCTGGTC AAGCTCTGTA CGGATATCGG CTCGCAATGG
AGTGTGGCGA GGTTGTGTGC GCGTGTACCT CAGTTCTTTC TTATTGTGGC CTCTATTTTT
GGAGTGGACT TCCTATACCA CCCCTAATGA AAGATCTGCC GACTTTCCTG ACCATGCTAA
GCGAATATAA GCAGGTACGG CATTTCAAAT TCTAGCTCTG CATGGGCATC CGACCATTTT
CTCACCCTTT TCTTTCAGAC TGTGCACGAA GTAGGACTCT CCTCACTGAT GTATTTCATT
CAGACATCAA CAGCGGAGGC CAATCCGACT GATTTATATG ACGATACCGT ATGGATGGTC
AGATGTCAGA ATGCAGGAGC CGCTATTCAA GTCAATACCA TTTGTCTATA CCGCATAATA
TATGCTTATT ACATGCAAGA TCTCAGTTCG ATACGAGCTT CTCAACTAGA TGCGCACCGA
GCTGTAAAGG TACAATTGTC AAGAACTTTG CAAGTTGTCG TGTGCTGGCT TTTTGTGGGG
CTCTCCGATT TCTTCTTAGC ACAGTCTGGG TGTGGCATTG AGTTTCAACG AAGTGGCCAA
AAGATTTTAC GCATGATGCG TCGGTTGGTC GTCAAAGGGG ACAGCAAATG CGAGCATATG
TTCATGTTCC TCAGGGCAGA AAAGTACAAA CTTGAATCAA AACAGAGCAA TGAAGTTCTA
AAATCCTACG ACGAGGCCAT TTCTGGAGCC GTACAGGCTG GATTTTTCAA CCATGCGGCC
TTGGCGAACG AGCGCGCCGC ACTATACTGT TTAGCGCGTG GAAAGGAAAA GAAAGCGGCT
CAATATTTCC AAGAAGCCTG GCAAGGGTAC CTGAACTGGG GAGCCCATTC TAAAGTTGAC
CAGCTTGGGG GGTGCTACTC AGCATACATA CAACAAAGTT CTAGAAAATA G
 
Protein sequence
MAIPTILTVE EPERDLYGRV KEKAAISVAN DQCMDLATAM RGSEKSHVIL VHGPSGLGKT 
ALLQHVFEER IRQEGGFFIY GKFDSVPSAS PGQPLWTLSA IYQLSLLLIE ALLSEGEPVS
GFLLAITSDE PGLFRRAQVN RDLVNATKIR IRNFSKNDWI EMTKQTLKKA KSYIATEELD
ILFKKTMGNP FLTVLCVKAI DEGGLLKELE IEADNGVKAG ILSVIKGRLS RLAKPAQDVL
FLGACFGLRF CMDWVAPFVT TYGSTRNASL QPDVMPFNWG SGRLDDSEHL SESELKETLN
EAIVNGLVTK RCGIPWFEFT HHSIRDAAYD LFRNSSCKRE RIHLPIGEHT LRKQCLNLGS
SQLHEERKLI ELARLNGRAA EKSMLKLAFF SAAQYASAGL EKIGRVGGWQ TDFAVTWELS
THLCRMYSCL GEHEACKRVA NDVVSRSSSI FEKLGAFEAV MESSRVEGNV EEAFNIGFDV
LRSLNEPLPN RVSKALLIWE IVKTKRILTR NPLRICQAYQ KWLTKEHVLR FASQYLSVAL
SLGEKAGKSG RAQSLAVGSW ILTPLQGTVS EAAGQALYGY RLAMECGEVV CACTSVLSYC
GLYFWSGLPI PPLMKDLPTF LTMLSEYKQT VHEVGLSSLM YFIQTSTAEA NPTDLYDDTV
WMVRCQNAGA AIQVNTICLY RIIYAYYMQD LSSIRASQLD AHRAVKVQLS RTLQVVVCWL
FVGLSDFFLA QSGCGIEFQR SGQKILRMMR RLVVKGDSKC EHMFMFLRAE KYKLESKQSN
EVLKSYDEAI SGAVQAGFFN HAALANERAA LYCLARGKEK KAAQYFQEAW QGYLNWGAHS
KVDQLGGCYS AYIQQSSRK