Gene PHATRDRAFT_50440 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_50440 
Symbol 
ID7199253 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011698 
Strand
Start bp67018 
End bp70241 
Gene Length3224 bp 
Protein Length965 aa 
Translation table 
GC content58% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185423 
Protein GI219130544 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CACGCCAAAA AAAGCTTGCA CCGAGACCAC AATCGATCGG ATACTTTCTC TCTCCGACAC 
CTAGAGTTAC TGATTCCAGC CCAGGTTCTT CCTACTTCTA TTTTCGTTCA CCCATCAACT
AACTGCTACT GCTTCAACCC ATGACGGAAA CGAGTCTGCC CAGCCAAACG GCGCTTGCGC
CCAGCACCAC CAAGCCCGCT CTCGTACTAC CCAAACAAGG CGTCGCCAAG GTCAAGTCCG
TCACTTCGGG GGATACCGTG GTACTTTTGG GTAAGCCTCC GCAACCCAAT CTGCCCTGTC
CCGAAGTACT CTTTACCCTC GAAGGCCTTT CGGCTCCGGT ACGTAGGCCG TCTCGATATC
GACAAACATC ATTATCACCC ACAGTTGGTA TTCACATACA GATACATACC TACACCGATT
GAACGCGTGT TGGAATACTC CCGTATACTC GCCGTTGCAC TCGTGCTCTG TGTTTCTTCT
CCCACTCACT TTTTCGTTCG TTCGTTCGCT CACTTGTTCC GTAGAGAATG GCGAGCAAGG
TCAATCCTAC CGACGAGCCG GGCGCCTTTC CCGCCCGCGA ATGGCTCCGT CAACAGCTGG
TGGGCAAAGT GGTCCGCTTC GAGACTCGCA AGCAGCCGAA CAGTGCCGGT GATCGCGTCT
ACGGCTGGAT CTTTTTGCCC GCCACCGCTC CCACGGATCC TCCCGTACAC GTAGCCGTGG
AATGCGTGCG TGCGGGACAC GCGACGCCCA AATCGCTCAA GTACGCCACC GGCAACGACA
CGGAGGCTCC GGCCGTCGTA CCCACCGCGC CGTCTCCCGA CGATGCACCG GAAGTCGCGG
CCGCCAAGGA ATACGAGCTG CAGCTCGGGA AAGCCTACGC GGAAGCCAAG TCGGCACGGG
TGGGTCTGCA CGCCACGGAT CCCCTACCCC TCGTACGGAC CCTCCGCGTC GCCAACGAAG
ACTTCGCGAC GCTCCAGTTC GTGGAAGCCG TGCAAAAGCA CTGTACCCAC AAACGGATTC
GTTGCGTCAT TGAATACGTC TTTGACGGAT CCCGGCTGCG TCTGCACGTG ACGGATGCAC
AGTTGCCCGA GTTCCAGTAC ACTTCCTTTA CCCTCTTGTT GGCGGGAGTC ACGTGTCCCC
GGCTCGGGAG CGCCAAGTCC GATCCACCCA CTCCGAACGA ACCCTTCGCC GTGCAAGCCC
GGGAATTCAC GCAGACCAGA CTGCTCCAAC GCGAACTCGA CGTGTCTCTC GTCGGCACCG
ATAAGGTCGG ATCCTCCGCC GTCGGAGTCG TCCATCATCC TGTCGGCAAT ATCGCCGTCG
AATTGCTCAA GAACGGCTTG GCCCGCATGG CGGACTGGAG TGTCCGCCTC CTCGCGGTCG
GCGATGTTCC GGCGCTCCGC GTCGCGGAAA ACACGGCCAA ACGCACCGCC TTGAACGTCT
GGCGCAATTA CGCTCCACCC ACGCTGCAGA CGGCGTCGCA AGTCTCCGGA ACCGTGGTCG
AAGTCGTGTC CGGCGACACC GTCCTCATAC TCCCCGACGG CAAGGCCTAC GACAGTGAAG
CCGTCTTGTA CAAGGTCTCG TTGGCGTCGA TGCGCGCCCC GCGGGTCGGG AACGAACGCG
CTGGACGGCC CGACGAACCC TACGCCGTCG AGTGCAAGGA GCGTTTGCGC GTCTTGACTG
TCGGTCGGGC CGTGAAAGCC CAAGTCCACT ACGAACGCGA CATTCCACTG CAACCCGGTG
TCAACGAAAC GCGGCCCTTT GCGACCCTCT CCACACCCAA GTACGAGGAC GTGGCCGAGG
TGCTTATTCA GGAGGGACTG GCTGTGACAC AGCGTCACCG GGATGACGAC GAAACCTCGG
CACGGTATGA TGAATTGCGG GCGGCCGAGG CCACTGCCAA GGCGGCAAAG AAGAATACGC
ACTCGGAAAA GGAGTACAAG AGTGCCACCA TCAATGATTT GACCGATCCA CGAAAGGCCA
AATCGTATTC CGGTTCCCTC ATGCGCTCGG GCCACACCAA AGCCATCGTG GACTACGTCT
TCAACGGCGC ATTGTTCAAG CTGTACATTC CTTCGGAAAA TTGTTACATA CGCTTCGCGC
CAAACTCGAT ACGGTGTCCG CAACCATCGC CGAGTCCGGG TGGTAAGGTG AACAAGGCAG
CCGAGCCTTT CGGCGACGAG TCGAAGCGCC ACGCGCGACT TCACGTCCTA CAGCGTCACG
TAGAAATTGT GTGCAACGGT GTCACCAACA GTGGAATTAT CACGGGGGAC ATGATGGTCG
GACAAGGTGG ACAACGTCGT GATTACGCCA TCGAGTTGGT TGGTGCCGGC TTGGCCACGG
TCGACCAACG CAAGATTGAC TATGGAGAGG CACCACGATC GCTCGTTGAC GCGCAATCAG
CAGCACAGGA AAGTAAGGTC GGTCTATGGT CGATTGTCCA AGAGCAACCC GAAATTAAGG
TTGCCAAAAC AGCAGTCAAA GCCAAGGAAA CGGTCGCCAC GATTCGGTTA AGCGAGATTC
GCAGCGGGAA TCACTTCTTT TATCACGTGG TGGATGATGA AACAGCCAAG GTTGTGGAGG
AATCGATGAA GGTTTTCACC AAAAGCCACG GCACGGGCGG CGCTCCGTGT GACGCTAAAA
TTGGCAAAGT GGTTGCCGCC TTGTTTAACG ACGGCAGCGG AAAGGCATGG TACCGTGCCA
AAGTTATCGA ACGCAAAGGG CCTGGCAAGA TGGCGGTATT GTTTTTGGAT CACGGAAATG
TGGCGACGGT CCCGGTGGCA ACGCATCTGC GCCCTCTCGA TATGAACCTT GGGACAGATC
GTATTCCACC GGTGGCCAAG GAGGCAGTCC TAGCTCTCAC CAACACGCGA CCATTGGACA
GCGATGAGGG TATGGATGCG GCTCGACTGT TGCAAAGCAA ATGCTGGGGT CGCAACTTGA
CGGCCCGGAT TTTCGCTCCG GACGAGTCAG GCAAAGCGGC TCTATCCATC GCGACGGAAG
CTGGTTCCGA CGAAGAAACT ATCAACGCAA GTCTGGTGGT GGAGGGGCTA GCTCGCGTGG
CCAAGCCAGA AACTGTGACG AGCATCTCGA GTCGTATGAT CGATCCTTCG TCATTGGTCG
AGTTGGCGGC GGCACTCAAC GTGGCCCAGG AAGTGGCTCG CAAGTCTCGA GTTGGTATGT
GGCGGTATGG TGATATTGGC GACGAGGATG ACGACGATAT GTAA
 
Protein sequence
MTETSLPSQT ALAPSTTKPA LVLPKQGVAK VKSVTSGDTV VLLGKPPQPN LPCPEVLFTL 
EGLSAPRMAS KVNPTDEPGA FPAREWLRQQ LVGKVVRFET RKQPNSAGDR VYGWIFLPAT
APTDPPVHVA VECVRAGHAT PKSLKYATGN DTEAPAVVPT APSPDDAPEV AAAKEYELQL
GKAYAEAKSA RVGLHATDPL PLVRTLRVAN EDFATLQFVE AVQKHCTHKR IRCVIEYVFD
GSRLRLHVTD AQLPEFQYTS FTLLLAGVTC PRLGSAKSDP PTPNEPFAVQ AREFTQTRLL
QRELDVSLVG TDKVGSSAVG VVHHPVGNIA VELLKNGLAR MADWSVRLLA VGDVPALRVA
ENTAKRTALN VWRNYAPPTL QTASQVSGTV VEVVSGDTVL ILPDGKAYDS EAVLYKVSLA
SMRAPRVGNE RAGRPDEPYA VECKERLRVL TVGRAVKAQV HYERDIPLQP GVNETRPFAT
LSTPKYEDVA EVLIQEGLAV TQRHRDDDET SARYDELRAA EATAKAAKKN THSEKEYKSA
TINDLTDPRK AKSYSGSLMR SGHTKAIVDY VFNGALFKLY IPSENCYIRF APNSIRCPQP
SPSPGGKVNK AAEPFGDESK RHARLHVLQR HVEIVCNGVT NSGIITGDMM VGQGGQRRDY
AIELVGAGLA TVDQRKIDYG EAPRSLVDAQ SAAQESKVGL WSIVQEQPEI KVAKTAVKAK
ETVATIRLSE IRSGNHFFYH VVDDETAKVV EESMKVFTKS HGTGGAPCDA KIGKVVAALF
NDGSGKAWYR AKVIERKGPG KMAVLFLDHG NVATVPVATH LRPLDMNLGT DRIPPVAKEA
VLALTNTRPL DSDEGMDAAR LLQSKCWGRN LTARIFAPDE SGKAALSIAT EAGSDEETIN
ASLVVEGLAR VAKPETVTSI SSRMIDPSSL VELAAALNVA QEVARKSRVG MWRYGDIGDE
DDDDM