Gene PHATRDRAFT_19761 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_19761 
Symbol 
ID7199943 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011674 
Strand
Start bp748175 
End bp750501 
Gene Length2327 bp 
Protein Length702 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179281 
Protein GI219116973 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CAAACGTGCG TGAGATCCTG CTTGCTTCGC GCACAACTTC CTATTCTACC TTTGGGATTA 
GGAGTTTTCC GTGCGCTTAC TACGTTCGCA ATTTCTGCAG CATGACTGTC CAGACGAACG
AACAACTTCA AGATGAACTG GACGCTCTAA ACGCTCAGAT TACCAAACAG GGTAGCGCAG
TCCGGGAGCT TAAGAAGGCC GGTGACGCAG ACGCCGTCGC GGAGGCCGTG GCCAAGCTTC
AAGCACTCAA GATCAATGCA GCGGAAATGG GCAAGAGCCT CGTCTCGGAT GAGCCCGAAT
TCAACCGAAA GGCGTTCGAC GAGCTTGTAT TGCGGAAAAT GTTTGTGGTA CCCTCGTTCG
AGATTCATGG GGGTGTGAAG GGTCTCTTTG ATTTGGGTCC TCCAGCTTGC TCTCTCAAGG
TTCGTAGCGA TCAGAGTCAT CGGACGCCTT TTAATCCAAT TATACGCGCA TTGTCTTATA
TGAATTTTTC TCTTGAAGGC TGCCATGATC GATCTGTGGC GCAAGCATTT CGTACTTGCT
GAAAGTATGC TGGAGATGGA ATGCACCTGT TTGACCCCCG AGGCCGTCCT CAAGACAAGT
GGTCACGTGG ACCGTTTCAC AGATCTCATG GTAAAAGATC CGCAGACAGC GGAATGCTTT
AGGGCTGACA AGCTCTTGGA AGATGCCATC GATTCGTTGC TGGAAGCCAA CCCGACAATG
CCGGTGGAAG AACGGGAAGA TCACCTTCGT ATTCAGCGAC AGGCGGACGC CTTTTCGCCT
ACCGAATTAG ACGAACTTTT GATTAAGTAC GATTGCAAGG GACCTTCCGG TGAAGCCTAT
ACACCGTCTT TTCCCTTCAA TCTCATGTTC AAGACCAGCA TTGGCCCGGA AGGTACTTCG
GTAGGTTACT TGCGTCCAGA GACAGCCCAA GGATTGTTTG TCAACTACCG TCGGCTTTTG
GACTTGAATG CTGGAAAAAT GCCTTTCGCC GCTGCCCAAA TTGGTTTAGG ATTTCGCAAC
GAGATTGCCC CACGTTCAGG CCTTTTGCGT GTGCGCGAAT TTTGTATGGG AGAGATTGAA
CATTTCGTCA ATCCTAAAGA CAAATCGCAC CCAAATTTTA AATCTGTCAC CGATAAAGAG
CTGGTTCTTT TTGGTAGAGA TGATCAACTG GGCAGCGGAA AGACCAAAAC AATTGCCTGC
GGTGAAGCCG TTGAGAGTGG CTTGATCAAC AACGAAACTT TGGCTTATTT CATGGCGCGC
ACACAGCTCT ACATGGAAAA GATCGGCATG GATCCCCAGC GTTTGCGTTT CCGCCAGCAT
TTGGCTACCG AAATGGCTCA TTATGCCGCC GATTGCTGGG ATCTGGAAAT CAAATCAAGT
TATGGCTGGC AAGAATGCGT TGGACACGCT GATCGCGCTT GCTACGATCT CGAGGTGCAC
AGCAGGGCAA CCAAGACCTC TATGGTGGCC ACTCAGAAAG TTGATCCACC ACAAGAAATG
GAAGTTGCCA AACTCAAGTT TGATCGAAAG TTGCTCGGTC AAGCCTTCAA AGGTGATCAA
CGCGTGGTGT CCGGTATGCT CGAGTCGTTG GCCGAAAGTT GGACGGACTT TGAGCCTATT
GCGACGAAAT TGGAAAACGA TGGAAAGACG ACGGTGGAGG GTTTTGAGAT CACTAAAGAA
ATGATGACTT GGACGAAGCA GAAAAAGACA GTTCACGAAA TTAAGTTTAC TCCCTCCGTA
ATCGAACCTT CGTTTGGGAT GGGGCGTATC TTGTATTCGC TATTGGAGCA TTCCTTCTAC
CAGCGCGAAT CGGACGAGCA GCGCGTAGTT ATGAGGTTTA CTCCGCAGGT GGCCCCTGAA
AAATGTGCAG TATTGCCGAT CAGCAGCAAT CCTGAGTGCA ACGAAATCGT TGATGACATT
GCTCGTGACT TAATGGACAG CGACTTGGCT ACTAGAGTTG ACAAGTCTAC GGCAGCGATT
GGTCGTCGCT ACTCTCGTTC TGACGAGCTC GGTATCCCTT TTGCCGTGAC GGTGGATTTC
GATACTCTCA ACGACGGAAC CGTGACGCTC CGTGAACGCG ATTCCACAGT TCAGGTTCGC
CTCCCCAAAA ATGACATTAA CTACCTTCTT TTTCAAATAG TTCACAGCCG CATGACTTGG
GAGGACGTCA TGAAGAAGTA TCCTGTTGTT TCAACTGGTG ATGATACTGA GGTAGCCGCG
GAAGACGCTG TAACTGTAGT TGAGCAGACG TCTCGTGGCG CATTCCGGCG GCCGGCCCAG
CCTAAGTAGC TGACTAAAGA TATTAACTAT GAGACAGTTC AGATTTC
 
Protein sequence
MTVQTNEQLQ DELDALNAQI TKQGSAVREL KKAGDADAVA EAVAKLQALK INAAEMGKSL 
VSDEPEFNRK AFDELVLRKM FVVPSFEIHG GVKGLFDLGP PACSLKAAMI DLWRKHFVLA
ESMLEMECTC LTPEAVLKTS GHVDRFTDLM VKDPQTAECF RADKLLEDAI DSLLEANPTM
PVEEREDHLR IQRQADAFSP TELDELLIKY DCKGPSGEAY TPSFPFNLMF KTSIGPEGTS
VGYLRPETAQ GLFVNYRRLL DLNAGKMPFA AAQIGLGFRN EIAPRSGLLR VREFCMGEIE
HFVNPKDKSH PNFKSVTDKE LVLFGRDDQL GSGKTKTIAC GEAVESGLIN NETLAYFMAR
TQLYMEKIGM DPQRLRFRQH LATEMAHYAA DCWDLEIKSS YGWQECVGHA DRACYDLEVH
SRATKTSMVA TQKVDPPQEM EVAKLKFDRK LLGQAFKGDQ RVVSGMLESL AESWTDFEPI
ATKLENDGKT TVEGFEITKE MMTWTKQKKT VHEIKFTPSV IEPSFGMGRI LYSLLEHSFY
QRESDEQRVV MRFTPQVAPE KCAVLPISSN PECNEIVDDI ARDLMDSDLA TRVDKSTAAI
GRRYSRSDEL GIPFAVTVDF DTLNDGTVTL RERDSTVQVR LPKNDINYLL FQIVHSRMTW
EDVMKKYPVV STGDDTEVAA EDAVTVVEQT SRGAFRRPAQ PK