Gene PHATRDRAFT_50474 details


Gene Information

Locus tag: PHATRDRAFT_50474
Symbol:
ID: 7199272
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011698
Strand:
Start bp: 159756
End bp: 162213
Gene Length: 2458 bp
Protein Length: 747 aa
Translation table:
GC content: 53%
IMG OID:
Product: predicted protein
Protein accession: XP_002185391
Protein GI: 219130478
COG category:
COG ID:
TIGRFAM ID:
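The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the coding sequence is three bases per residue plus one stop codon; the record states neither convention, and the names `span_length`, `START_BP`, `END_BP` and `PROTEIN_AA` are illustrative only.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive positions."""
    return end_bp - start_bp + 1


# Values copied from the record above.
START_BP = 159756
END_BP = 162213
PROTEIN_AA = 747

gene_bp = span_length(START_BP, END_BP)

# Assumption: 3 bases per amino acid plus one terminal stop codon.
coding_bp = PROTEIN_AA * 3 + 3

# Bases in the gene span not accounted for by the coding sequence.
non_coding_bp = gene_bp - coding_bp

print(gene_bp, coding_bp, non_coding_bp)  # prints: 2458 2244 214
```

Under these assumptions the span reproduces the listed gene length of 2458 bp, and the 214 bp not explained by the 747 aa protein would be consistent with the gene being flagged as spliced.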


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.0241823
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGGATC AGTCGGTCGG ATGGGACCAT GAAAAGGAAA ATGTCGGCTA CTTTCATAAG 
CATGTAGAGA AGGGTGGGCT AATTATGGTC AGTCGCACGC CGCTATGCAG ATTCGGGTAT
CGGAACCGAA ACTCTTTCTC GGCTCTTATT GTATTCATCT ATTTAGTGGC GTACACTTTG
GCCTGGTGCA GACCTTCGAT CAGTTGGAGT CGCCGGATCT CCTGCTGTTG CTTTTCAAGT
CCTTCCTATG CAATCAATCA GGCTTCTAGA GACAACGACG AGGCAAACAT TGATCTCGCG
GTCTTGTCCG TACCACTGGC TGAAGCGACG TCCACACTGC TAGAACTCTG CTCCTCCAAC
AACACAAGAA TCGTACAGAT GGAAGGATAC GTAACGGCGA AACGAGGCTT TGGGTCCTCC
TTTTGCTTCC TGGATCTGTC CGAACGGGGG TGGGAACGTA GACCTGTACA AGTCATGCTG
AAGAGACAAA ATTATAGCCC ACCAATGGGA GCAGATAGCG ATGGGAATGC ACCTACATTC
GATGGGATTT TTAGGTCAAT GCTCCCGGGA ACATACGCTT CCATCACGGG CGTTGCGTCG
CCGACCCGCA ACCCTGGGGA AGCTGTTTTA TTGTGTCGCG AAGTAGATTT GTTGGGCCTC
CCGCGCAACC CGCAGCATAT TCGTGTGATT CTGGAATGCA CCGCCAAAGG TCTATTGCCC
ATTGCGAGTG TCGCTCGTCT CTATAACCAA TCGGCCGAGC AACTTCAGCG TGATCTAGAA
TGCAGCGGCG AAGGATGCTA TGGAAAGCCT TTCGACAGCC TAGCCAAGAT GGTTTTGCGG
TCGCTGCCAG CCGATGAACG GTACCCTGAT TTGGTGAAGT ACAAGCAAAG CTTCCGGCTA
CCCGTTGCAC CCGCCGAGAC TTACTCTTTG CCAGAAAGCG TTAAAATGGC GACAAAGGTC
TCGGTTTCTG GAATTAAAGA GAGCGCTCCA CCGCTTTCTG TCGAGGCCGT CTTGTCACAA
TTTTATGAAT CAGAGGACAT AGATTGTTCT GGCTCCACTT TGCAGCTCCC AGCCTGTGTA
CATGGATGGA TCCAGAATCG TCGACGATTC GATCGCAATG TGACGGTCCT GGAATTGGTG
GATTCCTTGC GGGAAAGCGA CAATAGCACT ACACTGGAGA GCGCGCCTCA CTTTTGCCAG
CGGCTGAAAT GTGTTTCGCA TCCCCAAATA CTTGCCGTGT CGGACACGAT GGCGCATTTG
CTGGCTCCTT CGGCCCAAGT GCAAATACAA GGCGTGGTGA TCGGGGATAA GAATGATGGT
GCACCGACGC TTTGGATAAC AAACATCCGG CTGCTCCAAG CAAGCTGGTG GCCTTCGGTG
ACGCGCTTTC TTTTGGAATT GGTTTTGGAA AAGCGATTTA GTGTCCCCGA TGCAGCTTTG
GCCTTGCAAG TGTCGGAATC GGAAGTTGTC GAGGCCACCA ACGCGAACGC ATTGGACTTG
ACAGCCCGTC AGTGGAAGGC TGCCGAGTTT TCGCAAGCGC TAAAGCTCAA GGCGCAAGAA
GGATCCTCCG TGACTTGTTC CACAGAAGAG ATTTTCATTC TGGAAAAGTA TGAAGCAAAG
TGTGCCGATC AGTTTCCCAT CCAAGATGTA ACGAATCACG TGTCTGAAGT CCATTTTTCG
TCGTCCACGT CGTCTACGAA TGAAACAGTG GTTACACACT CCCGTGAGGG GAGTCGTTGG
AGAAGGCAAA AAGAGCCGCA GTTGGTTTGG ATGGGCAAAC AAGTATTGGA GGTTCTACAA
ACACACCCGA ATTGGAATAG CACACCAGGT CAGTGAGTTT CGTATCCTCG ATGTTGGAGG
CGGCCAAGGT TGGTTGGCGA ACCATTTGGC CCAAACCGTC CCGGAGGCTC GAATTCAAGT
TATCGATATT GCTTCGGGTG CCGTCCAGAA TGGTGCCATG CGTTCCCGGC GCCTCGGACT
ATCCAACAAT AACCGGGTAT CGTATACTGT TGGCGATGCA TCTTCCCCAA ATTTGACGCT
CTGGGAGGAT GACTTCGACT TGGTCGTGGC GTTGCACGCC TGCGGAGGCT TGACCGACGT
GGCTTTGTCG CATGCGCGGT CCCGCCAGAT CCCATTTGTA ATTTGCCCGT GCTGCTACCG
TTCGAATGCA CATTTACAGG TACAAGCGTC GTCATCGTCA TCGTCGTCCA CAAGTTCCGT
CAATATTTCG AGATGGCTCG ACGTGGCGCC AACGACCGGG GCGGATGAGT ACACCATCTT
GACGCGTTTG GCCGAAACGC AGCACGATCT GGTGCTGTCC CGCCGGGCGA CGCACGTCGT
TGGTCGCTTA CGCGCCGCCG CGACGGAACG AGATATACCC AACATCCAGG TATCGCTGTG
GACCTTTCCC GTGGCTTTTT CCACCCGCAA CCTCTGTTTA GTGGGTAAGT ATAGATAG
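The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before any per-base statistic such as GC content is computed. The following is a minimal sketch of one way to do that; the helper name `gc_content` is hypothetical, and how the database itself derived the listed GC content value is not stated in the record.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First row of the gene sequence, copied from the record.
first_row = "ATGATGGATC AGTCGGTCGG ATGGGACCAT GAAAAGGAAA ATGTCGGCTA CTTTCATAAG"
print(f"{gc_content(first_row):.2%}")
```

Passing the full sequence text (all rows joined) to the same function gives a value that can be compared with the GC content field.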
 
Protein sequence
MMDQSVGWDH EKENVGYFHK HVEKGGLIMA SRDNDEANID LAVLSVPLAE ATSTLLELCS 
SNNTRIVQME GYVTAKRGFG SSFCFLDLSE RGWERRPVQV MLKRQNYSPP MGADSDGNAP
TFDGIFRSML PGTYASITGV ASPTRNPGEA VLLCREVDLL GLPRNPQHIR VILECTAKGL
LPIASVARLY NQSAEQLQRD LECSGEGCYG KPFDSLAKMV LRSLPADERY PDLVKYKQSF
RLPVAPAETY SLPESVKMAT KVSVSGIKES APPLSVEAVL SQFYESEDID CSGSTLQLPA
CVHGWIQNRR RFDRNVTVLE LVDSLRESDN STTLESAPHF CQRLKCVSHP QILAVSDTMA
HLLAPSAQVQ IQGVVIGDKN DGAPTLWITN IRLLQASWWP SVTRFLLELV LEKRFSVPDA
ALALQVSESE VVEATNANAL DLTARQWKAA EFSQALKLKA QEGSSVTCST EEIFILEKYE
AKCADQFPIQ DVTNHVSEWL HTPVRGVVGE GKKSRSWFGW ANKYWRFYKH TRIGIAHQVS
EFRILDVGGG QGWLANHLAQ TVPEARIQVI DIASGAVQNG AMRSRRLGLS NNNRVSYTVG
DASSPNLTLW EDDFDLVVAL HACGGLTDVA LSHARSRQIP FVICPCCYRS NAHLQVQASS
SSSSSTSSVN ISRWLDVAPT TGADEYTILT RLAETQHDLV LSRRATHVVG RLRAAATERD
IPNIQVSLWT FPVAFSTRNL CLVGKYR