Gene PHATRDRAFT_45684 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_45684 
Symbol 
ID7200462 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011675 
Strand
Start bp922652 
End bp924638 
Gene Length1987 bp 
Protein Length561 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179746 
Protein GI219117921 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
AATCTTCGAA GGGGGAGGAG GAGAACAACT CTACGTATAT CCTTGATTTG CTTCTATTGT 
TCTTTGTTTC CCGAATAGAA TAGTGTACAA CAATGAGAGA TGGGGCAACG TTAATAGCGT
TTCTATTCGC GAGCGTCTTC TCGGCAAGTA CCATCTCGGC TTGGATTCCC CTGCGTATAT
GCACAAGATC TCGTGTCCGC AGTCGCAACG TCCTTGCGGC GTCGTCGGAA TGGTCCGCCA
CGGACGATTG GAACAACCTT TCGTCCGAAA ATCCGGACAA TGGACGACAG GATTACGCCG
TCGATCAGGA TTTTGCACAG CGCGAGGCGA TTAGGATGCA GAATTGGGAT CTGGATAGCC
TCGACCCAAC GGCACTATCC CCGGAAGACG CTTGGTTGCA GGACGCGATT GAAACCGTTT
TATTGGACAG CACGATTACT CCAGAGGAGC GCTTGGATAC CCAAGACTTC TTGGAGGATA
TGGGTAGAGA AATAGCTTTA CTCGTTCGTT GCAACCAAAG TCCGCAAGAA ATGCTTATCG
CTGCAGGTAA GGCCTTACCC ATCTTGACGA CCGAAGACAA GCACAACCCA CGACAACTTG
TGCGATTGGA ACCCTCCAAC CAGAACGAGG AGACGGAAGG GGAAGTCGTG GTTTGGGCCG
CTACCGATTT CTTGAAAACC GCCACGCGTG TCATGTTCGA ACAACACGCT CACAACGCGA
AGACCGGCGC GGGGGACACG AAGGCCATTT TGGATCCGCG TGGCGTCGCA TCGTGGATGA
AAAAGAGCTT GCGGGAAGGC GCGATTGGAC CGCACGATCC TCGTGTAATG TTCATCATCT
CCAAGTTTGG AACCTACGGG ACCGGAACCT TGCAGTACGA AGACTTTCTG AATCTCTACG
TTTCCACCAT TTGTGGAAGC TCCCCTAGTC GTTGGAAACA GCTCGAGTAC CGTAGCGAGG
AAATTGAAGC CGTCTGGCGA GATCTTCGCA ATCACGACAT TGTTTCTCCC GTGGAGCAGG
AGCGGGTAGC CCTCTTGCAA AAAATGAAGG AGAAATACGA AGAGAGCTTT TCACACGTCA
CGGACGAGAC ATTGCTGGAC GAGTGTGAGA TTATTGACGA TAAAGTCGCC TCATGGGAGG
AGACTTCCCA GGGCCAATGG CGGCAGACTG GCAAAAGCAG CCACGAGCTA GTCGAATTGG
CGTACGATGG CAAAACACCC CTGCGTCTCA AAGACGGTGA ATTTGTCTTT ATCGACGAGG
ATTCGTGTAT TGGATGCAAG CAGTGCGCCT CGGCATCTCC AGCATCTTTT CATATGCTGG
ACGACGGTCG GGCCCGTACG TTTGCACAGC GCAATAGCCT TGATGTCAAG GCTGCTGTTG
CGGTGTGCCC TGTCAGCTGC ATGCACTATG TGGGTTTTGA CCGTCTCAAA GAGTTAGAGA
CTTCGCGTGA TTCGCCAGAT GGCGATGGTC GGACGGATCA CCGCCATTTT GGACAAAACC
ATCGAAACGG AGGCTATATT GCGAGAGCGC CGCTGCAGTA AGTTTTGTCA TCGAACCTTA
AATCTATTTT GATTATTTTT AAAATCTTAC ACCATTTCTT TTTATGATTG GTAGTTTGAC
GAGAAGAGAC AGCGACGCTA ACCACAAGAG CTCCTGGTAT CATTACTTGG TGAACAAGTG
CTATTGTAAG TGTCGAGTCC GACGGAAAAT TGGACTTTCC CTTCAATACG TATTCTAACC
CTTCTGTTGA TATTGTCTAC AACAGTATCA TCCGATTGTC CTCAGAGGGG CTGTTTTGAC
TGTCCTCAAT TTCGCACCCA GCCAGGTAGC AACCCGAGTT GCCAATCGAA AATGAAAGAC
GCACTGCACA TCAAGGCGGA ACATTTTATC CAAACCGGAG AAGCTAATCT TTACCGTAAA
TCGGCCGACC TCTGAAAATG AGAAAGATAC TCAGCCGGAA ACATATAGAC GCTTTGTTTT
GATTAGA
 
Protein sequence
MRDGATLIAF LFASVFSAST ISAWIPLRIC TRSRVRSRNV LAASSEWSAT DDWNNLSSEN 
PDNGRQDYAV DQDFAQREAI RMQNWDLDSL DPTALSPEDA WLQDAIETVL LDSTITPEER
LDTQDFLEDM GREIALLVRC NQSPQEMLIA AGKALPILTT EDKHNPRQLV RLEPSNQNEE
TEGEVVVWAA TDFLKTATRV MFEQHAHNAK TGAGDTKAIL DPRGVASWMK KSLREGAIGP
HDPRVMFIIS KFGTYGTGTL QYEDFLNLYV STICGSSPSR WKQLEYRSEE IEAVWRDLRN
HDIVSPVEQE RVALLQKMKE KYEESFSHVT DETLLDECEI IDDKVASWEE TSQGQWRQTG
KSSHELVELA YDGKTPLRLK DGEFVFIDED SCIGCKQCAS ASPASFHMLD DGRARTFAQR
NSLDVKAAVA VCPVSCMHYV GFDRLKELET SRDSPDGDGR TDHRHFGQNH RNGGYIARAP
LHLTRRDSDA NHKSSWYHYL VNKCYLSSDC PQRGCFDCPQ FRTQPGSNPS CQSKMKDALH
IKAEHFIQTG EANLYRKSAD L