Gene PHATRDRAFT_16603 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_16603 
Symbol 
ID7198866 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011695 
Strand
Start bp274989 
End bp276788 
Gene Length1800 bp 
Protein Length599 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185079 
Protein GI219129822 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.212191 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCGGTGTC CTATTGTTGT CATCATGGGA CACGTAGACA CCGGGAAAAC GAAGTTACTG 
GATAAGATCC GCAAGACCAA TGTACAGGAA GGTGAGGCGG GAGGGATTAC ACAGCAAATT
GGCGCAACTT ACTTCGAAAA GAAAACTCTT GAACAACAAG TATCAAAACT GAATGCTACG
GAAAATGTAG AGCTCAAAGT GCCTGGTATG CTAGTGATTG ATACGCCCGG TCATGAGTCC
TTTACAAACC TTCGTTCCCG CGGTTCTTCC CTTTGTGACG TCGCTATTCT TGTTGTCGAT
CTAATGCACG GTTTAGAGCA GCAGACGATT GAGAGTCTGA CTATGCTTCG AAAGAGAGGC
GTTCCTTTTG TGGTTGCCTT GAACAAAGTC GATCGGTGTT ACGGCTGGAA AACTATGAAG
GACTCACCGA TCCGCGATGC ATTGAAACTT CAGGACGACA GCACAATGTC CGAATTCCGA
AGCCGTGCCA CGGACGCTAA GGTTCAGCTT CAAGAACAGG GTGTCAACTC GAATCTATAT
TGGGAAATGG GTGACGATGA TTGGACCAAT TCGGATTTTA TTCCACTTGT TCCCACCTCA
GCGGTAACGG GAGAAGGTAT TCAAGACGTT CTCTTGTTGC TTTGCCAGAT GGCTCAGCGC
AAGATAACTG ATCAGATCAT GTGGCATGCA AACTTGCAAT GTACGGTGTT GGAGGTCAAA
GCTATTGACG GCCTTGGGAT GACTGTTGAT GTCCTTGTTG TGAACGGATT TTTGCGCGAA
GGTGATCGTG CCGTTTTCTG TACACTGGAC GGCCCTATTG TGACTGAGAT CCGCGGTCTT
TTGACACCAC CGCCTAGTCG CGAAATGCGT ATCAAATCGG AATACATCCA TCACAAGGAG
GTTAAGGGAG CGCTTGGGGT GAAACTCATT GGCAACAATC TTGAGAAAGT AATGGCCGGT
ACACCGGTAA TGGTTGTTGG ACCCGGTGAT GAAGAGGAGG ACATCAAAGC CGAGGTTATG
TCCGACCTTA CAAAGCTGGA AGACAAGCTG AGTACGGACA AGGTTGGGGT GCTTGTACAG
GCCTCAACAT TGGGAGCACT CGAAGCTCTT TTGCAATTCC TTCGTGAAGA AACAAAGCCC
CCTATTCCTG TCAGTGCTAT TGGAATTGGC AGGATTCACA AGCGTGATGT GACAAAAATT
TCAATCATGA ACGAAAAGGG TCACCCCGAA TTTGCCACTA TTTTGGCGTT TGATGTCGAT
ATTGAGCGCG AGGCCCGCGA GCATGCCGAA GATATGGGAG TTCGCATCAT GACAGCTGAT
ATCATCTATC ACTTATTCGA TCAATTCACC CGATTTATGG ATGAACTCAA CCAAAGGAAA
CGGGAAGAGG CCACTGCTGT CGCTGTTTTC CCAAGTATAA TTCGTGTTTT ACCCCAGCAT
GTTTTCAATC AAAAAGATCC CATCATTGTC GGTGTGGAAA TTGTCGAAGG AATTTTGAAG
GTTGGTACGC CGCTTTGTGT CCCAGCGCTG GGAGGCTTAC ATATCGGAAA GGTAACATCG
ATTGAGATGA ACGGACGCGA ACAAGAGACA GCACGGAAAG GTCTATCGGT TGCCATCAAG
ATCGTGAACG AAAGTAATCC AACGATCACG TATGGGCGTC AGTTCGACTC CTCGCACAGC
CTATACTCAT CTTTAACTCG AGCTTCTATC GATGCACTCA AGGCCCACTT CAAGGAGCAG
CTTGAAAACG AAGATTGGCG GTTGGTCGTG AAGCTGAAGA AGGTATTCAA CATTATATGA
 
Protein sequence
LRCPIVVIMG HVDTGKTKLL DKIRKTNVQE GEAGGITQQI GATYFEKKTL EQQVSKLNAT 
ENVELKVPGM LVIDTPGHES FTNLRSRGSS LCDVAILVVD LMHGLEQQTI ESLTMLRKRG
VPFVVALNKV DRCYGWKTMK DSPIRDALKL QDDSTMSEFR SRATDAKVQL QEQGVNSNLY
WEMGDDDWTN SDFIPLVPTS AVTGEGIQDV LLLLCQMAQR KITDQIMWHA NLQCTVLEVK
AIDGLGMTVD VLVVNGFLRE GDRAVFCTLD GPIVTEIRGL LTPPPSREMR IKSEYIHHKE
VKGALGVKLI GNNLEKVMAG TPVMVVGPGD EEEDIKAEVM SDLTKLEDKL STDKVGVLVQ
ASTLGALEAL LQFLREETKP PIPVSAIGIG RIHKRDVTKI SIMNEKGHPE FATILAFDVD
IEREAREHAE DMGVRIMTAD IIYHLFDQFT RFMDELNQRK REEATAVAVF PSIIRVLPQH
VFNQKDPIIV GVEIVEGILK VGTPLCVPAL GGLHIGKVTS IEMNGREQET ARKGLSVAIK
IVNESNPTIT YGRQFDSSHS LYSSLTRASI DALKAHFKEQ LENEDWRLVV KLKKVFNII