Gene PHATRDRAFT_50640 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_50640 
Symbol 
ID7199477 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011701 
Strand
Start bp55891 
End bp59267 
Gene Length3377 bp 
Protein Length1001 aa 
Translation table 
GC content44% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185602 
Protein GI219130924 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CCGCGAGATA TACCAAACAC CGTCCTTTTC ATTTTGCACA TAGTCATTTT GTTGCATCTA 
CAGTAAAAGA GTCAGTGTTT GCTTCAATGC TTGGAGACGG AACAAGACAA GTCCACAAGG
CCAACAATGG TCCAAGTCGA AATAATGTCT AAAATATATG GTGAGCTTGT AGCATATCCA
CCACTCCGGA TTCTGCAAAA TGGGAAGGAA CATGACCCTT ACGTTGTTGT AGATGAAGCT
ATTGAAAAAT TTGGTGCAGC AGTCAAAGAT CAAATGCTTA CAAAGGAACA AGCTTTGGTG
AAGGTGACAA AAGTTGCTAA TTCAATTGAC TGGCTACCTA CTGAACTGAA AGAACTGGTT
GATGCACACT GTCGTAAAGA CTCAGATGTT GACAATGATG GATATGTCAA TAAGATGAGA
CTTTCAATCA AGGCAAATGA ACTCTTTGGG AAGATGAAGG CCAGCACCTA TTTTGTCAAC
TTTTACCAGC TCAAGCAGGT TGCTTCACGA TTTGCCGCAC ACTGGGGATT TGTTGTTGTT
TCGTCTGGCA ACAAATTGTC ATGCTTTTTT GCCAAGAGCT CAACTAAACC ACGTGAGTCA
ATTGTGTCAC CAACAAGACA AAGAGAGCGG ACAAGCATCA AATCAAATTG TTCATTTATC
ATCCGATCAT CATCTTGTTG CAAGGATGAC ACAAAACCGC GGCATCGGAG AGCTGTGAAA
TTAACATCAT ATGAATTAGA GCACAGCACG GAATGCCATC CAGGTGTTAA GGAGCAGCGC
CTTGCAAGGA AGGCTGCTGG CATAACAATT GGTGGCTTGG ATCTGACAAA GGTCAATGAT
ATTGTGACAT TAATAGCTGC AGGAAATATC AATGCTTGCC AGATGAAGAG TTTGCTAAAA
GATCATGTGC CAGAGCACTA TGCAATTACT GCTTCTGATA TATGCAACAT CAGAAAGCGA
GCAGTCAAGT ACTCCATAGA ACAAACACCA ATAAACATTG CAACAGCACA GAAATTAATT
GATTTTGTTC CATTGGATGA GGATGAGACC ATAATCTCAA AGGATGATGA TGTGTCCAGA
GAGAAAGTGG CTGAATTTAT GAGACAGGTT CTACAGGACA CAGGTGAAGG TTGGAAGGCA
CTTGCATTTC TTGAAAAAGT GAAATCTGAG ACCATTGGCT TTGACTTCCG TGTGCATTAC
GATATTGACG TACGGCCAAT TGGCATTGTT TGGATTACCA AAAGTATGCG CAAAGCCTGG
ATCCGGTTTG GGAGCACAAT ATACCTGGAT GCTATGAAAA GGAAAATGAA CAGTCTCCAC
TGGCCATACA TTGGTCCTGT TGCAATGGAT CATGAAATGC GAGTGGTTCC ACTCTGTGAA
AGTATCTGTT TGGGGGAAAC ACTTGCCGCA TATGCATTTG CCTTAAATTC TTTGGAGCAA
ATGGAGCCAC GGCGAAAATT GGCCTCTATC CGTCTCATCT ACGCAGACTG CTTTTTAACA
GATGCACTCC TACCATTGGT AGGTCTGAAG CGCCCTTCCA CAACTCTTGC ATGGGATTCC
TTTCACCTGA AGTCAAAAGT ATGGCCAGAA TACTTTGGAC CAACCCTGTT TGACCAGTTA
AAGGCTTCAC TTGGGAAGAT GCTTTATGGA AAGTCACGCA AAGAATATGA TGAAGCATAC
CAGGAGATAG CACAAACACT GGCCCACAAC CCTGCTAAGC TTGAGTATGT GAAGGGATTG
TATGATCACC CAGAACGGTT TGCTCACCAC TTTATCAAAA CCATACCTGG AAATCTTGGC
AAATCCAGCA GTCAACCAGC AGAATCAAAC CACTCTAGTG TTGTAGCTCG AGTTGGTCCA
GGCTCATCAC AAGACATTGT CAAAGAAATC AGTGCACTTC TGTACCGACG ACAAGACTTG
GCCAACTTGC ATGAACAAGA AGACGCAAGA TATGAGCTAT TATCCTTTAA TAGGGCTTCA
AAAACCAAAA CAACCACACT ATTGCATGAT CTGTCAGATG CTGGGGCACA CAAAGCACTC
TCAAAGAGAT TCTTCAAAGA TTATTGGTGT CCTCTTTCAG AAGAGGGAAA GAGCTATACC
CATTTATGTC TTCCATGTGG CTCTCACCAA ATTTTTCACA TGGATACTCC TAAGCTGGAT
GATTTCGCCA TTGTTATAAA GAAAGGAGAG CGTTGCACTT GTTCCCAGCA AAAGGAGTTT
GGGGGTATGT GCCTGCATGA ATATGCTCTA CATAATCACA CCTTTGAACT TAGCTTGTTT
CCAGAAAGAA TGCTCCAGCC ACACTTGCAA ACAGCAATTC GTCCAACCAG CTCCAACAAT
GATACTTATG ATCATTATGG TTGCAATGAT GATGATCTAG TGTTGGTCAA GGGCAACAAT
GCTAATGAAG TCAATGTTGA TGAAGTTAAT ATTCATGCTG ACACTCGCAA TGAACCCTCC
TCTTCTGAGG ATGATAGCAT CCCATTAGCT ACATTGGCTG GTAAATGTTC TACCAACACC
AAAATGCAAC CCCAGAAGAA ACAAAAGTGT GCTGCTGTGA CTCATGCTGA AGCAACTTGT
GTTGCTGCAG CAGTGGTAGA CTACATTGTT GGTGGCAAAT CGGAAATGAG CATGGTCCTC
TATTCATCTC TTCAGCATTT GCTTGAGATT GCTCGTGGAA CAAGTGCCCG CTCAGCTGCC
AGCATAGTAC AATCTGCTAG CCTGGCTGTT GAACTGAGAA GAAAGCATCC AGCTCTGCAT
CAACCAGTGG CTGGTCCAGA GGCCAACACA GTAGGAAGGA CCCGATCCAA GAGACTAGTC
TCAAAGGTGA TGGCCACATA TGGAGGGACA TCGGTGCGTA GCTACAAAGT AAAGAATTCC
AAATGCAGCT TCTGCCACAG AGCCACATGC AGAAATATTC AAAGCTGCCA GATACTGAGG
GACCTTGGCC GGAGGATTAC AAAAGAGGAG CTTCCTCGCT TCCGGCAAAC AGAACTTTGC
TGCTCACAAG CCATCAGTGA CGGATCAAAG TTAAGTTCCC TTATTACAGC CAACAAACCT
GTGCTCATCA GCCTCCCAAG ACATACTAAA TGGCTTGTTA TTCATGGCCT GTACAATCTA
TCAGGAAGCT TGGCTGCTGC CAAAGTTTCA AACAATGTTG GGGTGGAAGT TACATGCTAT
GGCAACATGG GAACAATTAT GGAAGGACTT ATAGAAGGTG CTGCCAGCTT TGATCACAGA
GTGGCCACAT ACAGCACTGT AACAGACTGG ATTGCAACCT CTGCTTTGAC AGGAATGAAT
ACCATGACAC GGTTGATTGC AAGCAACAAA TTTAATAGCT TAACTGGTAG CATCTAATGA
CTACTTACTT GTGCTTT
 
Protein sequence
MVQVEIMSKI YGELVAYPPL RILQNGKEHD PYVVVDEAIE KFGAAVKDQM LTKEQALVKV 
TKVANSIDWL PTELKELVDA HCRKDSDVDN DGYVNKMRLS IKANELFGKM KASTYFVNFY
QLKQVASRFA AHWGFVVVSS GNKLSCFFAK SSTKPQHSTE CHPGVKEQRL ARKAAGITIG
GLDLTKVNDI VTLIAAGNIN ACQMKSLLKD HVPEHYAITA SDICNIRKRA VKYSIEQTPI
NIATAQKLID FVPLDEDETI ISKDDDVSRE KVAEFMRQVL QDTGEGWKAL AFLEKVKSET
IGFDFRVHYD IDVRPIGIVW ITKSMRKAWI RFGSTIYLDA MKRKMNSLHW PYIGPVAMDH
EMRVVPLCES ICLGETLAAY AFALNSLEQM EPRRKLASIR LIYADCFLTD ALLPLVGLKR
PSTTLAWDSF HLKSKVWPEY FGPTLFDQLK ASLGKMLYGK SRKEYDEAYQ EIAQTLAHNP
AKLEYVKGLY DHPERFAHHF IKTIPGNLGK SSSQPAESNH SSVVARVGPG SSQDIVKEIS
ALLYRRQDLA NLHEQEDARY ELLSFNRASK TKTTTLLHDL SDAGAHKALS KRFFKDYWCP
LSEEGKSYTH LCLPCGSHQI FHMDTPKLDD FAIVIKKGER CTCSQQKEFG GMCLHEYALH
NHTFELSLFP ERMLQPHLQT AIRPTSSNND TYDHYGCNDD DLVLVKGNNA NEVNVDEVNI
HADTRNEPSS SEDDSIPLAT LAGKCSTNTK MQPQKKQKCA AVTHAEATCV AAAVVDYIVG
GKSEMSMVLY SSLQHLLEIA RGTSARSAAS IVQSASLAVE LRRKHPALHQ PVAGPEANTV
GRTRSKRLVS KVMATYGGTS ILRDLGRRIT KEELPRFRQT ELCCSQAISD GSKLSSLITA
NKPVLISLPR HTKWLVIHGL YNLSGSLAAA KVSNNVGVEV TCYGNMGTIM EGLIEGAASF
DHRVATYSTV TDWIATSALT GMNTMTRLIA SNKFNSLTGS I