Gene PHATRDRAFT_45310 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_45310 
Symbol 
ID7200014 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011674 
Strand
Start bp801504 
End bp803913 
Gene Length2410 bp 
Protein Length780 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179510 
Protein GI219117431 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGTGGG AATCCGATCA TGTTAACGAC ACTCATACCG ATGGTGATCT AGCTAGTGAA 
AATGGCTCAG TCTGCAGCGC TACGGACGAG AAAGTGCTTG CTCACAAAGA AACGAAAGCA
GTGAATATAT TACGCGCAGT CACATTCTTT GTTCTTTTGT TGGCGACTGC GACTGTTTCG
CTGTTGGTCT TTTTTTATGG TCGCGACAAG GAAAATGACG AATTTTTGAA ACAATTCGGC
TACAACGCGG GAAAAGTAGT CGATTCCTTC CAGGTGAACG CAGAGAAGCG GTTGTCCGCG
CTCGAAGGAT TCGCCACTAT GATCACATCT CACGCACTTT TCGCAAACGA AACGTTTCCG
ATGGTGACGT TGCCAGACTT TGAGCGAAAA GCGTCCTACA CTCTGCAGCT TGCACAAGTT
ATTTCGATTC TCATTTTTCC AATCGTCAGC CGAGAAAATC GCGCCACTTG GGAAAGATAC
TCAGTAGAAA ATCAACGCTG GCTTGAAGAA GGTCTTGCTC TGCAAAAAAT CGTTAAGGAT
GGAGATGAAG AAGAGGCATT GATGCAGTTG GAGGAGCAAG TGGTAGCCGG AAACTTGGAT
GTCGACCCCT TCGCTCACTT GAATATCCCA CCATTCATTT TCAAAGTCGA GGAAGGTGGC
ACAGCAGCTG CTTACGAGAC GGGTCCAGGG CCTTATGCAC CAGTCTGGCA ATTGGCGCCG
GCAATTCCTG CAGCATTTTT CGTAAATTTT AACGGCCTTT CCCATCCTAG CCGAAAATTA
GAGATCAACA CCGTTCTTCG AACCGAAAAG AGGCTGGTGA GCGCAGCAGC CGATTTCTCC
AATGACAATG ATCCAAATTC AGCTGGTAGA AAGGCAGTGC TAAATCTCTT TCTGAATCGA
TGGAAAAGTG GTGGCAATGA TTACGATGAA GGGCCCGTCA GCGACATAAT CATTCCAGTA
TTTACTAGTT TTGGAGAGAA CAAGACAGTT GGCGCTCTGC TGAATTCTTA TATCTATTGG
CAGGTTTACC TCACTGACAT TTTAACGGAT GAAGCTGAAG GTATTGTCTG CGTACTGGAA
AACAGTTGCT CACAGAGCTT CACTTATCGC ATTGATGGAA AAGATGCAAC ATACATCGGA
CAAGGTGACT TGCATGACCC CAGTTACAAT GGAATGATGG TTGAGACCGG ATTCGGTGCC
GTGGTCGGAA ACAACAACGT TGATTTCAGT ATTCACGAAC ATTGTTACTA CAATCTTCGA
GTTTACCCAT CCAAGGAGAC TGAAGACAAA TACATCACGT TTCAGCCCAT CATGTTCGCT
TTGATTTTGG TGGCAGTTTT TGTGTTCACA TCCTTTGTTT TCGTCACATA CGATTGTCTT
GTCCAGCACC GCAACAGCGT TGTTAACACA TCAGCAATAC AATCCAGCTC TGTGGTTTCT
TCTCTATTTC CTGAACAAGT CCGCAACAGG TTGCACAAAG TATACAAAAG CGAAAAGTCC
AAACAGCACA ACCATACTGA CATCTTTAAA AGCATTACCA GCGACGGAAA GTCGAGAGAC
GATTTCGAGG CAGCTGACTT AAATGAGTTC GACGATTCGA CTCCTATTGC CGATTTGTAC
CCAAATTGCA CCGTTCTGTT TGCTGACATT GCAGGCTTTA CTGCGTGGAG CTCCGAACGA
GCGCCTACCG AGGTCTTTAA GCTCCTCGAG ACACTGTATG GAGCCTTTGA TAAAATTGCG
AAGAAATACA AGGTATTCAA GGTAGAGACA ATTGGTGACT GCTACGTCGC CGTGACTGGC
CTACCCACCC ACGCCACAGG ATGCCCACGC CGTTGCAATG TGCCGATTTT CCAGTTCGTG
CAATACTAAG ATGAACCAAA TGATGCATAT CCTCGTTGAG AAGCTGGGTC CGGATACAGC
AAATCTGTCC ATGAGATTTG GACTGCATAG TGGCCCGGTC ACGGCTGGGG TACTTCGAGG
TGAAAAGGCA CGATTCCAGC TTTTTGGAGA CACGGTAAAC ACGGCTGCCC GAATGGAAAG
CACTGGGCAA AAGGGGCGGA TCCACATATC GAAAGCTACC GCTGCGCTCA TTCAGAAGGC
TGGTAAGGGT AGCTGGATGA AGATTCGCGA GGAACTTGTA GAGGCCAAGG GCAAGGGAAT
GATGCAAACG TACTGGGTCG AGCCGCCGGA CTTTGGTACG ACGTCTACAG GAATTTCTAG
TAATCATGAT GTCGAGGACG CCTCAGAGAG CCAGCATCTA CGTTTTACTG CAAATGAGTT
CAAAAACAGC AAAATCGATG CAATGAGATT CAAAGAGCTT ATGGATAGCT TGAGGTACGC
GGAATCAGCA ACTACCGGCG ATTTGAATGC AGCTCTTCCA CAAGCAAATA CTTCGTCGGA
GAAAGATTGA
 
Protein sequence
MKWESDHVND THTDGDLASE NGSVCSATDE KVLAHKETKA VNILRAVTFF VLLLATATVS 
LLVFFYGRDK ENDEFLKQFG YNAGKVVDSF QVNAEKRLSA LEGFATMITS HALFANETFP
MVTLPDFERK ASYTLQLAQV ISILIFPIVS RENRATWERY SVENQRWLEE GLALQKIVKD
GDEEEALMQL EEQVVAGNLD VDPFAHLNIP PFIFKVEEGG TAAAYETGPG PYAPVWQLAP
AIPAAFFVNF NGLSHPSRKL EINTVLRTEK RLVSAAADFS NDNDPNSAGR KAVLNLFLNR
WKSGGNDYDE GPVSDIIIPV FTSFGENKTV GALLNSYIYW QVYLTDILTD EAEGIVCVLE
NSCSQSFTYR IDGKDATYIG QGDLHDPSYN GMMVETGFGA VVGNNNVDFS IHEHCYYNLR
VYPSKETEDK YITFQPIMFA LILVAVFVFT SFVFVTYDCL VQHRNSVVNT SAIQSSSVVS
SLFPEQVRNR LHKVYKSEKS KQHNHTDIFK SITSDGKSRD DFEAADLNEF DDSTPIADLY
PNCTVLFADI AGFTAWSSER APTEVFKLLE TLYGAFDKIA KKYKDAHAVA MCRFSSSCNT
KMNQMMHILV EKLGPDTANL SMRFGLHSGP VTAGVLRGEK ARFQLFGDTV NTAARMESTG
QKGRIHISKA TAALIQKAGK GSWMKIREEL VEAKGKGMMQ TYWVEPPDFG TTSTGISSNH
DVEDASESQH LRFTANEFKN SKIDAMRFKE LMDSLRYAES ATTGDLNAAL PQANTSSEKD