Gene PHATRDRAFT_44794 details


Gene Information

Locus tag: PHATRDRAFT_44794
Symbol:
ID: 7199752
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011673
Strand:
Start bp: 263939
End bp: 267218
Gene Length: 3280 bp
Protein Length: 973 aa
Translation table:
GC content: 45%
IMG OID:
Product: predicted protein
Protein accession: XP_002178738
Protein GI: 219115886
COG category:
COG ID:
TIGRFAM ID:
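
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the coordinates are 1-based and inclusive (which is what makes 263939 to 267218 come out at the reported 3280 bp) and that the coding sequence is counted with one stop codon.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def coding_bases(protein_length_aa: int, include_stop: bool = True) -> int:
    """Bases needed to encode a protein, optionally counting one stop codon."""
    return 3 * protein_length_aa + (3 if include_stop else 0)


gene_len = span_length(263939, 267218)  # values from the record
cds_len = coding_bases(973)             # 973 aa from the record
print(gene_len, cds_len, gene_len - cds_len)  # prints: 3280 2922 358
```

Under these assumptions, 358 bp of the gene span does not code for protein, which is consistent with the gene being flagged as spliced. The record itself does not list exon or intron boundaries.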


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTTGGATAAT GCATCGCATA TCGCTGAAAA GAAAAAGCTC GGAATTTTCT ATTCTGTTAA 
CAAGGAGAAG TAACGACTCA TGCTTTCGTA CCAACCACCT TAGCAACCTC AACTTTGCTA
CTTTCTCGCT CACGCGGATC TGTCAAAACC GAATCCTGGA AGGCTACTTT CAAGCTACGA
AATCTTCCCA GCGAACTCTC AGAACGTATC AGTCGTCTTC TTACGCATCA CTAGGCTTGC
ATGAGGATAG CCATCACTCG AATGCAGAGG ACGTGATCGG TGTTAGTTTC GGCAATTTGC
CTCAACAAGG TGCCGATGCC GTCGAAAATA GCAAATTCAA ATTTCGAGCA AAAAATAGAG
CTAGCCAAAG TATGACAGAA CAGATACCAA AGTCAATTGA AGAGATCATT GTGTTTCACG
AAACTCTTCA CAATAGTCAC TTGCGAATAC ATTGGACCAA GGAGCGAGAG TCTCTCTTGC
GGCGGGAGCG ACGGAAGCTG GGGACGACGA ATCAACCAGC TGAAGAATCG ATGGATAGAT
TTCAATCTAC TGGCTCGATT CTGACTCACG CAACAGAACA CGAAGAGGAG TCTCCACAGG
TTTCTTTACT ACCGGACCCT TCACTCTCGA CAACGTTTGT CCCTCTTACA AAAAAGGAAC
CACCTGGAGA CGAGAAATGC GATAATTCCT TCTCTCATCA AGCTCTGAGC AAAAGCGACG
CCTCTTTTGA TGGCTTGTAC GACTTGGAGA CGATCGCTTT TTTAGACTCC CTTGTTTCGT
TTCGAGACGA AAATGAATAC GACGGAAATC TCTCTTCGAT GGCAGTCAAG CCTTCGTTGA
TTGGTAGAAC AAGCGTTTCG AATGTCTGGC CACAGAAGAG GAATTTGTCT CCAGAAAGCT
CACAGCCAGC TAAAAGAAAG AGTATTGAGC AGCATGAGCC TTTTGTTGCT AGCGCGAATA
TTACAAGTAA GCCGGTTTCC CCCAATGTAG CCGCTCAAGA TGCTCGAGTG GAAGCTTCCC
TCCTTCCCGA AACTGGGTCT ATCATTAACA ATGATGAAGG GACTGGGAAG CAGTTTCCTA
AATTTTCTGA CGACGAGACT CCATCATTAA ATTTGCTATC TACCCAGCAA GAAATGCTGT
TTGCTGCAGT GTCACTTTTA CGCGCAACTA GCGACCGAGA ATGGAAACGC TTTGACCAAT
TTGTCGAATC AGATGATGAG GATGCAGATT CTGCCATCGA TAATGCGAAT GAGAATCTTC
GAGATGATAC TATTTCATTC AATTCGATTG ATGACAGCGC TAAAGCAAAA GACAAATATC
TACGTGGGTC GATAGAGAGC TACTCGATTG ACGATATCAC TGAGGATGTC CTCTTAGGAA
ACTTGATGTT GTCTACATTG GAATACAATT TGCTTTTAGC AAATTTGGCA CTGTCGCCAA
CACATTCCAC AGACGATGCT TTTAGTTTAC TCATGCGTCT ATATCGACAT GTGACGAAGC
TGAGCGAAAT GGGTGTAGGT GCATGCACTC CTGACGGTCT CACATATGAA ATTCTTATTT
TGACTTTTGG TCGAAGACTA CAAGCATACG CTGCAGGAAT GGATTTAATA AAGGAGATGA
TGGATACTTC TCGATTTACC CCCCGAGCAC TGCTTGCAGC CTTTGAACTT TGCCGCCAGC
GATCAGATTT GAATCTCACT AAGGAGATCC TCCAACAAGC AATCTCAGAC GAATCTAGAT
CTTTTCCAAT TCCAAACAGT GTGTATTCCA CCTATCTCAG CATGTTGAAA CCAGAAGATG
CAACCGAAGA GGCTTTGCAG GTGTTGCAAA CCTGTTTAAA AGTAAGTGTG TTACCAAAAA
AAACATGTAG AAGTTCGCTG CAGTCAGGCT TGCTAACGTG CCTTTGATTT GAAAGGAAAA
CAAGCGTACG AAGGATAAAT ATGTCGACGA AGTTTTCAAG ACAGCAATAG AGTGGCCTCA
TCGAAACACT AAGGGGACGA CAACCGATAG TACTGCTTTC CTTGGCCACA TTATAAATCT
ATTACAGGAA GAGGCCATAT ACCGACCAAG CATTCACGTT TGGACAAAGC TTGTCCACAC
TTTATGCTGG GGATCGCATC AAAATGAGAA GAGGAGAAAT CTTTTAGGAA ATGTTTTTCG
ATTTCTTCTG TCTAAATGGA CCGACTTCGT TCCAGATGCT CGGTTGCGCC GAATTGGATT
GGATCTGAGT CAGCAGATTC CTGATCCCAA AATGGCTCAC GATTTGATTC AAACTGTCTT
GAAGCACGAA GTGTTGAAGC AGCGTCACAC TTCGGCCTCT AGAAGGAGCT CCGGGGAGCA
AATACGACAC GGCACATCAG CTAGCTCCTT GCTCTTATCA CCGACAGAGG ATGAACGAAA
CAGGGGAAAG AATAGTTTCC GCGGGTACTC CGTGCCGTCG GCAGATGTGA CGAAGGCGAT
GGAAATTTGC GCTCGCTGTG ACGAAATGGA CAAATGCGAA TCTATCCTTA AAAAGATTGA
CAACCTTGGT TATGCAGTCG ATCCGGCCCT CCATAGTACA TTGTATAGTA TGGTCCTGAA
AGGATATGCC AAAACCGGAA ACACAACAGC CGTGGTTCGA CTGTTGTCAC ACATGCGGGT
ATCTGGGATG AAATTGAGGT AAGCTTCTGA AAACCAAGAC GCTTAGGGCA CCTATGTAAA
CGTCGAAAGC ACTCTCACCA TTATATCTTT ATGTTTAACA GTGATGAGTT GTATGGCACT
GCAATCCATT GCTACGCCGT TTCAAACCAG GCTGAAGAAG CGTGTGCGCT ATTGGAATGT
ATGAAGTCAA ATTCGTTCAA TGACGGCGTA AGTCCAGGTG ATGCTTGCTA CAACGCACTA
ATTCTTGCGT ACATCCAAGG TGAAGAATGG GACGCTGCGT TATCAATCTT TGCCGAAATG
AAAAATTTGG GAATTTCTCC TGATCCAACT ACGTCACATG GCCTACTGTT GGCGTTCTTC
AAATCAGGTG GGCTCTCGAG CGCAGCAGAG TTTGTCACCA CTCTGCTATC GACGAAGGCT
GGTATCAACG GACAGACATG CACTCTTGCT CTTCGCTTCT TTATTCCTGA GCTACAAGCC
TGCTCTGATA CAGCTTCCAT GCGCAAAAAG CTCCGCGAAC TCGGTATGGT AAGTTCTCGG
GATGAGGATG CTATATTGTT AGACTTAGCA CGTTCTGTGC GCGTCGCAGA GCTGGAGGAA
GGTCGGAGTA TTTCAAAAAG TCTTCCTGAG GAAGTATTGA
 
Protein sequence
MHRISLKRKS SEFSILLTRR SNDSCFRTNH LSNLNFATFS LTRICQNRIL EGYFQATKSS 
QRTLRTYQSS SYASLGLHED SHHSNAEDVI GVSFGNLPQQ GADAVENSKF KFRAKNRASQ
SMTEQIPKSI EEIIVFHETL HNSHLRIHWT KERESLLRRE RRKLGTTNQP AEESMDRFQS
TGSILTHATE HEEESPQVSL LPDPSLSTTF VPLTKKEPPG DEKCDNSFSH QALSKSDASF
DGLYDLETIA FLDSLVSFRD ENEYDGNLSS MAVKPSLIGR TSVSNVWPQK RNLSPESSQP
AKRKSIEQHE PFVASANITS KPVSPNVAAQ DARVEASLLP ETGSIINNDE GTGKQFPKFS
DDETPSLNLL STQQEMLFAA VSLLRATSDR EWKRFDQFVE SDDEDADSAI DNANENLRDD
TISFNSIDDS AKAKDKYLRG SIESYSIDDI TEDVLLGNLM LSTLEYNLLL ANLALSPTHS
TDDAFSLLMR LYRHVTKLSE MGVGACTPDG LTYEILILTF GRRLQAYAAG MDLIKEMMDT
SRFTPRALLA AFELCRQRSD LNLTKEILQQ AISDESRSFP IPNSVYSTYL SMLKPEDATE
EALQVLQTCL KEEAIYRPSI HVWTKLVHTL CWGSHQNEKR RNLLGNVFRF LLSKWTDFVP
DARLRRIGLD LSQQIPDPKM AHDLIQTVLK HEVLKQRHTS ASRRSSGEQI RHGTSASSLL
LSPTEDERNR GKNSFRGYSV PSADVTKAME ICARCDEMDK CESILKKIDN LGYAVDPALH
STLYSMVLKG YAKTGNTTAV VRLLSHMRVS GMKLSDELYG TAIHCYAVSN QAEEACALLE
CMKSNSFNDG VSPGDACYNA LILAYIQGEE WDAALSIFAE MKNLGISPDP TTSHGLLLAF
FKSGGLSSAA EFVTTLLSTK AGINGQTCTL ALRFFIPELQ ACSDTASMRK KLRELGMSWR
KVGVFQKVFL RKY
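
The sequences above are displayed in space-separated blocks of ten residues, so they need to be cleaned before the reported length or GC content can be checked. A minimal sketch follows. The helper names are hypothetical, and the short `example` string is just the opening bases of the gene sequence, not the full record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


# First three ten-base blocks of the gene sequence, with a line break added
# to mimic the page layout.
example = "TTTGGATAAT GCATCGCATA \nCAAGGAGAAG"
seq = clean_sequence(example)
print(len(seq), round(gc_fraction(seq), 2))
```

Pasting the full gene sequence into `clean_sequence` and comparing `len(...)` and `gc_fraction(...)` with the Gene Length and GC content fields gives a quick consistency check on a copied record.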