Gene PHATRDRAFT_14922 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_14922 
Symbol 
ID7203686 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011685 
Strand
Start bp530708 
End bp534010 
Gene Length3303 bp 
Protein Length572 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182978 
Protein GI219125416 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTCGCTG CTCTGATCTC CGATCGTCTC GGCGCTGACT CCATCATGCT CGCGGCCGTA 
ACGGCATTCA TGGCAGCCGA AATCATTACC ATTCGCGAAG GCTTGGCTGG ATTTTCGAAC
GAAGGCTTGC TGACCGTGTT GGTTTTGTTC GTCGTAGCGG AAGGCATTTC CAAAACGGGT
GCGCTCGATT GGTACATGGG TAGGCTGCTC GGTAATCCAC CCACCATTGC GTCGGCACAG
TTGCGCCTCA TGGCGCCGAT TGCTGTGGTT TCGGCATTCT TGAACAACAC TCCGGTTGTT
GTTGTCATGA TTCCTATCGT TCAGCGCTGG GCCAAGCAAA TTCGCGTTTC TCCGCAACAA
TTGTTGATTC CGCTCTCCTT CGCGTCCATT TTGGGCGGAA CCTGTACACT GATTGGTACG
AGTACAAACT TGGTGGTTCT CGGTTTGCTG GAAGAGCGGT ACCCGGATGA TCCGGACGTG
GCGATTGCTT TGTTTTCGCT GGGAACGTAC GGCGTTCCGG TGGCCTTGAC TGGCATTGCC
TACATTTTGT TGGCGTCGCC GGTGCTTCTT CCGGGCGGAC AAGGCCAAGG CGGCTCGAGT
CCGCTAGAGA ACAACGAAGA TGTTTTACTA GGCGCTCGGC TGACGCAATG GTCGCCAGCG
GCTTCGCGAA CAGTCAAGCG CAGTGGGCTG CGTGATACTG GAGGTATATA CCTGGTGTCG
GTCCATCGGG CGGCCACCGG TAACGTCCAT CGGGCCGTCT CGAACGATTT CGTCCTCAAC
GTGGGTGACA TCCTCTACTT TACTGGATTT GTCGAAAGCT TTGGCGAGTT TTGTGAGGAG
CATGGACTCG AAGTCGTAAC GAACGAAGTT GAGACGTGCT TACCGGAAAC CCAAACACAC
GAGACAAGGG ATACAGTGAC GGATCAAGTT CTTTGGGAAT CCCTAGACCA ACAGAGGAGC
GAAACGAGCA TCGAACGCAA TCACGGCTTT TCTATGCTGA CAAAAAGCTT AGGATCATTG
GATACCGTTG CTGAAGATCC AGACGCGATT CCGGTTGAAG TTGGAATGAC CAAAGAATCC
TTATTGCGTG CAGACGAAGA CCAGCGACTG CGGAGCATCA ATCGAATGAC GGATCTGATT
CGAGACGATG CACCCTCAAA AGACAACAGC ATACTCGATC CTAAACGGGA TAGATTGGTG
TCGGAAAGGC TGCGAGGAGG CGATCCAGCG AAGATTGTAG TGACAATTGA TAAGGATCTA
GTGGTCGTCG GAATCAACGT AAAAGATCGT TCCGGACTTA TGCTGGACAT TTCCAAGGGA
CTCTTGCGAC TCAACCTACA ACTACACCAT ACCGAAGCTG CTGTGGTTGG CGATCGCTCT
ATTTCTATCT GGCGCTGTGA AGTCATCGGT ACCGAACTAC CGGATTTGGA AGAGATATGG
TCTGTAATGA ATGCTCTTTT GTCGATAGAA GGTGGAATTG CCACAATTAA ACAGCGAGGA
CTTCGCGTGA TTCGAGCTCG GGTGGTACAA GGATCACGAT TAATAGGACG AAAAGGGGCC
GACATAGATT TTCGGAAGCG CTACCAGGCT GCTATAGTGG CTCTGCAAAA TAACGGCAAG
AACAGCACTC AGCCCCTTTC GCAGGTCAGC TTTGATGTTG GGGATGTTTT GGTTCTGCAA
GTCGGGGAGG CTTCCCCGCT GCTCCAAGTG CCTCCGTCAA ATTTCTACAA GGGTCGCACA
GATAATTCCA GGACGGATGA GAACGTATCG CGAAATTCTT CAGTAAGAAA TCTGGTGAAT
ATGGTCACCT GGAGAAAGGC TAGTACAGAC AATTTGGAGG CTATGGACAA GTCACGGGCA
GGCCGGGTTT CGGAACGAAT GGAAGGTGCT ACTCCCACCC AAGACGACGA CTGCTTTATT
GAATCGGAGG GTTCTGAAGT TGCAATTGAA CGAAACGGCG ATGAGGAAGA TCCTGCCGTT
GTTGATATGC CTGGGGCAAT GCAACAAATC GAAGAACAGG AGGTTGTTTG GAAAGATCTA
CAGCTCCTCG TGCCTGACGA AAGGGTACAT AGCGGTGAAG GAGCAGCTCG CGAGTTTCTC
ACCGCGATGC AAGTTGCCCC AAAATCCAAG TTGTCGGGGA AAACCGTTGC AAAAAGTGGC
ATCGACAAGC TTCCAGACTT GTTCTTGGTT AGTATCGAAC GCCCCATCTC TGCAGGGACC
TCTTTGCCAA CGAAGACCAA AAGACTATCA GTGATGTCTG GCGCATCTGA TGCGCATTCT
CTGGGAGAGG ACAGCAATCA GCGCCTTGGC TCGATTCAAA CAGACAATCA GGCATACCAA
TCCATTGCTC CAGAGGAGCC CCTTCAGCAC GGAGATGTTC TATGGTTCTC CGGCTCTGCA
TCGTCCGTTG GCGATCTGCG CAAGATTCCA GGATTGATCT CGTATCAAAA CGATGAGGTG
GAGAAAATCA ACGAGAAGGT GCATGATAGA CGTCTGGTTC AGGCTGTCAT TGCCAGAAAA
GGACCATTGG TCGGGAAGAC TGTGAAGGAG GTCCAGTTCC GGAAGCGGTA TGGAGCCGCG
GTGATTGCTG TACATCGCGA AGGCAAGCGT GTGCACGAGC ATCCGGGGAA CGTGAAGTTG
CAAGCAGGTG ATGTGCTGTT ACTGGAGGCG GGTCCTTCGT TCATCGCCAA GAGTGGTGAG
AACGACAGAT CGTTTGCTCT GCTAGCTGAA GTGGAGGACT CGGCCCCTCC TCGTTTGAGT
CTTTTGATTC CTGCGTTGTT GATCACGGCA GGGATGCTGA TTGTATTTAT GGCTGACTGG
ACGTCGCTAT TGGTTTCTGC ACTAGTGGCT TCAATGTTGA TGGTAGCTCT TGGTATTTTG
TCAGAACAGG AGGCTCGGGA TGCGGTGAAT TGGGACGTGT TTATAACCAT CGCCGCAGCC
TTTGGCATTG GTACAGCTCT TGTCAACTCA GGGGTGGCAG GAGGGATTGC TAACTTTTTG
GTTGATGTAG GTACTGCTTT GGGTATTGGG AGCGCAGGGT TGCTTGGAGC CGTGTACTTT
GCAACCTTTC TTATTTCAAA TGTGGTCACG AACAATGCAG CGGCGGCTCT GTTGTTCCCT
ATTGCATTGG ATGCAGCGGA GCAGACAGGC ACTGATCGTG TTTTGATGAG TTATGCGTTG
ATGTTGGGCG CGTCAGCCAG CTTTATGTCA CCTTATGGTT ACACAACGAA TTTGCTGATC
TACGGTCCTG GAGGCTACAA GTACAAAGAC TTCCTTGTGT TTGGAACCCC AATGCAGATC
GTG
 
Protein sequence
MFAALISDRL GADSIMLAAV TAFMAAEIIT IREGLAGFSN EGLLTVLVLF VVAEGISKTG 
ALDWYMGRLL GNPPTIASAQ LRLMAPIAVV SAFLNNTPVV VVMIPIVQRW AKQIRVSPQQ
LLIPLSFASI LGGTCTLIGT STNLVVLGLL EERYPDDPDV AIALFSLGTY GVPVALTGIA
YILLASPVLL PGGQGQGGSS PLENNEDVLL GARLTQWSPA ASRTVKRSGL RDTGGIYLVS
VHRAATGNVH RAVSNDFVLN HGDVLWFSGS ASSVGDLRKI PGLISYQNDE VEKINEKVHD
RRLVQAVIAR KGPLVGKTVK EVQFRKRYGA AVIAVHREGK RVHEHPGNVK LQAGDVLLLE
AGPSFIAKSG ENDRSFALLA EVEDSAPPRL SLLIPALLIT AGMLIVFMAD WTSLLVSALV
ASMLMVALGI LSEQEARDAV NWDVFITIAA AFGIGTALVN SGVAGGIANF LVDVGTALGI
GSAGLLGAVY FATFLISNVV TNNAAAALLF PIALDAAEQT GTDRVLMSYA LMLGASASFM
SPYGYTTNLL IYGPGGYKYK DFLVFGTPMQ IV