Gene PHATRDRAFT_26061 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_26061 
Symbol 
ID7197853 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011672 
Strand
Start bp393640 
End bp395512 
Gene Length1873 bp 
Protein Length552 aa 
Translation table 
GC content55% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002178499 
Protein GI219115407 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTCGTTTGCG AATTAGAGTC ACAAGCGTCC ACTTCCACTA TCACCATGGC TTCTCTTTCT 
CTCCCCCCTC CCCGCAATAT GTCGGAAGCG CCTGTTGTGT CCCGCAAAGC CACCACAGAA
CAAAGTCCGG AAGCCCCGAA GGCTGATCAC TCGCGCCAAC CGGATTCTGC CGCTACCACT
ACCAGCAAAC CATCTCCTCC CTCCTACGCC GAACGATGCC GGGCCGCGCT CGCGCTCCAT
ACCATGCCAC CCGAAGAACG ACGTAAAAAC AAGCTGTTTG TCCCGCGTTC ACTGGACGAT
TTCGGCGACG GCGGAGCGTA TCCGGAAATT CACGTGGCGC AGTACCCGCG TCACATGGGC
AACCCCCATC GGCACTCGAA GCACGCGAAC GCCTCGGGCG GCACGTCTCG CGGTGCACAG
GCAATATCGA AGGCACTGGT AAATGTAGAA ATTGATAAAG ATGGTAAGGT ATCGTACGAT
GCGATCGTCA AGGGGGGGAC AAACTCGGAT AAGATTGTGT ATTCGCGACA CGCGGATTTG
CGAGGCGGAT CAGCCAAAGC GGAGGACATT GCGTTGCCAA CAGAAGAAGA GGAGCAATCC
GAAGCAGCTC GGACACAAGC GGCACTCGAT GCAATATTGG GTAAGAAAAC CGCGTTGGAC
AATCCTTCTG GCAGCGCAAT CGTCAACGCC CAAACGTCCC AGAATGTGGA AGCGAAAACA
TCCTTTATTA AATACACCCC ACGACCCGAC GCTCCCGGCT ACAACCCTGC CGCTTCACAG
CGAGTTATTC AAATGGTGCC AGCCAAGGTT GATCCCATGA TGCCTCCCAA GCACAAGCAC
ATTAAAGCTC CGGCTGGACC GGCCGAAGAT CCGGTTCCGG TCTTGCACGC ACCACCTAGC
AAATTGAGCA AGGAAGAACG CGAGGCATGG AACGTACCCG CGTGTATTTC CAACTGGAAG
AATACCCGAG GCTATACGAT TCCGCTCGAC AAACGATTGG CGGCGGATGG GCGAGGCCTG
CGTGAGCATA CGATCAATAC CAATTTTGCG ACGCTTTCCG AATCCCTGTA CGTGGCGGAA
CGCCAAGCTA GGCAGGAGGT ACGCATACGG GCGCAAGTGC ACAAGAAATT GGCTTTGCAA
GAAAAAGACA AGCGGGAAGA TGAGCTTCGG CAGCTGGCGA ACCAAGCGCG TCTGGAACGG
GGTGGCGGAG GCGGAATGCC TGCGGCGGCC CAGCCATCAC GCGACCGAGG GCACATCTCC
GATGCGTCAT CGGATGATGC AGAAAGTATC GACCATCCTC CCCCGGCAGC TGCGCAAGGA
GATACGGAGG ATGATGTGGC CGCGCGTCAG CGAGAAAAAC TTCGCTTGGA ACGAAAACGG
GAAAGAGAAC GAGAAATGCG TATGGAAAAC AATATGGAAC TCAAGAAGCA AAAGTTGGAG
CAGGAACGTG ACGTGTCGGA AAAAATTGCT TTGGGGGTAC ACACGGGTAC AGGTGGCTTG
GGAGGCGATG TGGATTCACG TCTTTACAAC CAATCGGCGG GTATGGATTC AGGGTTTGGC
GCGGACGACG AATACAATGC GTATTCCAAG CCTTTGTTTG CACGCCAAGC CGCGGCGTCG
TCGGCATCCA TTTACCGTCC GACTCGGGGC GACACGGCCT ATAATGCGGA TGAACAATAC
AGCAAGTTAC AGCAAGGGGC TACCTCCAAG TTTCAACCAG ACAAGGGTTT TTCTGGGGCC
GAAGGTGGTG TCTCTGGGGC TGGAACCACT CGCACAGCTC CTGTTCAGTT CGAGAAAGGC
GATCAAAAAT AGTTCACGAA ATTTTATCAT CTTTTGTTTA CTAGTTTTTA AAAGGATAGA
TTGAGGAGGG GCT
 
Protein sequence
MASLSLPPPR NMSEAPVVSR KATTEQSPEA PKADHSRQPD SAATTTSKPS PPSYAERCRA 
ALALHTMPPE ERRKNKLFVP RSLDDFGDGG AYPEIHVAQY PRHMGNPHRH SKHANASGGT
SRGAQAISKA LVNVEIDKDG KVSYDAIVKG GTNSDKIVYS RHADLRGGSA KAEDIALPTE
EEEQSEAART QAALDAILGK KTALDNPSGS AIVNAQTSQN VEAKTSFIKY TPRPDAPGYN
PAASQRVIQM VPAKVDPMMP PKHKHIKAPA GPAEDPVPVL HAPPSKLSKE EREAWNVPAC
ISNWKNTRGY TIPLDKRLAA DGRGLREHTI NTNFATLSES LYVAERQARQ EVRIRAQVHK
KLALQEKDKR EDELRQLANQ ARLERAAQGD TEDDVAARQR EKLRLERKRE REREMRMENN
MELKKQKLEQ ERDVSEKIAL GVHTGTGGLG GDVDSRLYNQ SAGMDSGFGA DDEYNAYSKP
LFARQAAASS ASIYRPTRGD TAYNADEQYS KLQQGATSKF QPDKGFSGAE GGVSGAGTTR
TAPVQFEKGD QK