Gene PHATRDRAFT_26980 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_26980 
Symbol 
ID7200054 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011674 
Strand
Start bp1011821 
End bp1013918 
Gene Length2098 bp 
Protein Length551 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179336 
Protein GI219117083 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.24942 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTCGTCTTTG CTGTTAAGAC GCACCAAATC GTGTACGCTT GTTGTTCTCT GACTGATTGA 
AGTACAATGA ATAACGCAGC GGCCTCAGGA CTGGTACGTG TTCCGCAGGT TGTCGGCTAG
TCGTGTAGGT CGCGAACGTT CCGTTTGGTG CGGGTTTATC CGTTTCGTCG TTGAGTGCGT
TGTGGGTTTG CGCCTTGATT TGCCTGATGC AAGTATTCCA AAAATAGAAT CTTCCGTGTT
GCTGGCACAT GTCATCAAGC CCTGGAAAAA AATTTAACTC CAAAAGTGTA CCTTTGCAAT
AGATTTGCAC AAACGTTCGC ACACACCCTC ATTGTTGAGA TTTTCATTGT CTGCAGTTTC
TCGGGGGTAC CCGCGAATCG GGCGAAGACG TCCGCGTTGG GAACGTGACA GCCGCAATTG
CGGTTGCCAA CGTTGTAAAG TCGAGTTTGG GACCCGTCGG TTTAGATAAA ATGCTCGTGG
ACGACATTGG GGATGTTACG ATTACCAACG ATGGTGCGAC TATTTTGGCT CAGCTCGAAG
TCGAGCATCC CGCGGCGCGC CTTTTGGTGG ATCTGGCACA GCTCCAGGAT AAAGAGGTTG
GTGACGGAAC GACGTCGGTT GTCATTATCG CCGCGGAGCT TTTGCGAAGA GGAAACGATC
TCGTGAAAAA TGGAATTCAT CCCACTACCA TTCTCTCGGG TTATCGTTCT GCGTTAAAGG
CCGCGGTGGC TTACATCAAA AGCACTATGG TGGTACCGGT ATCCAAGCTG TCGGACGAGC
ACCTTTTGCA AGCTGCTCGC ACCTCCATGT CCAGCAAACT CATCGGCAAG GAAGGAGACT
TTTTTGCACA GCTTGCTGTC GATGCCGTCA AGAGTGTAGC TACGATAAGT CCCTCCGACG
GCAAGGCCAA GTACCCCCTG TCGGCTATCC ACATCCTCAA GGCACACGGC AAGTCAAGCT
TGGATTCACA CCTTATGCAA GGTGGTTTTG CCCTCCTGGG CACTCGAGCT TCCCAAGGTA
TGCCCTCGAC GATTGATCCT GCTGACGGTG AATCCGATGT CAAAATTGCC ATGTTGGACA
TGAACTTGCA GCGTCACCGC ATGGCAATGG GCGTACAGAT CCAAATCACG GATCCCAAAG
AAGTCGAAAA CATCAAGAAA CGGGAGCTCG ACATTACCAA GGAAAAAATT CAAAAAATTC
TGCAAACGGG AGCCAAGGTC GTCCTCACCA CCAAGGGTAT CGACGACACG TGTATGAAAT
ACTTTGTCGA AGCAGGTGCT CTGTGCGCAA GGCGGTGCAA CAAGGAGGAC TTGAAGCGCT
TGGCCAAGGC AACCGGCGGT AAGCTAGTTG TCACCCTGGC CGACATGGAG GGTGAAGAGT
CCTTTGATGT GGACTCGCTT GGTAAATGTA CGTCGGCTGC TGAAGTCCGT GTAGGCGACG
GCGAGATGCT ACATTTTTAT GGTTGCAAAG GGGCCGGGGC ATCCACGATA GTACTGCGAG
GAGCGAACGA ATACATGCTA GATGAAATGG ATCGGGCCTT GCACGATGCG CTTTGCGTCG
TTAAAAGAAT GCTCGAGTCG TCTACCCTGG TTCCAGGTGG TGGAGCTGTG GAAGCGGCCC
TATCCGTATA TTTGGAGCAA TTTGCGGAAA CACTGGAAAC ACGAGAGCAA TTGGCTATCC
AAGAGTTTGC CGACGCATTG TTGGTGATTC CCAAAACTCT CGCTGTCAAT GCGGCGAAAG
ACAGCTCCGA ACTTGTCGCC AAGCTTCGGG CGGTTCATGC CAAGCACCAG AAAGCTGAGA
ACCCCACGGA TACCGATTAT CAAAATTTTG GACTGGATCT AATAAATGGT GAAATTCGCA
ACAATCTTTT GGCCGGTGTT GTGGAGCCCG CGATGTCAAA GATCAAATCC TTACGCTTTG
CCACCGAAGC TGCGATAACG ATTCTACGTA TTGACGACCG CATCACAGTG TCGGAACAAG
GATAACCTAA TGGCTAAAAA AGATGAAGAG ATCCAGCAGC GTATACATTT CCACCATTTT
GCTAAATGGT TGTCCATTTC GACGGCACGG TCATAAAGTA GCGCTCCATT GCATTTGT
 
Protein sequence
MNNAAASGLF LGGTRESGED VRVGNVTAAI AVANVVKSSL GPVGLDKMLV DDIGDVTITN 
DGATILAQLE VEHPAARLLV DLAQLQDKEV GDGTTSVVII AAELLRRGND LVKNGIHPTT
ILSGYRSALK AAVAYIKSTM VVPVSKLSDE HLLQAARTSM SSKLIGKEGD FFAQLAVDAV
KSVATISPSD GKAKYPLSAI HILKAHGKSS LDSHLMQGGF ALLGTRASQG MPSTIDPADG
ESDVKIAMLD MNLQRHRMAM GVQIQITDPK EVENIKKREL DITKEKIQKI LQTGAKVVLT
TKGIDDTCMK YFVEAGALCA RRCNKEDLKR LAKATGGKLV VTLADMEGEE SFDVDSLGKC
TSAAEVRVGD GEMLHFYGCK GAGASTIVLR GANEYMLDEM DRALHDALCV VKRMLESSTL
VPGGGAVEAA LSVYLEQFAE TLETREQLAI QEFADALLVI PKTLAVNAAK DSSELVAKLR
AVHAKHQKAE NPTDTDYQNF GLDLINGEIR NNLLAGVVEP AMSKIKSLRF ATEAAITILR
IDDRITVSEQ G