Gene PHATRDRAFT_27039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_27039 
Symbol 
ID7200690 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011675 
Strand
Start bp203734 
End bp205874 
Gene Length2141 bp 
Protein Length634 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179814 
Protein GI219118063 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.384955 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TCGAATAAGT GTCCTACTTG AGCGAGACCT GCTACTGTGG CAAGATGTTA CAGTACGACG 
ATAACGGATT TTATTTCTTC GCGCTGAGTA CGCTCAGCTT TTACTTGGTA CCTTGTACGT
TCGAATGGGT ACATCGGTTG GTCGCGGTCG ATGTCCGCCA TTCGTCACTC ACGTAGGTCA
CCCGTTCCTT GTCTTTTCTC TCTCTCTCTT TCTCGCTCTC CCACAGCCTG GTATTCCATT
CTACAAAAAG TGTTCAACGC CTTTTGGGTC AACGATGAAA AGATTGGTGC CGTCGCGCGG
ACTTCCGCCG AACAGAAAAA GGCCGATCAG CTCAAAAAGT CGCAAAAGGG CATGAGCGTC
CTGCATTCCC AAGGCTTCCT CATCAACGTC GGTATTACCC TCGCGCTCAG TATGCTCTTC
GTGTGGCTCT TGTTTATGGT ATCGCAGGAC GGCGAAGTCA ACTCGTTCGA TCCCTTCTCC
ATTCTCGAAA TTGATCACGG CTCGGACTCG AAATCGATCA AAAAGGCGTA CCGCAACCTC
TCGCTCAAAT ACCATCCCGA TAAGAATCCC GGTAACCGCG CGGCGGAAGC CAAATTCATG
ATGGTCAGTA AGGCCTACGA AACATTGACG GACGAAACGG CCAAGGAAAA TTACGAAAAG
TACGGCAACC CGGACGGCAA ACAGAGTTTG GAAGTGTCCA TTGGATTGCC GTCGTTCTTG
CTCGACACCA ACAACCGTAA TCTAGTCCTT ATGGTGTACC TTGTCATCAT GGTGGGGGTC
ATTCCCTTTT GCGTTTGGAC CTACTACAGT GATTCCTCCA AGTACGGAGA AAAGGATGTC
ATGTACGATA CCTATTCGTG GTTCCATCAC ACTCTCAACG AACACACGGT CGTCCGAGCC
CTCCCGGAAG TCCTCGCGGG TTCCGCCGAA TTCCGCAAAC GCAACATTCC CCGTGACGCG
GACGATAAAA AGGCCGTTTC CGCCGCCGTG ACCAACGTCA AATCGCTCAT GCCTAAACCC
AAGTACAATC ATCCCGTCTG CGTCAAGGGC AACGTACTCA TGCATTCCCA TCTTTTGCGC
CAAGACGTCG CCAAAGTGCA CGAAGAAGAT TTAAAGTACA TGCTGCGCTA CTCCACTGCA
CTGATTGATG CCATGATTTC CGTCTGTAAG CATCAAGACT CAATTCAGAC GGCGGCTAAT
TGTATTGAAT TCGGACAGTA CGTGACCCAG GCCATGTGGA CCAAGGATTC GCCGTTGTTG
CAGCTACCGC ACTTTACGCC GGCAGAAGTA GCACACGTGG ATAAAGGCAA GGTCAAGATT
GGAACGGTCC AAGAATATCG CGCGCAGGCG GAAGACCAGC GCAAAGGCAT GGCCACATTT
TCCGACTTGC AGAAGAAGGA TATCGCCAAC TATCTCCACA TTTTCCCGGA TATCACGGTT
GAATCCAAAG TTTTTGTGGA CGACGACGAA GATGACAACG TGTACGAAGG GGATTTGGTA
ACCATTATGG TTACAATAAC TCGGAACAAT CTGGCAGACG GTGAAAAGGC GGGTCTCGTG
CACGCACCCC GATTTCCCTT TCCCAAGAAG GAAGCTTGGT GGATTATTTT GGGGCAACTT
AAGGAGGGCA AGATCATTTC GATTGATAAG GTTGGTAATT CCAACAAGAA GGTGCAACAC
GCCATCAAGT TCTTGGCACC GCCGCAGGGT ACGTACGAAT TCGATCTACT TATCAAATCG
AACGGATACG TGGGTGTCGA CCAAAAATTG AAAGTAGACA TGACCACATT GGACAACTCG
GCCTTACCGG AATACAAGGT GCATCCGGAT GATGCCGAGC TGGACGATGA GCCGACACTG
TTCGAGGAGA TGCTGAACGC CCACATTGAG CAGGATTCGG ATGATGATGA TTCGGACGAG
GAAGATTCCG ATGACGAAGA TCAGCCACAA ACAGAAGCCG CCAAGAAAAA GGAGCAATTG
CGAAAGGCAC GGCAAGCTGA CAAAGATGAC GACGACGATG ATTCGGATGA CGAAGCGGAA
GAGGTGTACG CCGATAAGTA GACTCCTCCG GCGTTACACT TCCTATTTCG GCCCGATACA
TTTCTAACTA TTAAGAACTT ATAGGTTGGA TTTACATATA C
 
Protein sequence
MLQYDDNGFY FFALSTLSFY LVPSWYSILQ KVFNAFWVND EKIGAVARTS AEQKKADQLK 
KSQKGMSVLH SQGFLINVGI TLALSMLFVW LLFMVSQDGE VNSFDPFSIL EIDHGSDSKS
IKKAYRNLSL KYHPDKNPGN RAAEAKFMMV SKAYETLTDE TAKENYEKYG NPDGKQSLEV
SIGLPSFLLD TNNRNLVLMV YLVIMVGVIP FCVWTYYSDS SKYGEKDVMY DTYSWFHHTL
NEHTVVRALP EVLAGSAEFR KRNIPRDADD KKAVSAAVTN VKSLMPKPKY NHPVCVKGNV
LMHSHLLRQD VAKVHEEDLK YMLRYSTALI DAMISVCKHQ DSIQTAANCI EFGQYVTQAM
WTKDSPLLQL PHFTPAEVAH VDKGKVKIGT VQEYRAQAED QRKGMATFSD LQKKDIANYL
HIFPDITVES KVFVDDDEDD NVYEGDLVTI MVTITRNNLA DGEKAGLVHA PRFPFPKKEA
WWIILGQLKE GKIISIDKVG NSNKKVQHAI KFLAPPQGTY EFDLLIKSNG YVGVDQKLKV
DMTTLDNSAL PEYKVHPDDA ELDDEPTLFE EMLNAHIEQD SDDDDSDEED SDDEDQPQTE
AAKKKEQLRK ARQADKDDDD DDSDDEAEEV YADK