Gene PHATRDRAFT_22622 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_22622 
Symbol 
ID7194856 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011686 
Strand
Start bp514766 
End bp517050 
Gene Length2285 bp 
Protein Length693 aa 
Translation table 
GC content53% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183238 
Protein GI219125963 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATACACACAC GCGTACGCTG CAGGATTCGA GCCAAGGCGA CGACCACAAT GATCCAAAGT 
GCCGCACCAT CGTGCCGACG GTGGTGGGCG TTAGGGTTCG CGGCGGTGGC GCTGTGGGTC
GACTCGACGT CGGGTTGGAC AACCCAACGC CCGCCGATGG CGGGCCTGCG ACGCTTCCAG
TCATCCGCCT GGTACATGTC GTCGTCTTCC ACCGAAAAAC CCACCACCAG TGGCACTATC
AACGGCGGCG GCGGCAAGCA GAAGGCCGCT TCGCAACCGA AGAAGACAGC AGGCATCGTC
TACGACGCAC AGAAAATACG AAACTTTTCC ATTATTGCGC ACATTGATCA CGGCAAGTCG
ACACTGGCGG ACCGATTATT GGAAACGACC CAAACAGTCG CTCAGCGAGA CATGGAAGCG
CAATTGTTGG ACAATATGGA TCTGGAACGG GAACGGGGTA TTACCATCAA GTTGCAAGCC
GCTCGGGTTT TGTATCAGGC AAGAGACGGT GAAATGTAGT ACGTACCCTC TAAATCCCAA
GTTGCGGATC ACTCAAAAGT CTTTACGGAT TTGAGCCCTG TGTTTCTCAC GCAAGCTGCT
CTTGCTTTCT GTTTTCTTTG GCCTGTTTCT TTGCCGTTGT GTCTCGTGTC TCTCTCGCGA
CACACTTGTC CAGTTGCTTA AATTTGATCG ATACTCCGGG ACACGTCGAT TTCTCCTACG
AAGTTTCACG TTCGTTGGCG GCTTGTGAAG GCGCCTTGCT CGTAGTGGAT GCCTCTCAGG
GTATCGAGGC GCAAACCTTG GCCAACGTGT ACCTGGCGCT CGAAAACGAC CTCGAGATTA
TCCCCATTCT CAACAAGATC GATTTGCCAG CGGCCGACCC CGAGCGCGTC GCGGAAGAAA
TCGAAGCCAC GATTGGTTTG GATTGCAGTG GGATTGTGCA CGCCAGTGCC AAGACCGGCA
TTGGTATCGA TGATATTCTC GAACGCATCG TCCAAATGGT ACCGCCACCC CCCGCGGCCA
CCGGTGGACC CTTCCGAGCC CTCATTTTCG ATTCCTACTA CGATGCCTAC CGCGGCGTTA
TTGTCTTCTT TCGCGTCATG GACGGAGAAG TCTCGCAAGG AGACAAGGTG CGCTTCCTCG
CTTCGGATGC CGAACACGAT GTCACCGAAG TTGGCATTAT GCAACCGAAT CAAGTCCCGG
TGGATTGCCT GCATGCCGGT GAAGTCGGAT ACCTTTGGGG GAACATTAAG GATGTACTGG
ATGCTCGCGT AGGTGATACC ATCGTGCTGG CGAAAGAATA CAAGGAATCC GCCAGCAAAG
GGAAGTCGCC ACCGATCGAG GCGCTACCCG GATACGCGGA CTCCGTTCCC ATGGTGTATT
GCGGTCTGTT TCCCGTCGAT GCGGATCAGT ACGAATCCTT GCGTGACGCC CTCGGAAAAT
TGCGTCTAAA CGATGCCGCG TTATCGTACG AACCGGAAAG TTCAGGGGCC ATGGGGTTTG
GCTTTCGCTG CGGCTTTTTG GGATTGTTGC ACATGGAAAT TGTTCAAGAA CGGTTACAAC
GTGAGTACGA TATTGATTTG ATCGTTACGG CACCCTCGGT CGTCTACAAG GTGAAAAAGG
AACACCAGGA AGAATTTTTT ATCGATACTC CGGCGAAAAT GCCCGACTTG GGACGCAACG
ATGTTGCTTT AGAACCTTAT GTACGGATGG AGGTGTTGAC CCCCAGTGAA TACAATGGAG
CCATAATTGA ATTGGGGCAA GAACGACGAG GCGATTTGAT TGATATCAAG TTTTTGACGC
CCACTCGATC CACAATCGTG TACGAATTAC CCCTGGCCGA AGTGATTACG GACTTTTTCG
ATCAACTCAA ATCCCGTACG AAGGGCTACG CCTCGATGGA ATACTCCTTG ATCGACTACC
GAGCAAGTGA TTTGGTCCGG CTCGATGTCA AAATTAACTA CGAAATGGCC CCGCCGCTCG
CTTGTGTCGT GCACCGTGAC AAGGCCCAAT CCATCGGGCG ACGATTGTGT GCGTCCCTCA
AGGACCTCAT TCCTCGCCAA ATGTTCAAAA TCCCCATTCA AGCCTGTATC GGTGTCAAAG
TGATTGCGTC CGAATCCATA TCGCCAATGC GCAAGGACGT CCTGGCCAAG TGCTACGGTG
GTGACATCTC GCGTAAAAAG AAACTGCTAC AGAAACAAGC CAAGGGCAAA AAGCGGATGA
AAAGTATCGG GAAAGTGAAT GTCCCGCAAG AAGCCTTTAT GGCGGTCCTG AAGTTAAACG
AGTAA
 
Protein sequence
MIQSAAPSCR RWWALGFAAV ALWVDSTSGW TTQRPPMAGL RRFQSSAWYM SSSSTEKPTT 
SGTINGGGGK QKAASQPKKT AGIVYDAQKI RNFSIIAHID HGKSTLADRL LETTQTVAQR
DMEAQLLDNM DLERERGITI KLQAARVLYQ ARDGEMYCLN LIDTPGHVDF SYEVSRSLAA
CEGALLVVDA SQGIEAQTLA NVYLALENDL EIIPILNKID LPAADPERVA EEIEATIGLD
CSGIVHASAK TGIGIDDILE RIVQMVPPPP AATGGPFRAL IFDSYYDAYR GVIVFFRVMD
GEVSQGDKVR FLASDAEHDV TEVGIMQPNQ VPVDCLHAGE VGYLWGNIKD VLDARVGDTI
VLAKEYKESA SKGKSPPIEA LPGYADSVPM VYCGLFPVDA DQYESLRDAL GKLRLNDAAL
SYEPESSGAM GFGFRCGFLG LLHMEIVQER LQREYDIDLI VTAPSVVYKV KKEHQEEFFI
DTPAKMPDLG RNDVALEPYV RMEVLTPSEY NGAIIELGQE RRGDLIDIKF LTPTRSTIVY
ELPLAEVITD FFDQLKSRTK GYASMEYSLI DYRASDLVRL DVKINYEMAP PLACVVHRDK
AQSIGRRLCA SLKDLIPRQM FKIPIQACIG VKVIASESIS PMRKDVLAKC YGGDISRKKK
LLQKQAKGKK RMKSIGKVNV PQEAFMAVLK LNE