Gene PHATRDRAFT_20547 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_20547 
Symbol 
ID7201155 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011677 
Strand
Start bp490127 
End bp492407 
Gene Length2281 bp 
Protein Length555 aa 
Translation table 
GC content51% 
IMG OID 
Producthomeobox protein 
Protein accessionXP_002180649 
Protein GI219119793 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GGTGCTGCAG CCCATCACGA AAGATTCGGC GTCTACGACA AAACTCCAAT AGGTCTCGGT 
TTATCCATTC CATCGCGAAC AAGGTTCATC ATGAAGTTCG TTCTGGGATA CTTTGGATTC
ACGCTGGGCT TTGCCCAGGC CTTCAGCTTA CAAAGTGTGG GAAGATCGTC GTCCAAGACG
ACGACGCACC TCCGAATGGA TTCGGAAGGC CATGATGAAA AGCCCGTCTT GAATAAGTAC
AGTCGGTGAG TGTATGCCAT CGCGACATGG GACAATACTC CGTCGGATTG TTCGTAGAAG
AGGGTAAAGT GAGCGTTGCC GAAGAACAAA GGTGCACTAA ACAACACAGA GTAGCTAGCT
TACACACGCA CGTGTTCCTA TATTTTCAAT TTGCAGAACC TTGACGCAAA CCAAGGTCCA
GGGTGCGTCG CAGGCGATGC TGTACGCGAC GGGTATTACG GAGGAAGATC TCGACAAACC
TCAGGTTGGC ATCTGCTCCG TCTGGTACGA AGGCAATCCT TGCAACATGC ACCTTTTAGA
TCTTTCCGAA AAGGTCAAAA AAGGAGTCGA AGACGCCTCC TGCGTAGGCT ACCGCTTCAA
CACGGTGGGT GTTTCCGACG GTATCAGTAT GGGCACCTCC GGTATGCGGT ATTCGTTACA
GTCGCGCGAT TTGATTGCGG ATTCCATGGA AACGACTATG GGAGGACAAT GGTACGATGG
ATTGATTGCC TTGCCGGGAT GTGACAAAAA CATGCCGGGC TGTATCATGG CCATGGGACG
CTTGAACCGA CCGGGTATCA TGGTCTATGG AGGAACTATT CGGGCTGGAA AGCAGCCGTC
TACTGGAAAC AGTCTCGACA TTGTCAGCGC CTTTCAGTCT TACGGAGAGT ATGTTTACGA
TAAGATTACG GAGGAGGAGC GCAAGGAAAT TCTGCAGCAC GCATGTCCCG GACAAGGAGC
CTGTGGAGGA ATGTATACCG CAAATACAAT GGCCACCGCG ATTGAAGCCC TCGGTACGTG
CTCGGATTTT TTTCTTCCCT TGGATCATGT AGTGATTCTC ACGTGACTTG ATCTACAGGC
ATGTCCCTTC CGTATTCATC GTCGTCGCCG GCGGATTCCA AGGAAAAGGC TGATGAGTGC
TATCGTTCGG GGGAAGCCAT GTATCGCCTT TTGGAACTTG ATCTTAAGCC TCGTGATATC
ATGACCAAGG CGGCTTTCGA GAATGCCATG CGTATGGTCA TGGTCACGGG TGGATCCACC
AACGCTGTCT TACACTTGAT TGCTATGAGT CGCTCGGTTC AGAATCCAGA AGTAGCAATC
ACGTTGGAAG ACTTCCAACG AATCTCCAAT CAAACACCAT TCTTGGCTGA CTTAAAACCT
TCCGGCAAAT ACGTCATGGA AGATGTCCAG AATATTGGCG GAACTCCTGG ATTGATCAAG
TTCATGATTG ACAATGGTTT GTTTGATGGA AGCCAAATGA CCGTTAGCGG GAAAACACAC
GCCGAAAACT TGAAGGATCA TCCCGGACTC ACACCCGGAC AGGACATCAT CCGTCCTCTT
TCTGACCCCG TGAAAAAGAC TGGTCACTTG ATGATGATGT ATGGAAATCT CTGTCCCGGA
GGTGGTGTCG CCAAGATTAC CGGTAAGGAA GGAGAAACGT TCACTGGAAC TGCGCGTGTG
TACGACAATG AGCAATTGAT GATGCGTGGT TTGGAAAACA AGGAAATTAA GGCAGGCGAC
GTGGTCATCA TTAGATATGA AGGGCCAAAG GGTGGCCCGG GCTTACCAGA GATGCTGACA
CCCACAAGTG CGATCATGGG CGCTGGGCTC GGAGACAAAG TGGCGCTTTT GACCGATGGT
CGGTTCAGTG GTGGAAGTCA CGGCTTCTGT ATCGGACACA TCACTCCCGA AGCGCAGGTT
GGTGGACCCA TTGCCCTCGT TAAGAATGGT GACCCCATCC GCATTGATGC TCGTGCTGAA
CAACGAACCA TTGATCTGTT GATTTCGGAC GAAGAATGGG AGAAGCGAAG AACAGAATGG
ACGCCGCCAC CTCTCCGAGC GACGCAGGGA ACCCTCTTTA AGTACATCCA GTGCGTTGCG
ACTGCCAGTG AAGGATGTGT GACTGACGAA GTTGGAACCT CGACAGCTGC TGAGATTGTG
ATTGCCGCTC CCAAAACTCC CGCGGTTGCG GAATTGGAAG CAAAGATTGC AGCGCTGGAG
GCCAGGATCG GCCAGGTTAC CAACTGAGCT AATTATCCTG AAATCTCATT TGGTATTGTT
C
 
Protein sequence
LTQTKVQGAS QAMLYATGIT EEDLDKPQVG ICSVWYEGNP CNMHLLDLSE KVKKGVEDAS 
CVGYRFNTVG VSDGISMGTS GMRYSLQSRD LIADSMETTM GGQWYDGLIA LPGCDKNMPG
CIMAMGRLNR PGIMVYGGTI RAGKQPSTGN SLDIVSAFQS YGEYVYDKIT EEERKEILQH
ACPGQGACGG MYTANTMATA IEALGMSLPY SSSSPADSKE KADECYRSGE AMYRLLELDL
KPRDIMTKAA FENAMRMVMV TGGSTNAVLH LIAMSRSVQN PEVAITLEDF QRISNQTPFL
ADLKPSGKYV MEDVQNIGGT PGLIKFMIDN GLFDGSQMTV SGKTHAENLK DHPGLTPGQD
IIRPLSDPVK KTGHLMMMYG NLCPGGGVAK ITGKEGETFT GTARVYDNEQ LMMRGLENKE
IKAGDVVIIR YEGPKGGPGL PEMLTPTSAI MGAGLGDKVA LLTDGRFSGG SHGFCIGHIT
PEAQVGGPIA LVKNGDPIRI DARAEQRTID LLISDEEWEK RRTEWTPPPL RATQGTLFKY
IQCVATASEG CVTDE