Gene PHATRDRAFT_43022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_43022 
Symbol 
ID7196231 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp1817070 
End bp1819640 
Gene Length2571 bp 
Protein Length623 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177379 
Protein GI219111255 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.599773 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CAATTAGCCC AACGATGCCC AACTGTGCAT GGTGGACAGC AGTTCTGGCC TCTATAATAG 
TTGCCACTAT ATATGCCACT CTCTATGCGA CGGAATCTCG ACAAAGCCTG CGTTTCGGTT
TGCACTTCGA CAATGGCGAT GAAGAGAATT GGGACCCCAG GAACAGTCCC GTTGTGTCCA
TTGTTCCACA ATTTTTGCTG AATCATAGTG TGTATACTAC GTCCGATTTC CGCAACAAGC
AGGGCATTGC TCCCCCCTTC TGGGGATGCA AAAATGAGAG CTGTAACGCT TCGGGGGTAT
GGGGACCTTG TTTCTGGCCA CACAAAAAGA TTCGATGGGT AGGCGAAGTT GAACGAATTG
GTCGCACAAA TAGACCACAG TACCAAAACG GCCCGATTGA CCCAACACTT ACGGACGATC
TTTCTGGTCT TTGTCGACCA GGTTTTTTGA TTATCGGAGC CGGGAAATGT GGGACCAGGT
ATGTGCTTCG AGAACAGCGA TCCACAGACA ATGCTTTTGC TTCGGCTGAA CCATGGATAA
TCTTTTTTCG ATGGGTAGCT CGCTTTATCA CTATTTGACC GACCACCCTC GCGTTTTACC
GGCAAAAGAA AAGCAGATCC ACTATTTTAA GGTGAGCCTC GGTGTTCGTG GCAATGTCCC
GACCCGCAAG CCGATTTGAA TGGCGTTCTA ATTCCGTCAC TCTTTTTAAT ATGAAGTATT
ACATGCAGTA TCCCATGCAA TGGTATCTTC GCCACTTTCC GACAGCCAGC AGCTTTCTAG
CTGCTGGTGC TTTGATGTCT GGCGAAGCAA GCCCAGGTTA CTTACCATAT CCCGATGTTG
CGTATCGCAT CAAGAAGCAC ATGCCTGGAC CACGTATCGT TGCTGTTGGG CGAAACCCAT
TGGAGCGTGC GTACTCGTCG TATATGTACA ACTATGTCGC ACCTGTTATG AAGATGATGC
GCAAAGGCAG GGTGCCAAAC ATTTCAAGAG GACTGACGGA TAAAGAGTAC GAAGACCATT
TGTTTTCTTT TGAAGAAATG GCGCGAGCAG AGCTTTTCGT CTTGCGGGAC TGCTTAAGTT
CAGACGGAAC CGGCGTCAGA AAGGCAAAGG ATCGGTACAG CTCTCTTAAC TGGGCTGCTG
CAGAATACAG ACGTCGAGAG AAAGGAGGAC TTCCTCCTTT GATTGATCTG GAAAGCTTCT
GTTACGGTGA TTGGGTGGAC AAGGTAGTCC CTCGCAAGCA ATGGAAGGAC TTGATAGAAA
ACAACCCGGA CAAAGTGATT TACCGTAAAA ATGTGCACCT CACCCAGTCA TTTATTGGTC
GAAGTCTGTA TGTTCTCCCA TTAGAGTGGT GGTATGCTCT GTACTCCGAG AAGGAAATAA
TATTTATTTG TACCGAAGCG ATGAGTGACT TTTCGGGTAC GCCGATGAAT AAGCTCGCGG
AATTTTTGGG GTTACCTCCG CACAACTTTT CAACCATTGT AAGCAAAGGA GCATACAATG
TTGGAGGACA CCGAGGATAT GATAAGGAAA TTTCCTGGGA TGAGATCAAA GATGAGTTGA
ATGTCACTGA GAGGCCAAAG TACGAGGCCA CTCTCTCGGA AGACTTTCTT CGCGAGGTCA
AGGCTTTTAT CGAGCCATAC AATGAGCGTC TTTTTAAATT GATTGGCCAT CGCTGCGAAT
GGTAGCATTT CATGTTGCCT AAAAAGTAGT ATTCTCCCGT TGTACACATA GTGCAGATGT
AACGGAAACT TTGACGGTTT ACGCTGGTAC AGTCTAGTTA GCTTCCTTCG CAGCCTTCCT
TGCTTTCCTC CGCTCATCCG ATAACGCCTT CTTTTCTTCT TTTGTTAATT TTTTCTCAAA
AAGTCCCTGT ACTTCTACTT TCGGCAGCTC CTCTTCGTCG GCAATCTTAG GACGGGGTTT
CGTGGGCGGG GGCGATGAGC CAATAGCGAA CGAATCGTCA ATCTGTACGG CCACTCCAAC
AGCTCCACCC GACCATCCCA ACATCCTAGA ATCACAAAAC ATTCCTTCAG ACGTTACTCC
TCCGACGGCG GTCTTTTTTA CTACCATCTC TTCGCCCTCA TCGGTAAGTA CTTTCGACCC
GACTGGTGCA ATGGCGACCC TGAAGCAATT CAAGAGAGGT CAAATGAGTT TCTGCGTTTC
TTTGGAAAAC GGTCCATGTG TCACCGATCC AAAGTTGAGC AACAATACCT GTTCCCTTCC
CTGACATTTG CCGCCGATGT CACCACTGTA ATAGGGTTGC TTTCGTCGCC GAGATTTACT
TGGCACGCTT TAAAGGATTT TCCCGATTTT CCGCCGCAGT CATTTATTTT AAGCACTAAG
CCAACTTTAT ATTCTGTATG ATATACCATG ACAACAAGAT ATTTTATATT GGTGAAGGGT
AAACGTGAAT GCAGCTCGAT AGACCAACGC AAGTCTGAGA ACAAGAATTG CTCCAACAAT
CTCGGTACAA AAATGATTTT TCGTGTCGAG ACCTATTTTC CGTCTGGCTG GCGGTCGACA
CGTCGAAAAC ACACAGGTTT GGAGAATTTG ATTGACTGTT GTGATGCATG A
 
Protein sequence
MPNCAWWTAV LASIIVATIY ATLYATESRQ SLRFGLHFDN GDEENWDPRN SPVVSIVPQF 
LLNHSVYTTS DFRNKQGIAP PFWGCKNESC NASGVWGPCF WPHKKIRWVG EVERIGRTNR
PQYQNGPIDP TLTDDLSGLC RPGFLIIGAG KCGTSSLYHY LTDHPRVLPA KEKQIHYFKY
YMQYPMQWYL RHFPTASSFL AAGALMSGEA SPGYLPYPDV AYRIKKHMPG PRIVAVGRNP
LERAYSSYMY NYVAPVMKMM RKGRVPNISR GLTDKEYEDH LFSFEEMARA ELFVLRDCLS
SDGTGVRKAK DRYSSLNWAA AEYRRREKGG LPPLIDLESF CYGDWVDKVV PRKQWKDLIE
NNPDKVIYRK NVHLTQSFIG RSLYVLPLEW WYALYSEKEI IFICTEAMSD FSGTPMNKLA
EFLGLPPHNF STIVSKGAYN VGGHRGYDKE ISWDEIKDEL NVTERPKYEA TLSEDFLREV
KAFIEPYNER LFKLIGHRCE CSSSSAILGR GFVGGGDEPI ANESSICTAT PTAPPDHPNI
LESQNIPSDV TPPTAVFFTT ISSPSSGKRE CSSIDQRKSE NKNCSNNLGT KMIFRVETYF
PSGWRSTRRK HTGLENLIDC CDA