Gene PHATRDRAFT_50153 details

Gene Information

Locus tag: PHATRDRAFT_50153
Symbol:
ID: 7198940
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011695
Strand:
Start bp: 205794
End bp: 208721
Gene Length: 2928 bp
Protein Length: 634 aa
Translation table:
GC content: 49%
IMG OID:
Product: predicted protein
Protein accession: XP_002184989
Protein GI: 219129635
COG category:
COG ID:
TIGRFAM ID:
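
The Gene Length value above follows from the Start bp and End bp values if the coordinates are read as 1-based and inclusive. That reading is an assumption; the record does not state its coordinate convention. A minimal sketch of the arithmetic, with `span_length` as a hypothetical helper name:

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the Gene Information fields of this record.
gene_length = span_length(205794, 208721)
print(gene_length)  # 2928

# A 634 aa protein needs 634 * 3 = 1902 coding bases (1905 with a stop codon).
coding_bases = 634 * 3
print(coding_bases)  # 1902
```

The coding requirement is shorter than the 2928 bp span, which is consistent with the gene being listed as spliced.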


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTACGAC CCCTGTGGGT GCTAGTTCTT GCGTTGGTCG GAGTCGTCTT GGAATCGGTT 
GCAGCCCACA GTCAGGAGAA TTTTGGTGGG TTCTGCGGCG GCGGAGATGG AACGCCGACT
TGTGCGATTG GAATTTGCAG CTCCCACAGC TTATGTGGCC CCTGTGAGGT TGGAACTTGC
GAAACTTCCC TGGCGTGCTC CCTTGATGGA ACAAACACAA TTTGTGCAGG CTCGGACTTG
GAACGACGAT TGAAACAGAA GTTGCCGCTG CGAGAGTCCG AGGCACCAAC ATCAACATCA
CCGACTCTCA CGGAAAACGA CTATTATTGT ACCGGAAAAT GTACCTGTGC TGCAAACTTT
TGCAACTCGG ATAACGACTG TGGCGGGGGA AGAGAGTACT GCCAAGGAGG TAATGTCTCC
TCGATGTTTG GCTATAACAT GACAACAGCT TGTTCCGGTA CCTGTGTTCC GGCCCTTCAA
CTCGACGCGT GTGGCGCTGG TCTGTGCACA ACAGACGTTG ATTGCGGCAG CGAGAACAAA
ACGTGTCTAA ATCTAGTTGG TGAAGAATGT GGGGGCTTTT GCAACGGGCT AGCATCCACT
GTCAGCCACG ACAGACACGA AATCTCTGTT CCTTCCGCGT GTGCCGACCG TTCGTGCAGT
TCCGATTCTG AATGCAAAAC AGGAAACGAA TTTTGTCTCG ACTTTAATAG CAAGGTTCCT
TGCAGTGGAA CGTGTGCAAC CACCACCACT TTAGATGCGC GGAGCTCCTG TATGATGAAC
ACATGCTCAG TCAACCGTGA CTGCATCGAG CCTTTGGAAT TCTGCTCACA GGCCCAGGAA
GGAACACGTT GCAGCGGCTT GTGCATTCCT ATGGAAATTG AAGAAGAGGA GGAGAGTAGT
ACCAGAACCA GTACAATTAC AAATCCTTTC ATGGGCTCCA ACGACTTCTT GATGACCACC
AACGAAGGTG TCTTTCAGGT TACTGGCTGC GCAATTGGGG CGTGTAGCTC AGACAGTGAT
TGCAGCGCTG ATTCAGAGTT TTGCTTTGTT TCTGCAGATT CACCGTCCTG CTCAGGAGTC
TGCTTACCAT TAGAAGAGAT TGAAGTTAAC GGTTCCTGCA CGCTTTCGTC ATGTACTTTT
GATGAAGACT GCAACATTGG ATTGGAAACC TGCGTGGGCG GGGGTAGCGT CTTTACTCCT
TGTTCAGGTT CCTGCCTCTT GCTTGATATT GAAGAAGAGG GAGACGATAG CCTTGTCGTC
GTTAGCGAAT GCTCTAACTC GACCTGCAGG TCTGACTTGG ACTGCATTAC GGAACTCGAA
GTTTGCGATG GAATGGATAC GCAGAACAAT GACACTTGTT CTGGGGTTTG CATTTCAATC
GATGGGCAGC TTTGTCCGTT GGAGAGCTGC AATTCCGACG ACGATTGCAT CGATCTGACT
GCTGTTTGCC AAAACGCCAC CGACAATTTC ACTTGTTCCG GAACCTGCGA ATCTATTCAG
TGTCCCGTAA CTGACGAGGA GGTACAATCG TGCAGTATGG ATAATGATTG TCTTTCCGGA
TTGGAATCAT GCCAAGGATT CGATCTGCAA TTCGCTTGCA GTGGGACATG CTCAATCTTG
GCTGGCGAAT GTCCGGAAGA GGTTTGTTCG AGCGACGCAG ATTGTTCAGA TGGCGTAGTC
TGTCGAAATA TCGACGAAGA GGACCCTACT TGTTCGGGGA CATGTCGCTC TTTTTCTAGC
TGCGGGGGCC TCACGTGCAG CACTGACGGG GACTGCCGGC CTGGCCTGGA ATCGTGCGAA
GGAGAGAGCG ACTTTTCTTG CTCGGGAAAT TGTACCCCAG TCGAGTGTCC AGTTTTTCTG
TCGTGCGGCA GAGACGCTGA CTGCCAAGAA GGCCTGGAAC AATGTCAAGA CTTTGATGGC
GTGACGTCTT GTTCTGGAGT TTGTTTAATC CCCAGCTGTC GAGAGGTTGG CTCCGAAAAC
AGCTGCTCAC AGAATTCAGA CTGCAACGGA TTAGAGTTCG AATGCATCGG TTTCGACGGG
GAAAGAGCTT GTACTGGCCA ATGCGTCCGT ATGCCTTGCC CCGGGGAGAC CTGCTCGACC
ACGAGTGAGT GCATTGACGG ACAAAAATGT TCTGGTGCAG ATGATAATGA ACCGTGCAGC
GGTAGCTGCG AAACCTTTTC ACTACGATTC CGCAATTATC GTGGCGAGCA GTTGCCAAAC
TTTGGCAAAG GATTTGATTT CGGGACATTA TTCTTTACGA AACGAACGGT ATTTGTGAAT
AAAAGGCGTT CAAACAATGA TTGAGATCCT AAACGCATGT CAATACCTCC CAAAGAATCC
TGATTGTGCC GCGGTCGTAG CTACATTAGA GAATGACCAA GACCGTTCCG CGTGGAATTC
GCTGGAGAGT GTTGACGCCG CAATTGAAGC TTGACGACAA ACAGCAAACA ATTTAAAAAC
AAAGAGCATC GCAAAAGTAT TTTGATTAGC GAAGAGGTTG TGATCCTGAC TGTGAGAATG
TTCCGTAGAC TCTTGGCGAA GAAAACCAGG TACTCTTATT TCCTCACAGA GTTTCGATTT
ACATATAAAG TACGTTGCAT GACCTGCAAA ATTTTGTACA CTTTTAATAA CAATTCCCTT
ACTGTTAGTT TGTTGTCCCA GAAGGGAAGA CCATGGCGGG AGAACAGAAA CATGAAGCTG
AATTCGCCGT AAAACACCAT ATCTTGCATC CCAATCCCTT TTGTGCATGC TCGAAAAAGA
TATCCACTAC TGGTATGGGT GAAAATTGAT GCACAGTGTT GTTGCTTGCC TATCTCCAGA
CGTTTACAGT AATACAGTGC AGAATTCAAA TCAAAAAGCT GGAATACAAG GTAATTATGG
TCGAACCGCT TTTTTTGGGA AAGGCAACTC TCTTTGACGT AACGAGAG
 
Protein sequence
MVRPLWVLVL ALVGVVLESV AAHSQENFGG FCGGGDGTPT CAIGICSSHS LCGPCEVGTC 
ETSLACSLDG TNTICAGSDL ERRLKQKLPL RESEAPTSTS PTLTENDYYC TGKCTCAANF
CNSDNDCGGG REYCQGGNVS SMFGYNMTTA CSGTCVPALQ LDACGAGLCT TDVDCGSENK
TCLNLVGEEC GGFCNGLAST VSHDRHEISV PSACADRSCS SDSECKTGNE FCLDFNSKVP
CSGTCATTTT LDARSSCMMN TCSVNRDCIE PLEFCSQAQE GTRCSGLCIP MEIEEEEESS
TRTSTITNPF MGSNDFLMTT NEGVFQVTGC AIGACSSDSD CSADSEFCFV SADSPSCSGV
CLPLEEIEVN GSCTLSSCTF DEDCNIGLET CVGGGSVFTP CSGSCLLLDI EEEGDDSLVV
VSECSNSTCR SDLDCITELE VCDGMDTQNN DTCSGVCISI DGQLCPLESC NSDDDCIDLT
AVCQNATDNF TCSGTCESIQ CPVTDEEVQS CSMDNDCLSG LESCQGFDLQ FACSGTCSIL
AGECPEESSN ASVSTGKELV LANASVCLAP GRPARPRVSA LTDKNVLVQM IMNRAAVAAK
PFHYDSAIIV ASSCQTLAKD LISGHYSLRN ERYL
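
The sequences above are printed in blocks of 10 characters, six blocks per line. As a rough sketch, the blocks can be joined back into one string to check a length or estimate GC content. The function names `parse_blocks` and `gc_fraction` are hypothetical, and the sketch assumes whitespace is the only separator in the listing.

```python
def parse_blocks(text: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among the unambiguous A/C/G/T bases in seq."""
    counted = [base for base in seq if base in "ACGT"]
    if not counted:
        raise ValueError("no A/C/G/T bases found")
    return sum(base in "GC" for base in counted) / len(counted)


# First line of the gene sequence listing, copied from this record.
first_line = "ATGGTACGAC CCCTGTGGGT GCTAGTTCTT GCGTTGGTCG GAGTCGTCTT GGAATCGGTT"
seq = parse_blocks(first_line)
print(len(seq))  # 60
print(gc_fraction(seq))
```

Applying the same two functions to the full gene sequence listing would give a length and GC estimate to compare with the Gene Length and GC content fields.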