Gene PHATRDRAFT_50372 details

Gene Information

Locus tag: PHATRDRAFT_50372
Symbol:
ID: 7199148
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011697
Strand:
Start bp: 124660
End bp: 128164
Gene Length: 3505 bp
Protein Length: 1020 aa
Translation table:
GC content: 52%
IMG OID:
Product: predicted protein
Protein accession: XP_002185284
Protein GI: 219130254
COG category:
COG ID:
TIGRFAM ID:
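
The listed gene length follows from the start and end coordinates if both are treated as inclusive positions on the replicon. That is an assumption about this record's coordinate convention, although it matches the 3505 bp shown above. A minimal sketch of the arithmetic, using a hypothetical helper name `inclusive_length`:

```python
def inclusive_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive.

    Assumes 1-based, inclusive coordinates (an assumption about this record).
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
gene_length = inclusive_length(124660, 128164)
print(gene_length)  # 3505
```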


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CATCTCTCAC CGACGGTGAA AAAAGTTTGG GTCTTGTGGT ATCGTCTTTG CCTGCGGACC 
TGTACGGGAC CCATCACAGT CACTACTTCT GTCTTGGCGA CGCTACCTTG CTATCAAAAT
GACTTCTGGG CAGACGAGTA TGCCGGGAAA GGAAGAGGAC GAAGAAGAAT GGATCTTTAC
TTTGCGGATT CCTTCGTCCT CCGCCAAGGA TGCTGCTGGC GGGGTCTACG AACTTAGTCA
ATCCCAGCCG CCACCGACCT TATCTACTCA CGCTAGCAGT AGTGGTAGCT TCATGAACAC
CAACAATAGC AATGATTCTG TTTCTGTAGA CGAAACGGCT GAGCAAGAAG CCCTCTTGGT
TGCGGGGGCT GCAACACAGC AGATAATTCG ACCGCCTTCT CAGCACTCAT CCGCGAACCG
TAACACCGAC TGCGCTACCG ATATCCCGTC TGGTTGGACT TGGCAAAGCC CTTATTGGAG
TTTGCAGTCT GATGTTGTCC TGCAAGCGAC TTACCGTCCG TATTGGCAAG CCCTGCCCGA
GTTCACCAAT TGTCGGGCCT GCCGAGCATC AGCCTACTGT CGGACTCGTT GTTGTCGGAC
GTGTACTGGA ACGCGCGCTT TGGAGTTTTA TACCACTGTT TGGACGACTC AGGATGTCGT
GAATGTGGAG GTGGTAGAAG AATGTATTGA AGTCGCAGGG GTGGGCCCGT GTCGGGCTTT
GCGAGTGCAT TTGACACCGT TGGATACAGC CTGTTTGCCG GTCATTCCAG ATGCATTGCA
GAAAGCCTGG GAGGGCCACG TCCATAATAT GCGCCAGGCC CACGATTCCA ACCTTAGCAA
CAAAAATTCC TCCTCTCACA CCGCAAGCAA GTCAGATACG GTCGCCGGTC CTACTATAAT
CTTTTGTTTG CCTACCAAAC AAGTTGATCC CCCTTTATTA TGGAAGGTAG AATTTGGTGG
TGCTAGCAAT ATGTCACAAA CGCCTATTGC TCTGGAAAAT CTGACTGTTT TGGACGGGCA
ACTGCAGCAA TTTAGGAACA CCGTTCCCTC ACCCACACAT ATTTACATTC ACGGATATCA
GTCTTGGTCT TTTGCCGGTT CTATTGTGAA AGGGCAAGAT CAACCGCAGT CGGCCATGCC
CGACTTTTTA AGTCGGGCCT TCAATTATGG CGGTTCGCCT CCACCCGTCT CCGACGATGT
TCTTACGTAC GTGCCACCCT TGTCGCATAA TCATATTCAT CGCGACAACG ACGGCGATGC
CTACACGGGC CCGCAATCCT GGAAAACACA TTACCAGTCC GACTTTTTTA CTTGCGTCAC
GTCCGATGGA ACGATTCCGT CGTTTTGGAC ATCACGTCGC GAAAAGCAAT TTCCTTTTCA
AGCTTTGGAC GAGACAGGTG GACCCGGACT AGTATTGGGG TGGCTGTCAC AACGCGAACA
GTACGGCGTC ATTATGGCCG ATGTGGATTT AAGGCGGTAT GCCATGCACG TTTCCGGGCA
CGGACAAATA ATTTGGGGCC GTGGCTCTGG TTCGACAACC ACAAATACCA TTGCTCTAGA
GACTGATTGG GCCTATGCAC AACTGATCGC CCCACATTCA TACGACGAAG AACCCATGGT
GCACTACCTC GAAGCAGCAG CGGGTTACAA TCAAGCCCGA CCGCTCCGCA ACGGCTCGTT
ACTCACCGGG TGGTGCTCAT GGTACCATTT TTACGAAAAT ATTACGGCCG GTACGCTAAG
TGAAAACGTA TCCAAACTAG CAGCTCTGAA GAACCGAGTC CCAACGAATG TGGTTGTAGT
TGATGATGGG TATATGACAG CTTGGGGCGA CTGGGATTCA GTCAAACCCG GTGCCTTTCC
GCAAGGCATG GCGGCTGTCG CTCGTGATAT TGTTGCACAA GGTGGTATGC GCGCCGGATT
GTGGTTGGCC CCGTACGCAG CCGACAAGCA CTCTCGTTTG GTAAAAACCC ATCCCGATTG
GATTATTCGC AACGACTCTG GTATTCCGGC AAATTCATCC AATTGTGGAA AGTTTTTTTA
CGGACTGGAC GCCACCAATC CAGCTGTCCG GACCTATGTG TACGAATGCA TCCGACGTGC
GGTACACAGT TGGGGTTTCG ACGTCCTTAA GATTGACTTC CTTTACGCCG CTTGTCTGGA
AGGCAACGGT AAGCACGATT TGTCGCTGAG CCGGGCGCAG ACGATGGATC TCGCCATGCA
AGCCATTCGA GATGCTGCAG GTCCCAATGT GTTTTTGATA GGCTGTGGTT GCCCAGTTGG
ATCTGGTATC GGCTACGTGG ATGGCATGCG GGTTTCAGCT GATACAGGTC CAACTTGGTA
TCCCGCTTTA CCGCTTCCGT GGTGGGATCA TGGGACTCTA CCTTGTTTGC GGTCAATGGT
CCGTAACAGT ATGAGCCGAG CACCGTTGGG ACATCGATGG TGGCACAACG ATCCGGATTG
CTTGCTATTG GGTGAAAGCA CCCGCCTTAC GGACGAAGAA GTAGCGAGTG CGGCCTCGGT
TGTGGCCATG ACGTGCGGTA TGATGCTTCT TTCAGATGAT TTGACCAAGG TCAGCGTGGC
GCGGACCAAT ATTTTGACCA AAATTTTCCC CATGACTGGT GTTACGGCCG TTGTCTTAGA
TTTGCACAGT GCGAGTGACG GCTTGCCAAG TTTATTGCGA CTCTGGTGTA CGGACAAGTA
CGACTTGTTG GATTCGTTTC GTGAGCGTAT GGTGGTGAGC GCGCAAGACC ACAATGCGGA
AGCGACTTAC TTTGCGCGGC AGTCGTCATC GTATCATCCT GATAAGGACC AACAGCATCC
GATCGAACGC CAACGTAGCT GTATTCACGT GACAAAAGGT CTGGGAACGT GGACGGTTGT
TTCGGTCAGT AATTGGTCCG ATCGGACCGC TGTGGTTAAC TTACCTCCTC CTGCTTTGTT
ACCTCCGCCC ATGACTGGAT GGGAGCAAGG CGATGAGGAG CCAGAATCAT TTTTGCAGAC
TCCTGAAGAA GTTGACTGTG AGCAGCACGG TTGCCACGCC TTTGGATTCT GGTCGTCCAA
GTACACTTGG TTGCCCAACC AAAAATACAA CGACAATGGG CAAGGCCCGG AACGAATTCT
TCGTCGAAAG CTAGTTGCTC ACGAGACGGA AATCTATCAT ATTAAAGCCG TAACACCTGA
CGCAGCGCAA TACGTTGGCA GTGATTTGCA CTTTTCCTGC GGACACGAGG TCCTGTACTT
TCGGGCACAG AAGAATCAAG TCAGTGTCAC TCTCAAAACG AAATACCATC GTGTCGGGCA
CATTTTCCTT TTCCTCCCCT GTATTAATAC GAATTCTGTC AAAGTGACTG TCAACGGAGA
AGCTGGGCGA TGGCATGCGG TAGGAAATGT ACCGAACGGG CACGACAACG GACATGCCCA
ACTGATTGGA CGAGTCTTCC GGGTGGCGGT CGTCGTGCAC GCCAATGGCC GTCCACAAGA
TGGACAAGTC AAGGTTGAGT TTTAA
 
Protein sequence
MTSGQTSMPG KEEDEEEWIF TLRIPSSSAK DAAGGVYELS QSQPPPTLST HASSSGSFMN 
TNNSNDSVSV DETAEQEALL VAGAATQQII RPPSQHSSAN RNTDCATDIP SGWTWQSPYW
SLQSDVVLQA TYPCLPVIPD ALQKAWEGHV HNMRQAHDSN LSNKNSSSHT ASKSDTVAGP
TIIFCLPTKQ VDPPLLWKVE FGGASNMSQT PIALENLTVL DGQLQQFRNT VPSPTHIYIH
GYQSWSFAGS IVKGQDQPQS AMPDFLSRAF NYGGSPPPVS DDVLTYVPPL SHNHIHRDND
GDAYTGPQSW KTHYQSDFFT CVTSDGTIPS FWTSRREKQF PFQALDETGG PGLVLGWLSQ
REQYGVIMAD VDLRRYAMHV SGHGQIIWGR GSGSTTTNTI ALETDWAYAQ LIAPHSYDEE
PMVHYLEAAA GYNQARPLRN GSLLTGWCSW YHFYENITAA WGDWDSVKPG AFPQGMAAVA
RDIVAQGGMR AGLWLAPYAA DKHSRLVKTH PDWIIRNDSG IPANSSNCGK FFYGLDATNP
AVRTYVYECI RRAVHSWGFD VLKIDFLYAA CLEGNGKHDL SLSRAQTMDL AMQAIRDAAG
PNVFLIGCGC PVGSGIGYVD GMRVSADTGP TWYPALPLPW WDHGTLPCLR SMVRNSMSRA
PLGHRWWHND PDCLLLGEST RLTDEEVASA ASVVAMTCGM MLLSDDLTKV SVARTNILTK
IFPMTGVTAV VLDLHSASDG LPSLLRLWCT DKYDLLDSFR ERMVVSAQDH NAEATYFARQ
SSSYHPDKDQ QHPIERQRSC IHVTKGLGTW TVVSVSNWSD RTAVVNLPPP ALLPPPMTGW
EQGDEEPESF LQTPEEVDCE QHGCHAFGFW SSKYTWLPNQ KYNDNGQGPE RILRRKLVAH
ETEIYHIKAV TPDAAQYVGS DLHFSCGHEV LYFRAQKNQV SVTLKTKYHR VGHIFLFLPC
INTNSVKVTV NGEAGRWHAV GNVPNGHDNG HAQLIGRVFR VAVVVHANGR PQDGQVKVEF
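
Both sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before lengths or base composition can be checked against the record. The sketch below is one way to do that. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the GC calculation is a plain count of G and C over all characters, which may differ from how the database derived its 52% figure. The example runs only on the first two blocks of the gene sequence.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C characters in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two ten-base blocks of the gene sequence shown above.
first_blocks = "CATCTCTCAC CGACGGTGAA"
print(len(clean_sequence(first_blocks)))  # 20
print(gc_fraction(first_blocks))          # 0.55
```

Passing the full gene or protein text through `clean_sequence` and taking `len` of the result gives a count to compare with the Gene Length and Protein Length fields.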