Gene PHATRDRAFT_42643 details


Gene Information

Locus tag: PHATRDRAFT_42643
Symbol:
ID: 7195998
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011669
Strand:
Start bp: 684151
End bp: 687876
Gene Length: 3726 bp
Protein Length: 1238 aa
Translation table:
GC content: 49%
IMG OID:
Product: predicted protein
Protein accession: XP_002176629
Protein GI: 219109751
COG category:
COG ID:
TIGRFAM ID:
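The Start bp, End bp, Gene Length, and Protein Length fields can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, which the record itself does not state, and that the coding region ends with a single stop codon. The function and variable names are illustrative only.

```python
# Values copied from the record above.
START_BP = 684151
END_BP = 687876
PROTEIN_LENGTH_AA = 1238


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


gene_length = span_length(START_BP, END_BP)

# Bases needed to encode the protein plus one stop codon (assumed).
coding_bases = PROTEIN_LENGTH_AA * 3 + 3

print(gene_length)                 # 3726, matching the Gene Length field
print(gene_length - coding_bases)  # 9 bases in the span beyond the stop codon
```

Under these assumptions the listed span is 9 bases longer than the 1238 codons plus stop would require, which is consistent with the displayed gene sequence continuing for a few bases after the first in-frame stop codon.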


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.107643
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAACGAG CAAAGAAGAA TCCCGCCAGT AGCGTGAAAG TCCGTTTCGA CGGACTCGAC 
GTGACGGCCA TGGTGTCGCA CGTACAACGC CGCTTGCTCG GACGCAAAAT TATCAACGTC
TACGATGGCG ACAACGGCGA AACGTACGTC TTCAAGCTGG ATAGTAGTGG TGGGACTACT
ATCAGCAACA ACAACAACAA CACTAGCAAC TCTAAAGAGT TCTTGTTACT GGAGTCGGGA
ATTCGCTTTC ACCCGCTGGA GCATTTCGAG TCAAACTTGC CCATGCCGAC ACCGTTCTGT
GCCAAGCTGC GCAAGCATTT GCGGGGACTC CGACTGGAGC AAATATCGCA AATTGGGACC
GATCGAGTGA TACTCTTGCA ATTTGGTTCC GGAGCTTCCC GGCACGCTTT GATACTGGAA
CTGTACGCCA AAGGAAACAT TATCTTGACG GAGGGGATTC ATTACACCAT ACTGGCACTT
TTACGATCGC ACGTCTACGA AAAGGATCAG GTCGCCGTCC AAGTTGGACA GGTCTATCCT
GTTACGTATG CCACATCGGT ACAAAAAGAC AACCAAACCG TAGCGAATGC TGTTGCGGCT
ACCGATACTC AACCCGAAAA CGATCCGTCA CCAACTTCTA GAATCATGGA CACTGCATGC
GCTGCCAAAA ACAAAAATGG TATTCTAAAT ATGTCGATTG AGGAGATTCA AGCATCCTTG
GCGCTGCTAC TCGAGCCGGC ACCGGTATCA GCAACGACCA AAAAAGGGAA AAAAGGAAGC
CCGCTCAACT TGAAAACGCT CTTATTGCAA CCCCAATGGG GTGTCTCTCA GTACGGACCC
GCACTACTGG AGCACTGTAT TTTACAGGCA AATCTGCTAC CGCATGCATC GATCAAGGAG
ACCGTGCTGC AGGCGGCTGA TTGGGAACGA CTGCAAACAT CGCTAAGCGA ACAAGGTCCT
GCCATCATGT ATAATCTACA CTCGGCAGCG ATCGACACGC CCGGCTACAT TCTTTATCAA
CCTCGCGTGG AGGAAGATAT CGTTAACGGC AAGCCGCATT CTGAAAATCT GTCGTCGGCA
GTTGCAGTCG TGGCCAAAGA ATTAGCACAC GCTGATAAAG TACTGCTCGA ATTCCAACCC
CACTTGCTTG CCCAACACCA GAATTGTCCC CGGTTGGAGT ACAAACACTT CGGCGCCGCC
GTGGCTGACT TTTTCGCGCA TATGGTTGCC CAGAAACGCC TTCTCAAGGT CCAAGCCTCG
GAAATGGCCG TCCAAGAAAA ACTGCGGAAA GTACAACAAG ATCAAGCCGA TCGCGTGATG
GCTTTGGAAC GCGACCAGCA AACGCTACAA GCTTATGCCC AGGTAGTCAA GAACAACGCG
GAAAACGTTG ACAAGGCCTT GCTAGTGATA AACTCAGCTT TGGATAGTGG TATGGATTGG
GATCAACTGA TTGAACTTGT GAGTGTTGAA CAGGCAAATA GAAATCCGAT TGCTAATTTG
ATTGTCCGCT TGGAATTGGA AAATGAAATC ATGATACTAC GACTGCCTCG AGACCCGTTC
GACGAATTGT CTGACGTGTT GAATGTGAAT GTGTCGTTGA AAGATTCGGC GCATGCCAAC
GCCAGTGCGC TGTTTGCAAA GTATAGGGCA TCCAAGGAGA AGACACAAAA AACTCTTGAA
TCGTCAAGTA AGGCTTTACA GGCGGCCGAA GAAAGCGCCC AACGGCAATT GATCGAAGCC
CAACGACGCA CGAAACAAAC TGTCGCTGCC GTCAAGCGCA AGCCAGCTTG GTACGAAAAG
TTTCACTGGT TTGTCACTAG TGACAACTAT CTGGTGCTAG GCGGTAAGGA CGCCCACCAG
AATGAGTTGT TGGTCAAACG ATACTTGCGG GCCGGGGACG CTTACTTGCA TGCTGAAGTG
CACGGAGCTG CCTCGTGTAT TCTTCGTGCT AAACGTCGAC GACTCCCGAA CGGAGCCACC
CAGAGTATAC CCTTGTCTGA CCAGGCCCTG CGGGAAGCGG GCAACTTTAC AATTTGCCGG
TCTTCAGCAT GGGCGAGCCG CATGGTCACG TCCGCTTGGT GGGTGGAATC GCACCAAGTA
TCCAAAACTG CACCGAGTGG AGAATTCTTA ACCGTAGGGT CATTTATGGT ACGAGGTAAA
AAGAATTTCT TGCCTCCGAG TCCACTAGAA ATGGGCTTGG CCGTGCTGTT TCGGTTAGGC
GACGACGATA GTATTGCCAG GCACAAAACC GAACGTAGAG ACTTTGCCCT GATTGAGTTA
GAGAATTCTA GCGTGGATGT GCTCGACGCC GTATCGTCGT TTCAGATGGA GCCGAAGACA
AATATTGAAG GTCAAGAGGC TACGACACAC AGAGACACAA CAGAGCACGA AGGATCCGAT
TTAGTATCGG ATGAGGTCTG GATGACGCTT CCGAAAGTCA TCGTCTCAAA CAGCACGTCT
AGCGCTGAAA ATCTGATCAA CGATCCTACG CGCGACGACG GTAGTTGTGG AAGCGATGGC
AACGAAGAAG CCAAGAAAGG GTCGACCACA AACGAAGGAA ATGGACGCCG TACAAAAAAG
GGCCTTTCGG TCAAAGAAAG GAAGCAAATG AAGAAATACG GTTCGCTCGG CGAAGCTAGG
AAGTTGCACT CAACAGTTGC AGTTGACAAG TCATCCACAG AGGATACCCA CGGTCAGCAG
CCTGTTTTGC CCTCCTTGGA CGGCCTCATT GACGCGAGCA AACTGAAGAG AGGCAAGCGA
GCGAAAGCGA AACGTGCCAT GCTAAAGTAT ATGGATCAGG ACGACGAAGA TCGAGAGTTG
GCGATGCTGG CACTGCAAGG AGGTGAAGGA AAAAATCGAA AAAAGGGCAA GAATAAACGC
AGCCAAGGAC CTGTGTCAGC AGCGCAAAGT CAAGTCGCAG GAGAAACTGC TGCATTATTA
GTAAGAGATA CGTCCGAGAC CATTGAACAG CTACCCGGTC AAGTCGTATC TATTTTGCAA
GAATGCCTAA CTGCAAACAA TGGACTAGGC AAGCATAACG AAGCGATTCG CTGGGATAAA
CTTGACTCCG ACACTGTGGA GCAGCTTGTT GCGCTAGAGT CCTTGGACGC GCAAGTAGCT
GCCGCAACAC GTCTTTTGAA TTTAAAAACG AGTACTCGAG TGGACAACTT CTCAGCGAGT
TTAGGTGGTA TTATTCGTAC TATTCGAAAA TACGGATACA GTTGTCTCGA TGATGAAAAG
ACCGAAGTAC TAGAAAAGCC AAAAAGGAAG ACGAAAGCGC AGAAAGATGT TGAAAGTACA
CAGTGGAAGC AAACAATGGA AGAGGAGGGA GTTGTTGGTA GCGACCTGGA TGAAGATGCG
GTCGATGATA CGATCGAGCT GAGCAAGTTA TCTGGAATGC CTCAAGCGGA AGATCTTGTT
CTCTATGCAG TACCAGTCTG CGCACCTTAC CAAACTCTTT CAAAGTACAC ATATCGTGTC
AAACTCACAC CTGGTAGCAC GAAGAGAGGA AAGGCTGTCA AGCAGTGTGT GGACATGTTT
TTAAAGAACA TGGTTTTGAA AGAGCCTTCC GCCTCGGAGC ATTGTACAGA ACTCATCAAG
AAGCTCGGAG ACAACGATTG GGTACAAGTT ATTTGTGCAG ATGTAAAAAT ATCTGCACCG
GGGGCTAGCA AGACAGCGAA GAAGCACAGA GCAATCACTA AGAAGAAAAA CAAATAGTGA
ATACTT
 
Protein sequence
MQRAKKNPAS SVKVRFDGLD VTAMVSHVQR RLLGRKIINV YDGDNGETYV FKLDSSGGTT 
ISNNNNNTSN SKEFLLLESG IRFHPLEHFE SNLPMPTPFC AKLRKHLRGL RLEQISQIGT
DRVILLQFGS GASRHALILE LYAKGNIILT EGIHYTILAL LRSHVYEKDQ VAVQVGQVYP
VTYATSVQKD NQTVANAVAA TDTQPENDPS PTSRIMDTAC AAKNKNGILN MSIEEIQASL
ALLLEPAPVS ATTKKGKKGS PLNLKTLLLQ PQWGVSQYGP ALLEHCILQA NLLPHASIKE
TVLQAADWER LQTSLSEQGP AIMYNLHSAA IDTPGYILYQ PRVEEDIVNG KPHSENLSSA
VAVVAKELAH ADKVLLEFQP HLLAQHQNCP RLEYKHFGAA VADFFAHMVA QKRLLKVQAS
EMAVQEKLRK VQQDQADRVM ALERDQQTLQ AYAQVVKNNA ENVDKALLVI NSALDSGMDW
DQLIELVSVE QANRNPIANL IVRLELENEI MILRLPRDPF DELSDVLNVN VSLKDSAHAN
ASALFAKYRA SKEKTQKTLE SSSKALQAAE ESAQRQLIEA QRRTKQTVAA VKRKPAWYEK
FHWFVTSDNY LVLGGKDAHQ NELLVKRYLR AGDAYLHAEV HGAASCILRA KRRRLPNGAT
QSIPLSDQAL REAGNFTICR SSAWASRMVT SAWWVESHQV SKTAPSGEFL TVGSFMVRGK
KNFLPPSPLE MGLAVLFRLG DDDSIARHKT ERRDFALIEL ENSSVDVLDA VSSFQMEPKT
NIEGQEATTH RDTTEHEGSD LVSDEVWMTL PKVIVSNSTS SAENLINDPT RDDGSCGSDG
NEEAKKGSTT NEGNGRRTKK GLSVKERKQM KKYGSLGEAR KLHSTVAVDK SSTEDTHGQQ
PVLPSLDGLI DASKLKRGKR AKAKRAMLKY MDQDDEDREL AMLALQGGEG KNRKKGKNKR
SQGPVSAAQS QVAGETAALL VRDTSETIEQ LPGQVVSILQ ECLTANNGLG KHNEAIRWDK
LDSDTVEQLV ALESLDAQVA AATRLLNLKT STRVDNFSAS LGGIIRTIRK YGYSCLDDEK
TEVLEKPKRK TKAQKDVEST QWKQTMEEEG VVGSDLDEDA VDDTIELSKL SGMPQAEDLV
LYAVPVCAPY QTLSKYTYRV KLTPGSTKRG KAVKQCVDMF LKNMVLKEPS ASEHCTELIK
KLGDNDWVQV ICADVKISAP GASKTAKKHR AITKKKNK
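
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how the two listings relate, the following strips that grouping and translates the first line of the gene sequence. It assumes the standard genetic code (NCBI translation table 1), because the Translation table field of this record is blank. The helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code (assumed), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_block: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq_block.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons from position 0 until the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First line of each listing, copied from the record above.
first_gene_line = "ATGCAACGAG CAAAGAAGAA TCCCGCCAGT AGCGTGAAAG TCCGTTTCGA CGGACTCGAC"
first_protein_line = "MQRAKKNPAS SVKVRFDGLD"

translated = translate(clean(first_gene_line))
print(translated)                               # MQRAKKNPASSVKVRFDGLD
print(translated == clean(first_protein_line))  # True
```

Passing the complete gene sequence to the same functions should give the full protein, with translation stopping at the first in-frame stop codon.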