Gene PHATR_43922 details


Gene Information

Locus tag: PHATR_43922
Symbol:
ID: 7204319
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011671
Strand:
Start bp: 436146
End bp: 439609
Gene Length: 3464 bp
Protein Length: 973 aa
Translation table:
GC content: 51%
IMG OID:
Product: predicted protein
Protein accession: XP_002186344
Protein GI: 219113521
COG category:
COG ID:
TIGRFAM ID:
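
The Start bp, End bp and Gene Length fields above agree with each other if the coordinates are read as 1-based and inclusive. The record does not state its coordinate convention, so that reading is an assumption. A minimal Python sketch of the check (the function name gene_length is hypothetical):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, ASSUMING 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the Start bp and End bp fields of this record.
length = gene_length(436146, 439609)
print(length)  # 3464, matching the Gene Length field
```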


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.150894
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CAAAATGGGA TGTTTTTGTC AAAGTCACAA AGTCTTCGAA TTCTTTCCGG GTTTCAAAGC 
TCTTGTCGAG ACACGTTGGT ATTTGTTTTT GGCTGGCGTC CTTCCACATG TTGGCTAGCT
CCGTATTTGT TTACTAATAG TAACAGCCAA GGCGGGCGGG CCCTTACGCG GCTTGGGCGG
CTCTCCGCGG TTGCAAGTAG TTACTGTTAG CTGTCGGCAG TGTAACTGCT ATCATCTTTT
TGCTGGGGAT ATTCCTTGCG TATTGCTCTT GTTTAGTGGT CGGAACCCTC ACCGATTGTC
GACAGATTGT TAATCGACCA TGTCTACCGA ACCGTTCGGC ACCACACCGC TCCCGACGTG
CGTGTCGGGA AGCCCAGTGA AAGTCTGGTT CGATGTTAAT CCTCTGTTGG CGCGGGCGCA
GCAGCCCCAA ACCCAACAAG TGGCGAATTT CGAAGCCTTA TCGAACGAAG CTTGTATCAA
CGGCTGCTTG CTGGATGTCA ATCAGCATTT CATTGTGTAC GGGATCAAGA ACGGCCTTAT
CCGCGTCTTC CAACGCCATA CGGTGCTCCG ATCGTTGTTG CGCGGTCACG AGGGACAGAA
TCTGACCGAC ATGCATTTTT TCCAAAACGG CGACGTCCTC GCCTCGGCGG CGTCCAACGC
CCAATCTTCC ACGGTGCTCG TTTGGAGAGT CTTTGGGCGC TCTCCGGAAA TCATGTCGGA
AAAATTGTTG GAAATTTCGA CGCCGCATTT CACTATACAA CGAGTCGTCT GGCACCCCTT
CAATCCGAAT CAGTTTTGGA TGCTGCATAC GAATGCTGCC AACCACATGG TGGCGACGCT
GGTGGAAACC ACGCGGATCG CAACACAGCC TCATCCCGTC GAAGGACACG CCGTGTGCAA
CTTTCACGAT GCGCACATTA TTATGGACGG TGCGGTACAG ATTAGTGCTG ATTGTGCTTC
GGGATCCGGT GCCTCTTTGA CCGATTTGAC CTGGTCCAAT CGGGACACGC GACATATTTT
GACGTGTCAC GATTCTGGGG AAATTGTCTT GTGGGATTTG AAAACGCTGT CGTCCTCGTC
GGCTACCCCT GGTACCGTGA CTCCGGCTCG ATTAGCAACG TTGCGTATGG ACGAACCAGT
CTCAAGGGGT CTGTTTTTGC CACACGAGGA CGTCCTTGTA TCGGACAATC GTAGCCAGGA
GGCCAAACTG ACCACTTGTT TTGTCACAGC CAGTGATAAG AACGGAACGA TCACTGTATG
GAGCCCATTC GAGAGTTCGG GAGCGCTGCC GCAAAAAATA CAAATCTTGG CGGTGGAGAA
TCCCAGTCCC AGCTACGTTT TGGATGTTTG CTCGGGACCC GCCCCGGTCA ACGCATCCCC
GCCCTCGGCT TTTGTGGTGA TGGCTGATCG CCACAGTGGG GCGATTCTAG CGTGGCATTT
GCGGGCAGAT TGGAACGATA CCGTTCCGAA AAAGGCTTTG CTGAAGGGTT GTGACTACGT
GGTGCCATTT CTAACAAAAT TTCCAACCTA TTCTTGGAGT GTAGTGTGTG CGCCTGCGAC
CAACATTTCC GACGAGGAAC TGTCGGACCA GGGTGGATTG GTCTTTGACG TAGAACTCTT
TGCCTACCAG ACTACCGCGG TGCAGCGTTT GAAATTGACT TCTTACATGT GTCTGCCGCC
GGAAACCTCG TGGACGGATC CAACGCCGAG TGTGCGGTTG GAGCGGCTAG TGTCCGCTCA
GTCGGCGCAC GTTTCCGAAA TCGGCTCGGA CGATGCCAAT CCGGATGTTG AATTCGACGA
AGCTTACGAT TTGGAGGAGG ATGACGAGGA AGAAGAAATT GAGGCGCCGG ACCCCTCGTC
GCTACCCTCG CCGTTGGGTA TAGGCAATTC TACGCCGTCG TTGTCGAACA ATCCTTTCGC
TAACTGGTTG GGTACGATTG CGTCGAAAAC GACTACTTCT GTACCACCAG CTGTAGTCGC
AGCCCCTGCT CACGTACCTC CACCGGCAAG CTCGTTGCCC ACACCGCCTC CCGATCAACC
GAAAAAAATC GTCTTGTCGA AACACGATTT GGAGGATCCA AAAAAGGTGG AACCTCAAAA
CGCGGCTCCA GTGATGACCA CTAAAGGCCC CAACAATATC AGCAACGCTG ACTCTAAGAA
AAAGAAAAAA GTAAAGGCAA CACCTGTCCC CCCGTCAGCT CCGGAAGTGG GGAAGGTTTC
CATTCTCAGA AGGGATGACG AGGTGAAGCC ATCACTATTG CTTGATAGCG GTGCGAATAT
ACCGCCATCT CCAACAAATC CAATCGAGGC CAGTATGGAC ACCAAATCCA TTGCAGAAGA
TATTCGAAAG GTTGTACACC ACGAAATGCG CTCAACTCTC GTTCCCGCCC TCAAGCAAGC
TGTTCAAGAA TCCTTGAACA CTTCCGTAAT CAATCCTATC CAAGCATCAA TCACTCAACT
GTCCAAGCAA GTGGTGATGA ACGACAACAT GGAATCCGCC TTATCGGGAT CAGTTGAAGA
GCCTCTTCAG GCCGCTGTTG CGAACACTAT GCGAACGGTG TTGATTCCAA CAATGGAGTC
AATCACGAAT CAGGTCTTCG TACGGGTATC TGAAAGTCTG GAACGAACGG CAGCGACTAC
ATCAACTGAT TCAAAAAAGG AACTTGAGGC TATCTCTTTA CAGCTCACGA CAATGACAGC
TCTGGTTGCC GAGCTTACAA ACGAAGTGCA AAGTCTACGC AAACTGGTTC GATCTAACCA
AGCGCCTGTA CCACCAGCAC CTACGGCCCC GAGTCTACCT CCGATTAACC CCGTGGAGGC
ACTGCGCAAG GAAATTGCCG CACTCATACA ACAGCGGCAG TACGAAGCCG CGTTCACGAA
GGCTGTGTCA TCAAGTACAG CCGAATTGGC CGTTTTCGCT TGTACAAATT CGAATCTGAC
TTCTGTGTTG GGCAGTGCAC GAGTAGAACT GAGTCAGCGC ATTTTAATTT GTCTGATGCA
GCAGCTCAGT ACTGTCTTAA ATTGGCGTGA TGCAAGCTTA AATGTACCAC TCATCCTTGA
ATGGCTCCAA GAAATTGCCT TGTCATTAGA TCCCAACGAT GATACCATTA AACGGCACAT
TCCAACCGTT TTGCAACAAA TGGTGTCCAG CGTCAACAAT CGAATGTCCT TGGACGAGCC
TGTTCTTAGG CGACCTTTAC AGAAACTACT TCAGATTCTT CGGGGGATGT CTATATCATA
AGAGCGCCAA AGGATTGTAA GACCGCAATG AAAATCGAGA ATATGCATCT CTTTGTGAAT
TTTCTGAGCT CCTTCCGTGG CTTTAGCATT TTTTACGAGA ATACGTACAC TGAAGTCACG
TCATAAGTAT TGGCCTCTTC CTCAACAAGC TATCAAAAAA GCAATGATGG AGTAGCTTTA
GGGGAATTCT CTAGAAAACA GTAGGTCATG CCGTGATTGA TGGT
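
The gene sequence above is printed in space-separated blocks of 10 bases. The following is a minimal Python sketch, assuming plain A/C/G/T text, of joining such blocks into one string and computing a GC fraction. The helper names join_blocks and gc_fraction are hypothetical, and this is not necessarily how the GC content field of the record was derived.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumes an unambiguous DNA string)."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence, used only as a small example.
example = join_blocks("CAAAATGGGA TGTTTTTGTC \n")
print(len(example))               # 20
print(gc_fraction("CAAAATGGGA"))  # 0.4
```

Pasting the full gene sequence block into join_blocks would give the string whose length can be compared with the Gene Length field.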
 
Protein sequence
MSTEPFGTTP LPTCVSGSPV KVWFDVNPLL ARAQQPQTQQ VANFEALSNE ACINGCLLDV 
NQHFIVYGIK NGLIRVFQRH TVLRSLLRGH EGQNLTDMHF FQNGDVLASA ASNAQSSTVL
VWRVFGRSPE IMSEKLLEIS TPHFTIQRVV WHPFNPNQFW MLHTNAANHM VATLVETTRI
ATQPHPVEGH AVCNFHDAHI IMDGAVQISA DCASGSGASL TDLTWSNRDT RHILTCHDSG
EIVLWDLKTL SSSSATPGTV TPARLATLRM DEPVSRGLFL PHEDVLVSDN RSQEAKLTTC
FVTASDKNGT ITVWSPFESS GALPQKIQIL AVENPSPSYV LDVCSGPAPV NASPPSAFVV
MADRHSGAIL AWHLRADWND TVPKKALLKG CDYVVPFLTK FPTYSWSVVC APATNISDEE
LSDQGGLVFD VELFAYQTTA VQRLKLTSYM CLPPETSWTD PTPSVRLERL VSAQSAHVSE
IGSDDANPDV EFDEAYDLEE DDEEEEIEAP DPSSLPSPLG IGNSTPSLSN NPFANWLGTI
ASKTTTSVPP AVVAAPAHVP PPASSLPTPP PDQPKKIVLS KHDLEDPKKV EPQNAAPVMT
TKGPNNISNA DSKKKKKVKA TPVPPSAPEV GKVSILRRDD EVKPSLLLDS GANIPPSPTN
PIEASMDTKS IAEDIRKVVH HEMRSTLVPA LKQAVQESLN TSVINPIQAS ITQLSKQVVM
NDNMESALSG SVEEPLQAAV ANTMRTVLIP TMESITNQVF VRVSESLERT AATTSTDSKK
ELEAISLQLT TMTALVAELT NEVQSLRKLV RSNQAPVPPA PTAPSLPPIN PVEALRKEIA
ALIQQRQYEA AFTKAVSSST AELAVFACTN SNLTSVLGSA RVELSQRILI CLMQQLSTVL
NWRDASLNVP LILEWLQEIA LSLDPNDDTI KRHIPTVLQQ MVSSVNNRMS LDEPVLRRPL
QKLLQILRGM SIS
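
The Translation table field of this record is blank. The sketch below assumes the standard genetic code (NCBI translation table 1) and shows how the first codons after the ATG in the gene sequence map onto the start of the protein sequence. The function name translate is hypothetical, and the codon table is built from the standard TCAG ordering.

```python
BASES = "TCAG"
# Amino acids of the standard genetic code in TCAG codon order ("*" = stop).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon; trailing bases that do not fill a codon are ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Bases copied from the gene sequence, starting at the ATG that opens the reading frame.
print(translate("ATGTCTACCGAACCGTTCGGC"))  # MSTEPFG

# 973 codons plus one stop codon span fewer bases than the 3464 bp gene length.
coding_bases = 3 * (973 + 1)
print(coding_bases)  # 2922
```

Under this assumption the coding region would account for 2922 of the 3464 bp, which is consistent with the gene sequence carrying untranslated sequence on both sides of the reading frame.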