Gene PHATRDRAFT_48916 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_48916 
Symbol 
ID7195344 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011688 
Strand
Start bp5989 
End bp9036 
Gene Length3048 bp 
Protein Length944 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183654 
Protein GI219126835 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0642733 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAAGAC CACGAAATTT TTGTTTGTAC GGCATTCCTT TGGTTGCTGC TTCCGTGTTC 
TCCATTTTCT ATTCATTCTA CGCTTTTGAC TATCACAACG ACTTTTGCTT GGAAGCACAG
AAGGCCGTGA CGGTGCCAGC CATATCTACC AGTGAGGTAC TATCGTCACC ACGCCAAATG
GCCAAGTCAG CATCTTTCCA AGGCGAATGC CGCTCGCTAG CCAACGGTGG TCCAGTTAGT
CTCGTTTGGG CATGGAATGA TCCACCCTTG TTTCACGCCA TGAAGACCAC AATAGATGGG
CTTACATCAG GGTTGTCGGG TATTCGAGTA GTTATATACT GTGGCTCAAC CGCATGTGTT
TCGGCAGCAC ACAAGGCTGT CCTTGAACTC CCACCACAGG CGACGTTGAT GTCATCCTGT
ATCAGCATCC AGTACATAAT TGCCCCCCAG TTGGCCGAAG ACTCACCATT TGAAGAATGG
ATTGGAGATC ATGTCTTGGC AAAGCTCTTG TCGGCAATTT ACTTTGAACA GACCCTTCAA
GTTGTGATGC AGCTGACTGT GATCTGGAAA TATGGTGGTA TGGTGCTCTT GCCGGGGTTC
AACGTGGCGT TTCCCTCTAT GCTCGAACTG ACGGCTAAAG ATGGCGCCAC ACTCATGACG
GCTGCTGACA TGGGCCTAGT CAAGCCAGGA GTTGGCGGGG GTCTCTATGC AGTAGCGGCG
CCACCGAAAG ATGCAATGAT CAAGTCTCTT ATGGAGGAGA TGCTAGTAAT GTACAAATGG
CCAAATTACG TTGCATCGGA CTGGCCGGTA CGAAGCCAAT GGGACATCCT GTGTGCACGA
CAGAGTAGCT GTCTTGCTGC TCAGCGACCC CTACAAGATA TAGGAATAAG AACAGGAAAT
GAGTCTACAG TGGAGCATCA TCCTAAGAGA CACTTTGGAA CACTGAGCTA CCAGGCAAGA
AGACATGGGA CAAAGGCATA CCCTAGCATG AATCAAGGCG ACGAGATGCA AGGGTTGGCG
GGTTTGCAAT TTCTTCCGCG CCTCGACGCT TTTGTGGACA GAGACCGGCT TGACGTGGTG
AAATTGATTG ACTCCGCCGA CTTTTCCGCC ACAGGTGAAG TCACCCCTCA ATCCCCGTCT
CAACCTGCTT TGACACAAAC AACACTGTTT CTAAATGCTT GGTGGGGGAT TCCAAATTGG
GTCTGGCCGC CACCTGAAAA GCTTGAGCCT ATCTTTGTCT CAATGCATCT CAACAATAAC
AAAAACAAAG ATGACGTAAA GAGATCAAAA GGATATCTCA ACCAAAATTG GCCTATTGGA
GCACGAGACT CAAAGACGCA CGATTTTTTT CAGTCCATCG GTATCCACTC AGTTTTTACG
GCGTGCATGA CAATGACTCT TTTGCCAACT TGGAGTGAGC AACGTGCTTT GATGGAACAA
AGCGATGAAG TGTTGCTTGT AGACGTGAAT AGAGAAGGTC TTCAGCTGCT ACCAGACCAC
ATCAAGTCAA GAGCCGTGAC CCTGTCAGCA AAGCTTAAAG ACCCAAATGT TATAGATGAC
ATGGTGGCAC GATACGTTGA AGCTCATGCT ATGAAGGTTC GCTTGCAAAA AGCCAAGTTG
GTCATCACCC AGCGGCTGCA CATTGCGTTG CCAGCTGCTT CCACAGGAAC GCCCGTGATT
TTGATCATTG ACAATGATAT GCCTGGGGGT GGAGGTGATC GCTTCAGTGG TCTGCAGCAG
GCTGTACACA CTGTACACTC TACAAATGGG TCCACAGCCT TGGCTTTATT CAATTGGGAT
GATCCACCAC CCAATCCCAA CCCAATATTT TTCCGAAAGA AACGTAACGT CCTTCGCGTC
TTGACTATGT GCCATGGGGA AGTGACCGAC TCGGCACGAA AATTTGGTGC AATTCCAGCT
TCATGGGAGT ACCCGTCTGA AACCAAAGTG TGCAGGAATA CAATTGGCAA CTTGCACACA
GAAGATGCTA TTCATATTGC AACTACAATT AATCCTTTAT GGTTGGACTC CAAGCATGTT
CTCCCCAGCT GGGTTCATGC ACTGTACAAG TCAAACCCTA CGGAAACATT TGTGTTCTAT
TTCCTTACAG ACAGAATGAA TGAAAAGCAG CGATGTATAG TCCGATGGAT GGTGCTTCAA
TGGTTTCCCA ATGCAAAGGT TTACACAATA CCAATACAGC TGCCATCTGT GGACATTTCT
TCCATCCCTA TAAAACATGT TCCTACATTC TCTCAAGTCC GGCTTCTCTT GCCCCAAATG
CTTCCCTGTG TGCAGCGAAC TTTGTGGATT GATGTTGATG CCATGGTGAT AAAGCAACTT
AGACCAATTT GGGACACCTG GAAGGTCATG CCAGAATGTG GTATAGTTGC CAGGAGCTTA
TTGGCAAAGA CAGACGTGGG ATCTATGATG GCAGCCCTGA ATGTAACATC TCCCCAGCAG
CTGTGGAAGA AAGCCAGCAA AGACATGCCA GGATTCGATG CTGGAGTGAT GCTTCTAGAC
CTTGATGCAT TGCGCGCCAG CCACTTCACA GAAAAGGTGG CATCGTACTG GTCCTTTTCA
ATTGGCGGAA ATGATCAAAT TTCCTTGAAT ATGCAATGCA ATGGGACCCA TGGCAACCTT
GACTCAGTTT GGAATGTATT CATGGACTCT CCAGACGACT ATGTGCACAA CCGGACAAGA
GAGTGGAGCA TTGTTCACTT TCAAGGCTTG AACAAGCCTT GGCTTGTAAA GAGTGATTTG
TTTCATGGTA GAGTATGGGC CAAATACGCC CTTTCGCTTG TTGATGCTCT CTATGGACCA
ATCCAATTGC AGTAATTAGC AGTGCTTGTA ACTGTTTGTT GCACAAGTTC TTTCCATGTT
GCTCTTGTGG GAAAGATCTC TTATCCAAGC CGTCTGCAGA AGTTGCGTTG TTTTCCGGAA
GTACTCACAA GCACTTGGCA CATGTTGTAG AGGCAAACTC TGAAAAAATG CACTACATGA
ACAACACACA GTCTTGTCTA ATCTTGGTCT AGTTTTGCCT GCATTTTG
 
Protein sequence
MPRPRNFCLY GIPLVAASVF SIFYSFYAFD YHNDFCLEAQ KAVTVPAIST SEVLSSPRQM 
AKSASFQGEC RSLANGGPVS LVWAWNDPPL FHAMKTTIDG LTSGLSGIRV VIYCGSTACV
SAAHKAVLEL PPQATLMSSC ISIQYIIAPQ LAEDSPFEEW IGDHVLAKLL SAIYFEQTLQ
VVMQLTVIWK YGGMVLLPGF NVAFPSMLEL TAKDGATLMT AADMGLVKPG VGGGLYAVAA
PPKDAMIKSL MEEMLVMYKW PNYVASDWPV RSQWDILCAR QSSCLAAQRP LQDIGIRTGN
ESTVEHHPKR HFGTLSYQAR RHGTKAYPSM NQGDEMQGLA GLQFLPRLDA FVDRDRLDVV
KLIDSADFSA TGEVTPQSPS QPALTQTTLF LNAWWGIPNW VWPPPEKLEP IFVSMHLNNN
KNKDDVKRSK GYLNQNWPIG ARDSKTHDFF QSIGIHSVFT ACMTMTLLPT WSEQRALMEQ
SDEVLLVDVN REGLQLLPDH IKSRAVTLSA KLKDPNVIDD MVARYVEAHA MKVRLQKAKL
VITQRLHIAL PAASTGTPVI LIIDNDMPGG GGDRFSGLQQ AVHTVHSTNG STALALFNWD
DPPPNPNPIF FRKKRNVLRV LTMCHGEVTD SARKFGAIPA SWEYPSETKV CRNTIGNLHT
EDAIHIATTI NPLWLDSKHV LPSWVHALYK SNPTETFVFY FLTDRMNEKQ RCIVRWMVLQ
WFPNAKVYTI PIQLPSVDIS SIPIKHVPTF SQVRLLLPQM LPCVQRTLWI DVDAMVIKQL
RPIWDTWKVM PECGIVARSL LAKTDVGSMM AALNVTSPQQ LWKKASKDMP GFDAGVMLLD
LDALRASHFT EKVASYWSFS IGGNDQISLN MQCNGTHGNL DSVWNVFMDS PDDYVHNRTR
EWSIVHFQGL NKPWLVKSDL FHGRVWAKYA LSLVDALYGP IQLQ