Gene PHATRDRAFT_48375 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_48375 
Symbol 
ID7203640 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011685 
Strand
Start bp268501 
End bp272498 
Gene Length3998 bp 
Protein Length1169 aa 
Translation table 
GC content46% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182932 
Protein GI219125322 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCTCTA CCGAAGAATC GTCCATAGCG CTTTCATCAC GGCACGAAGC CACCGCTCTG 
CAGGATGCGA CCGACAAATC TTCTGGAAGC ATTCACAGTC CGGAAAAGCG ACTACAACTC
TTTTCTCGGG AAAGACGAGT TGTCCGTCGA TGTCGCTTTT TGGTGTACGC GGTCCTATTT
GCCACGGGCC TGATAGCGTC GTTGGGAACT TTCTTTTACA CTCGTAGTGC AGACCGCATT
GAATTCCAGG AAGACGTCGA GGAAATTTCC AGCGTACTCC ATAGAAGTGT CAGGAATAAC
CTCCGCACGA CCATAGAAGC TATTGACGCC TTCGCTTCGG ATGTGTCAAC GTTTGCCCAA
TTTTCAAGTA ACGTGACTAC TTGGCCTTTT GTCACTATAC CACGGTTTGA AGAAAAAGGT
GTGAAGCTCC AAAACGCGGT CAAAACGACC AATTTTTTAT TCCTTCCATT GGTCCCTGAA
GCTGATCGTG AAGCCTGGGA AGCGTATGCG GTAAACCATC ACCAAAAATG GTTACAAGAA
AGCTTGGACT ACCAGAGTGA GGTGGATGGG GCCACTCCGG TCAAGGCCGG GGCAATTTCA
CCTTCAATAT ACAATGAGGC GGGGAGTGAA CTGGGGACAG GACCCTACCT TCCACTATGG
CAGGTACGGA CAAATTTGAC AAAAAGGATT GGATACTGGT TCTCTTGTCC TCTCATGTAC
TTTCGTTCTT TGATTTAGAC TACGCCGCAA GCACAACCAT GGAACTCATA CAACTTTAAC
ATGCTCAATC AAGTGGAATT TGCTGAAGGG TTGCAACAAG TTTTATCTAG TCGAAAAGCG
GTCGTGGCTG GAAGTTTTGC GGCCACCGAT GAAGTTCTCG GTCTCTTTTA CGGAAACAAA
GAAAACTCGC AAGAAGGATT GCTGAGTCTG TTCTTCCAGC CAATCTATGA AAGCCACTCT
GAACCTATGG GTTTGGTTGG AGTACTTGCC ACTCTCATAG ACTGGAATCT TGTTTTTGAA
ACCGATGTTA CTTTCAATGC TGATGGTCTA ATTGCCGTGC TCGAAACCAA CTGTGGAGAA
TCATCAACCT ATATGGTGAA CGAACATTCC TTGGAATTTG TTGCAGGTGA CCTATACAAT
CACAACTACG GCCTCGTTGA TGACGAGTTA GATATCAGCT TGTTCGACTC GCTAGGACAG
CTACCGACTC CGGTCGACAG CGAAAGTTGC CAGTACAGGT TTTCCACGTA CGATTCCAAA
AAGAAGGAGA ACGGGGTTTC TTCTCAAGAT GCTGCTATGC ACAGCACATA CGTGGCCCTC
ATTTTTGTGT TCACGATGCT CTTGTTCATC TTGTATGATT ATGTCGTACA GAAACAGCAA
CAGTTGATAC TCCAACAGGC CGAAGAATCC ACGGCGCTGG TTTCGTCTCT CTTTCCTGAA
GGTGTTTGCA ATCGATTGAT GCGCACCTCT AGAGTAGCAA CAGAAAACGG AGCAAACAAA
ACTGGCTCCG CCATCAATGT ACCTCACGAC CATGACATTT TGGATCTGGA CGACTTTTCA
GACTCATGCA TCAAAGGATA CTCAAAAGAA TCTCCCAAGT GGCGCCTCAA GTCCATGATT
CGAGAGTCGA CAGCTGACTT TTCGGAGGTA ACAATCAACG AAACAAGACC TATTGCCGAC
TTTTTTCCGA ATTGCACTGT AAGTCCCTTT TGAGTCAGGC TTTTTTGCTG TACTCTTCAC
ACATTCGCTG AAATCATTTC ACTTTGTTGT TTCCAGGTTC TATTTGCCGA TATTGTTGGT
TTCACTGCTT GGAGTTCTCA GCATTCCCCT GGTCAAGTCT TTACTTTGTT ACAAAATGTA
TACGCAGCGT TCGACAAGAT CGCGCGGAAG CTTGATGTCT TTAAAGTTGA AACAATCGGA
GACAGCTACG TTGCAGTAAC TGGACTACCA GACCACCAGG AAGACCATGC GCTTATTATG
ACTGGATTTG CTTTAGAGTG CCGGAGCCGG ATGCGGGAAG TTACTGGAAA ATTGGAGCGG
GCACTTGGAC CAGAGACGGG TGATCTTCGG ATGTAAGGTA CAAGGTTGAT CTGCAAAGAT
GTTTACTTAT AACAAATATC TTATGTTTCT ATCTCCTTCT TTCCACAGGC GATTTGGAAT
GCACTCTGGA CCGGTGACAG CAGGTGTTCT CCGTGGCGAT CGAGCACGCT TCCAGCTTTT
TGGTGATACA GTCAACACCG CTTCCCGTAT GGAAAGCTCC GGAATGGCGG ATCGTCTCCA
GGTGTCCGAG TCTACCGCCG TTCTACTGCG AAGTGCGGGA AAAGTGCATT GGATCCAAGA
GCGAGCTGAG TCTATCCACA TCAAGGGAAA AGGGGATATG AAGACTTTCT GGATCAAACC
AATGGAACGT GATTTGAGTT TGCCAAGCTC AGATAGCCTA GAAATCTCAT CTCAAGCAAA
GTCGCCGCGC TCTTCAATGG AAAATTTTCG AGACGTGGAT GGCAATCACT CTAAGAGAAA
GTTGCTTATG TCCAAAAACT CAATGGTCTG GGGTGATTCA CAAGAGTTTC TGGAGTCCAT
GTCTCCGCCA ATGCTGAGAA GACACTCATC AGACACAGTT TGCCGCTTAA TTGATTGGAA
CGTTGAAATG CTTTCTGGCC TCTTGAAACA ACTTGTTGCG AAGCGAGAAT ATCTTCAGGA
AGCGCAATGA AAATGAAGAT GGTGCCGAAT TGGCACTTTT TACTGCAGCA TTTCCAGGCT
CAACCGCATT GGATGAAGTA GTGGAGATAA TCACTTTGCC CAAATTTGAC GCAAATGCAG
TTGCCAGTCT TCAAGACATT GACCCAGATT TGGTAACCTT CGAAAGCGAT GCAATTGAGA
AGCAATTGCG AGACTATGTT ACTGCAATCG CTTCAATGTA TCGTGACAAC CCTTTTCATA
GCTATGAACA TGCCTGTCAT GTACAATTGT CCATGTGCAA ATTGTTACAA AGAGTGGTTG
CACCAGATGG GATTGACTGT GATGGAGATA CTTCCGCGTT TGCCTTGAAA GCGCATACGT
ACACCTACGG CATCACTTCG GATCCCTTAA CTCAGTTTGC TGTTGTTTTC TGCGCCTTGA
TCCATGACGT GGATCACACC GGCGTATCAA ATGGTCAACT TATCAAAGAG AAGGCGCATG
TCGCTTCTTT GTATAGAAAT CAGAGCATTG CCGAGCAGAA TTCCGTGGAT CTGGCCTGGG
ATCTTCTGAT GGATGAACGA TACATGGATT TACGAAAAGT TTTGTTTAGG ACCCAGTCAG
ACTTCACACA ATTCCGTCAG CTTGTGGTGA ACGTTGTTCT AGCCACGGAT AATTTTGACA
AGGAGCTTAG TACTTTGCGA AAGAACCGAT GGAACAAGGC CTTTACTGAT GTGCAAGTGG
ACGAGTCGGC TAGCATTGCG GCAAACCGCA AAGCCACAAT TGTTATTGAG CATCTTATCC
AGGCCTCGGA TGTATCTCAT ACTATGCAGC ATTGGGATGT CTACTGCAAG TGGAATGAGC
GACTGTTCTA TGAAATGTAT GCTGCCTTCA AGTCCGGTCG GACGGACACA AATCCAGCAC
CAGGATGGTA CAAAGGCGAA CTCTGGTTTT TTGATAATTA TGTCATTCCT TTGGCCAAAA
AGCTGAAAGA TTGTGGAGTC TTTGGAGTTT GTAGCGATGA ATGCTTAAAC TACGCTCTCG
AAAATAGAAA CATGTGGGAA GCGCATGGAG AAGCAGTTGT GGCCATGATG CTAACTACTG
ACTTTTATAC GAAGTTGAAG CAAGAACGAA TGGCGCGGAT GCCTCAACGC AGGGCAAAAC
TAGCTGTCGA TTTGTAACTT TACCCGACAA GCGAGTTCAA TCGGTAACAC CGTTAAGCAC
AGCGTCACTT GCTTTACAGC ATAGCTTCTT AAATCTGCTA TAAAATGTAG TACTCTTCAC
ATTTACAAAC TTCGTTAGCC GACAAGAGCT CTAGTAAT
 
Protein sequence
MASTEESSIA LSSRHEATAL QDATDKSSGS IHSPEKRLQL FSRERRVVRR CRFLVYAVLF 
ATGLIASLGT FFYTRSADRI EFQEDVEEIS SVLHRSVRNN LRTTIEAIDA FASDVSTFAQ
FSSNVTTWPF VTIPRFEEKG VKLQNAVKTT NFLFLPLVPE ADREAWEAYA VNHHQKWLQE
SLDYQSEVDG ATPVKAGAIS PSIYNEAGSE LGTGPYLPLW QTTPQAQPWN SYNFNMLNQV
EFAEGLQQVL SSRKAVVAGS FAATDEVLGL FYGNKENSQE GLLSLFFQPI YESHSEPMGL
VGVLATLIDW NLVFETDVTF NADGLIAVLE TNCGESSTYM VNEHSLEFVA GDLYNHNYGL
VDDELDISLF DSLGQLPTPV DSESCQYRFS TYDSKKKENG VSSQDAAMHS TYVALIFVFT
MLLFILYDYV VQKQQQLILQ QAEESTALVS SLFPEGVCNR LMRTSRVATE NGANKTGSAI
NVPHDHDILD LDDFSDSCIK GYSKESPKWR LKSMIRESTA DFSEVTINET RPIADFFPNC
TVLFADIVGF TAWSSQHSPG QVFTLLQNVY AAFDKIARKL DVFKVETIGD SYVAVTGLPD
HQEDHALIMT GFALECRSRM REVTGKLERA LGPETGDLRM RFGMHSGPVT AGVLRGDRAR
FQLFGDTVNT ASRMESSGMA DRLQVSESTA VLLRSAGKVH WIQERAESIH IKGKGDMKTF
WIKPMERDLS LPSSDSLEIS SQAKSPRSSM ENFRDVDGNH SKRKLLMSKN SMVWGDSQEF
LESMKRNENE DGAELALFTA AFPGSTALDE VVEIITLPKF DANAVASLQD IDPDLVTFES
DAIEKQLRDY VTAIASMYRD NPFHSYEHAC HVQLSMCKLL QRVVAPDGID CDGDTSAFAL
KAHTYTYGIT SDPLTQFAVV FCALIHDVDH TGVSNGQLIK EKAHVASLYR NQSIAEQNSV
DLAWDLLMDE RYMDLRKVLF RTQSDFTQFR QLVVNVVLAT DNFDKELSTL RKNRWNKAFT
DVQVDESASI AANRKATIVI EHLIQASDVS HTMQHWDVYC KWNERLFYEM YAAFKSGRTD
TNPAPGWYKG ELWFFDNYVI PLAKKLKDCG VFGVCSDECL NYALENRNMW EAHGEAVVAM
MLTTDFYTKL KQERMARMPQ RRAKLAVDL