Gene PHATRDRAFT_48502 details


Gene Information

Locus tag: PHATRDRAFT_48502
Symbol:
ID: 7194695
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011686
Strand:
Start bp: 36004
End bp: 39622
Gene Length: 3619 bp
Protein Length: 707 aa
Translation table:
GC content: 49%
IMG OID:
Product: predicted protein
Protein accession: XP_002183147
Protein GI: 219125772
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GATTTGGAAT TGCGTCTCCA TTCGTTTACA GTTAATATTA CACTCCAGTC TAGTAGCGCC 
CTACAGACGA GCCAACGGGA TGCCAACGGA AAGAAACCTT CTCGGTCGGC GAATCGCAGA
ATCTCGACTC CATGAAGATG GGACAAACCA TGGTGATGGC AAAGTAGTGG TTTACTGTTA
TTCTGAGCAT CCTGGAAGAA GACTTCACCG TAAGCGACGT AGCTAGACAC GTTTAACCAA
TCAAGTAGAT TTAGTGTAAA CAAACGACTC TTGTTCCACC ATGGGCAAAG GAAAAGCCAA
AGCGTCGAAA AGTGCCAACC GGAAAGGCAG CAAGAAACCA GCTTCCACAC CCTCCACGGC
ACGTCACACG CACGCTAAAC CGGGGATTGG TAGTCTCTTT GGTCCCATAC GCGTGCGAAG
CAAACGAAAA GGCGAGACTC TCGAGTTGAA CTTTCCCAGC AAAAAAACGC AACATCTACT
GAAAGCTGTT CATCCAGGAA AGCTGTCGTG GAAGATAGGA AAAAAATCCA AGAATGAAGA
CAGCTACAAG AAGAAGGCAG CTCGACCAAC ATTTGTACTG AATCTTCCCG AAAGTATTGG
GCACCCCCAG TCTGTACCAT GGATGAAACT TAGCAGGACA AGGAACCGCA GCTCGCCAGG
GAGTGAAGGT CTCTCCAAAT GCAGCCGAGG TCCGGGCCAC ATCATCCCAT CGGACGCTCT
TCCCGAAGAA ACCAAGCTAA AGCTCGATAA GGAATTGGAA TCTTTTGCCA ACTACGTACG
ACTCACAGAT GAAGAATGTC AAATTCGGGA CTATCTCGTG GAGCAAGTCG AGCTGATATG
TCGAGACTTA TTCGAGGAAT CGCGACATAC GCTCTTCAAT CGGAACAGCG AAGCAATGCA
AGAAGCGGTT CGGGTTCAGG TGTTTGGTTC GTTTGGAACC AAAGCCGTTT GCAGTTTTCG
AAGTGACGTT GATTTGGCCA TATGGGGCGT GGTACCTGTC CAGAAAAGAC GGCCTGCAAT
CATGTCCAAG AAAAACAGCC AAGTCACTCA TTCGGAAAAG GTGGACTGTG CCAAGCAAGA
GCGGAAACGA AAATGGCAGG AAGCGCTGGC GGCTGTCGAC GAAGCTAATA CCATGTCTCC
GGACCCGATG ACGAGGCAAA ACCACTCACT TTCGAACGAG GATGGACCAC CATTGAGTTT
CGCGCCTGAC AATGAACCGG AGCGAGCGGG AGTGATCGAT CAAGAGGAGT CCCTGTTTGT
CATTGACCGT TTCGGCGACG TCCGCAGTAT CGACAGACTA AACTTACCCA AAACCTCATC
CATCAATTCC TCTAGCTCCG TGAGTTTATC GCAAACGATC GGTGACAACA GCGGAGTGAT
CAAATCACAG GGTCATTTTT TGCAAAGTCA AAAAGCCGAG AGCCATTTCA AAGCCAATTC
CGAAAACCCC ATCGACGACT TAAACGACAA AGCTGCCAAA CCTAAAAGGG CCAAGCCCAA
ACCTGTTGAG TCTGATGATG AATCTTTTAA CGATAGTACT GATAAAATGG AAGCTTTCGT
CCCAAACCGT GCTAAAAGTG AGGATCGGCA CTTTGTGAGC TATTCGTCTG ACGATGATTC
CCACGACGGC TTCGGCGAGG AAGTGAGCGA AGGAGCAGGA AAGAGCGACC GTGACCTTGA
GGTGTCGTTT TTCTCGCAAG ATAGCACATC AAAAAGACTG GGACCGACCG GCTTGGCTCG
GCAGCATGTT ACTACTTGTT TGGGCCTCAT TCGGAGGAAG CTTCAAAAAC GGAAATTCGC
GAGGTCGACT CTTTTCATCA AGACCGCGTA CGTTGTTTTT GGCCTCCCAA AAAGACATAT
GTTACACTTC CATGGACAAC TCTTACCTCA AGTCTCCTGT CTCCGTATTA CAGAAGGGTA
CCCATTGTAA AAATGACGAC CTCTCTTGGT TTTGAAAGCG ATATTGCTAT TGGTGGCCAC
AATGGATCGG ATACCTCGCC TTACGCCGGT AGTCAAACTG AGAAATATCG AAGGTCGGTT
TGGGTCGGAA TATGCGTGGG CATGAAATTT ATATCCTTCT AACTCTACCT TTTGTACTGT
TGCAAAGCTT TGCGCCGGTT GTGCTGGCTT TAAAAGTTGT TTTGCAGCAA ACCAATCTCG
ACGAACCATT CGCGGGAGGA CTGGGAAGCT ACAAGTTGTA TGTGCTAGTA GCATACCACA
TCGAGCAGCA TCTTTTATTA GGTGGCAATG ACCGACCGAG TGAGATCTTC TTAGGATTTC
TGTTTCGCTA CGGTGCAATT TTAGGTTACA ATTCACTAGA TGGTACGATG ACGCACTTGC
AAAAGAACGT GCCGGTTGCA ACTTTTGATG CTTCCATAGC TGATTTAAGC AACGTTTTCC
TCTTGGAGCA CTGCGTTGAT CTGTTCGGTC GGTGCTGGCG CCGTCTTTGG AAGCGGACAC
GCTCGTCGTC GAAAAATATC GGATCCTTTC TCGCCGACAT TGTGGACGTC AAAGCCTTGG
CGAAGGAACG ACAGTCGCAT ATACAGCGAG CAAAGGCAAC TCTTTGCCAT GAACTTGCAA
AGAACAGTAA CTCTTTCCAT AAAGCACCGG TACGGAACTT CGTCGCACAG ACATCCAGCA
CAAGAGCACA TAATTTGTCC AGTTCAGCTA ATGATGTTCG CCATCCCGCG GAGCTCACTA
AAGCTGCGTC TTTGCCACGA GAAGCAACAG CTGCTGAACT ATTGAAAGGT TACAATGTAC
AGATTGATCA AGAGCTTCCG ACACGCCGCG AGTGAGTTTT ATTTTTGGAG CTTATGACGT
CTATTTTGCC ACTAGCTGCT TTCGAAGAAG CCCCACATAG ATTTGTTGTG AACGCTCCTT
CGGGCTTTGC TGTCTTCCGG CTTTTGAATC CGCGCAAAAT GTTGCCCCAC CCTGTAGTAT
CTCTTCTTTC CGTCAAGGTC AACGAAGAAA TGGTAGCGTC CTTGGTCGAC TTTGCGGCTT
TGCTTTTGAA GCGCATCTTT TCCAATTTTG TCACCTCGTC CCTTCGCCCG AGTTTGGTCT
CCGTACCGCA AAACACGATT CCAAAGTTCA CCCGTCCAGG TCGGCCACAT AATCCCAACG
AGGACGGCGC CACTGCTTAC TAGCAAAGGG GATATGATGA AAAGAGCTCC GATGAGGTTG
GTCCCAAGTA AAGCCACAAA GACAGCTGTT ACCCTGTTGG TATAGTCGGC AACTCTCGAT
TCCGATAAGT CCATCCGAAG CCCCTCCAGG AACTCATGCA GTTCCTTGTT GTCCCTCCCA
GTAACCGAGT TCAAAATTGA TCGAAGAGAT CTACCAATCC AATAAGCCGC GTCATACACA
ATTTGTACAA ACCTGTAGCG GGATTGGCGC CGTCTGCTGC GTAGAGTCTT TTTCAGGCTT
GATTCCTCTA CGAGCCAGAA TGTACGAAGG GCTGCTAGGG CCTTGCGCCC GATCTCATTT
TCCTTTTCCC AACGATCGAA GGCGATCTTT CCTTCTACAA ATCGAGCGTT CCATGCATCA
ACTTTCGCTT GGATGGCGAA GCGCTGGTCT AGTGATTCGT ACTGTTTGTA GTAATCGTAG
GAGAGTTGCC CGGTCTTGT
 
Protein sequence
MGKGKAKASK SANRKGSKKP ASTPSTARHT HAKPGIGSLF GPIRVRSKRK GETLELNFPS 
KKTQHLLKAV HPGKLSWKIG KKSKNEDSYK KKAARPTFVL NLPESIGHPQ SVPWMKLSRT
RNRSSPGSEG LSKCSRGPGH IIPSDALPEE TKLKLDKELE SFANYVRLTD EECQIRDYLV
EQVELICRDL FEESRHTLFN RNSEAMQEAV RVQVFGSFGT KAVCSFRSDV DLAIWGVVPV
QKRRPAIMSK KNSQVTHSEK VDCAKQERKR KWQEALAAVD EANTMSPDPM TRQNHSLSNE
DGPPLSFAPD NEPERAGVID QEESLFVIDR FGDVRSIDRL NLPKTSSINS SSSVSLSQTI
GDNSGVIKSQ GHFLQSQKAE SHFKANSENP IDDLNDKAAK PKRAKPKPVE SDDESFNDST
DKMEAFVPNR AKSEDRHFVS YSSDDDSHDG FGEEVSEGAG KSDRDLEVSF FSQDSTSKRL
GPTGLARQHV TTCLGLIRRK LQKRKFARST LFIKTADIAI GGHNGSDTSP YAGSQTEKYR
SFAPVVLALK VVLQQTNLDE PFAGGLGSYK LYVLVAYHIE QHLLLGGNDR PSEIFLGFLF
RYGAILGYNS LDGTMTHLQK NVPVATFDAS IADLSNVFLL EHCVDLFGRC WRRLWKRTRS
SSKNIGSFLA DIVDVKALAK ERQSHIQRAK ATLCHELAKN NIQHKST