Gene PHATRDRAFT_49500 details

Gene Information

Locus tag: PHATRDRAFT_49500
Symbol:
ID: 7195840
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011690
Strand:
Start bp: 424978
End bp: 428197
Gene Length: 3220 bp
Protein Length: 1011 aa
Translation table:
GC content: 51%
IMG OID:
Product: predicted protein
Protein accession: XP_002184250
Protein GI: 219128080
COG category:
COG ID:
TIGRFAM ID:
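
The Start bp, End bp and Gene Length fields agree with each other if the coordinates are read as 1-based and inclusive, which is an assumption here because the record does not state its coordinate convention. The minimal Python sketch below checks that arithmetic and shows one plausible way a GC percentage could be computed from a sequence. The helper names `gene_length` and `gc_percent` are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive start/end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; layout whitespace is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Coordinates taken from the record above.
print(gene_length(424978, 428197))  # 3220, matching the Gene Length field
```

Because the gene is spliced, the 3220 bp span includes intronic sequence, so it is longer than the 3 x 1011 coding bases implied by the protein length.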


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.268799
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
AAGGAAGCTA CTGCAGATCC TTTCGCCATG GGAATGACAA TGCTACCGCA CATTCCTGTG 
CATAACAGGA ATACGGAGTT CCTATCGTAG TCACATGGTC GACGGACGAG AGGATGGTTT
ATCTACAGGG TCCGTCGGAG CAAGCGGCAA TCGTTTGGGA TCCTTCCCTG ATCCACCAAC
GTTCCGAGAC CGCTGTATAC TGGGATGATG ACATGCTTCA ACCCTGGGTA AAGCGCCACG
CATCCCAGCC GTCACTGAAA TTTCTGTCGT ATCATGCCTG TCAGGAAATC CATTGTTGTA
TAGTGAACAT TGTTCGTCGG ACGCTTGAGG ATTTACTACA GAAAGTAGGA GACGATTGTG
CTATGGAAAG AAGCCAAAGC GAAATACATC CATTGTCGGT TCCGGTCGCC GTAGCGATTC
CGGAAGGGAT TCTACTGCCA ATAGCGATCG AAGCCGTCTG CTGCTTGAAC GAGCCTTTTT
TTATTGGAAA CCGTTCGTGC TCTGTCGTTT TGGTTCCTTT GGAACCAACC GAAGGCCGCG
AGCGACTCCG GGACATGATT TGGGATTGTC GCCCCGCCTT AATTCTCACA ACGTCCGTCT
GCGACACCGA TCGTCTCAAT AACATAGTGT CGACGGATTG CCGACCCAAT GCATCCGTGG
CGGATGAGGG TACAGCTACA TTGTCGCACC CCGCTCTTTA CCGCGCCAAA TCGATTCAGT
TTCTCAATTT GCAACAACAT ATATTGGACT CGGTCGGTGA CGGCAACGAG AAAGCACACG
CCGCTTCTAC AGATCTTCCG TCCGAAAGAG TCACGGAATC TCTGGATCGT ATTTCCCATA
TTGTTTATAC CAGTGGAAGT ACTGGGGCTC CCAAAGGCTG TGTTTCGTCA ATTCGGGCTT
TGCGTTCCTA CCTTTCTTCA AAAAATACCG TACACAACGT GCTTACCGCG TCAACAGTCT
TATTGGCCAG CACCATTTCA TTCGATCCCT GCTTTTCCGA CATTTTGGCC ACTTTTCAGA
TTGGTGCAAC CCTAGCGATT GCGCCAAGAC GCACGTTACG GGAATCGCTC ACGCACGTAT
TGCACTCGCT CCAAATTTCT CACGTCTTGT GTACACCAAC CTTGTGGAGT ACCCTGGCCT
TGACAGGGAC CCGGCCAGCT GATCTTCCCA GCCTCCGCAT GATTGCCCTG GGTGGCGAAC
CTATTCCACT AGCCATTGTT CAGGCCTGGG CTCGTGCTTT GCCGGATGAT CCCGTACACT
GTCGTTTACT GGCCACGTAT GGAGTGACGG AAGCTTGTGT ATACCAATCA GCCGGAGAAG
TATTCCGGTT GGACTGTGGA CAGTCAAAGG GGCAAGATGT TGGTTTATTG CTCCCGGGAA
TGCGCGTTTC TATTTGTGAC GAATCGATTC AGGAAAGCTT GACCGAGGTT TTGCCGGCAG
ACATTGCCGT TGGCGAGGTA GTTTTGTCCG GTAGCCAGCT TGACAGCGTT TGCTCGTATT
TGAACCGTCC CGCACTCTCG ATCTCAAAAT ATCTGAAGTC AGAACGTCAT TGGCATTACC
GTACAGGTGA TCGTGGATAC ATTGACAGCA AAACCTTACG GTTACACATC ACAGGGAGAA
TTAATGGTGA AGACGGCATG GTAAAGATCA ATGGTATTCG AATCGAATTG GGTGAAATAG
AAAACGCGTT GGTCGGCTCA ACTGCAGCCC TCGCTACAGT TTTGGACGCC ATGGTAGTTC
CTCATGTACA CTGCATAACA GCCACGGATC TCGTTGCCTA CGTTGTCTTG GGAGGAGACT
GTCGACAGGA AATGGGTGTA AAGGGCACGA TATCTTCGGA TGGAGTACTA CTCCCTCCAT
GCCCATTGAT GGTTTTGTTA CGACATCGTT GCAAACTGAA AGCAAGAATG ATTCCGGCAT
TTTTTATTAT AATTCCAAGA ACACCGCTGT CGCCGACCGG AAAACGACAT CGTGCTGGAT
TGCCGCCTCT TGAGGCTGCT GTGCCGTTCT TTTCCATACT GAGACAGGGA GTAGATGCTA
TCTCCCAGTA CCTCTCGGTG CGTACGGCAA GTCCGGTTCC ATGGTTGCGA GTCACATCGT
AGACTGTTTG AATCTACAAT ACAACCAGCA AGCCTTGCTC ACGACGGACG CATCGTTTGC
CATGCTCGGA GGAGACTCGC TAGCTGCCAC TCGCGTTGTT CGTGCGCTGT ATGCCGCACA
TCACTGCGTC CACAACAGTC GCCATCTGGG AGGAGAGTAT GGCGTAATGG AGGGACCTTT
TGATGCAGTT TACTTGATTG GCGCGGACAA TCTGGGAAGC TACGTAGACT GGCTAGATCA
AAACAAGGTG TGTCAATCTC CGAACGTGGT CACAAAGCCG AGCTGTGACG ATCCCGTCCG
GGATGCGATG CCTACTTCAT CCAACATTTC CCCACCAACA GTCCTAGAAC AGGAAGAATC
GCAACTCTAC GACGCCTTGT TTCAATCCGT TACTCAAGGA CAAGTAGCCA TTGCAATGGC
GTTGCTTTCG GTTGGCGCCG ACCCAAATCA AGGAGGACAC GATGGACGGC TGGGTAAAAT
TTCGAGTCGC AACGATCAGA AAATAATCTT TCGCTCGAGT CCGTTACATC TTGCTTGTGT
TAAGGGTATA CCGCTTTTGG TGGAAGCGTT GCTCGCTGAA GGCGCTAGAA TGAATTCACC
GGACGCTTCC GGCTTATTTC CGTTACATTT GGCGGCTGCT GGCGAAGCCA GACGCGAGCT
AGAAGGTGAA GGTCAGGCGG CGGATGATTG TCGTCGTCTG GAATGTGTCA AACTATTAAT
TGCCGCGGGG ACACCACTGT CCATGAAAGA CGGCAGCAAG CAAACGGCTA TACATTGCGC
GGCTCGAGGG GGTCATGTTG CCACATTGAG TTATGTGCTG AAAGAATGGC ACCGTCTTTA
CGGGACGGAT CCCGAAAAGT GCCACGGGGT GAACTGGCGA GATCGATGGC TGCGAACTCC
GGTGCACTGG GCAGTCTTGA ACGGCCACGT AGATGCCTTG GTTGTGCTTC TGCAGCACGG
ATGCGATTCC AATCCTCCCC AACCCAAAAT GAACAAACGG TCCAGTGCGG CCATTGAAAG
CCCCCTACAG ACGTGTGAGA GGTTGTACGG ATCGACGCCC TTGGGCGAAC GTATCAGAGA
GCTGCTACTC GCTGGTAAAC GAGGAATTTT GCGGAGGTGA
 
Protein sequence
MVYLQGPSEQ AAIVWDPSLI HQRSETAVYW DDDMLQPWVK RHASQPSLKF LSYHACQEIH 
CCIVNIVRRT LEDLLQKVGD DCAMERSQSE IHPLSVPVAV AIPEGILLPI AIEAVCCLNE
PFFIGNRSCS VVLVPLEPTE GRERLRDMIW DCRPALILTT SVCDTDRLNN IVSTDCRPNA
SVADEGTATL SHPALYRAKS IQFLNLQQHI LDSVGDGNEK AHAASTDLPS ERVTESLDRI
SHIVYTSGST GAPKGCVSSI RALRSYLSSK NTVHNVLTAS TVLLASTISF DPCFSDILAT
FQIGATLAIA PRRTLRESLT HVLHSLQISH VLCTPTLWST LALTGTRPAD LPSLRMIALG
GEPIPLAIVQ AWARALPDDP VHCRLLATYG VTEACVYQSA GEVFRLDCGQ SKGQDVGLLL
PGMRVSICDE SIQESLTEVL PADIAVGEVV LSGSQLDSVC SYLNRPALSI SKYLKSERHW
HYRTGDRGYI DSKTLRLHIT GRINGEDGMV KINGIRIELG EIENALVGST AALATVLDAM
VVPHVHCITA TDLVAYVVLG GDCRQEMGVK GTISSDGVLL PPCPLMVLLR HRCKLKARMI
PAFFIIIPRT PLSPTGKRHR AGLPPLEAAV PFFSILRQGV DAISQYLSQA LLTTDASFAM
LGGDSLAATR VVRALYAAHH CVHNSRHLGG EYGVMEGPFD AVYLIGADNL GSYVDWLDQN
KVCQSPNVVT KPSCDDPVRD AMPTSSNISP PTVLEQEESQ LYDALFQSVT QGQVAIAMAL
LSVGADPNQG GHDGRLGKIS SRNDQKIIFR SSPLHLACVK GIPLLVEALL AEGARMNSPD
ASGLFPLHLA AAGEARRELE GEGQAADDCR RLECVKLLIA AGTPLSMKDG SKQTAIHCAA
RGGHVATLSY VLKEWHRLYG TDPEKCHGVN WRDRWLRTPV HWAVLNGHVD ALVVLLQHGC
DSNPPQPKMN KRSSAAIESP LQTCERLYGS TPLGERIREL LLAGKRGILR R