Gene PHATRDRAFT_45072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_45072 
Symbol 
ID7199934 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011674 
Strand
Start bp116957 
End bp120740 
Gene Length3784 bp 
Protein Length947 aa 
Translation table 
GC content56% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179364 
Protein GI219117139 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGTCGC CGTCCAACGA GGATCGCAGC GACCAGGACG CGTTCGTCGG TGAATGGATC 
GACAAGTACT TTGTCGAATT GGACGAAGTC TACCGCGGAC GCATCGTCTC CCAGTCCGCA
CACAGCACCA ATACCCACAC CAACAGCAAC ACCGACACAC ATACCAGTGC ACTCTGGAAG
GTTTACTATC CGGCGGATCC CACCAAGTCG TACCCCGATT CGGACGTGGA AGAGTTGGAT
TGGGACGAAG TACAAACCGG TATTGCCCTC TATCGACAGC GGAACGGCAA GGAGAAGGAG
AACGGCAAGA CCAAGACGTT CGCAACCCGA ACGAGTCCCC GTACGCCGTC GTCCAATCAA
GCAACCTCAC CCCGCACGTT CGTCTCCTCG GCTACGTCAA CCTCCATCTC AACAACTTCC
GCGTCGTCAA CCTCCTCAAC AACAACGACA TCCTCAACAA CAAGCTTACC CAAGGTACGG
AGGACTTCCC GCCGACGCAG CGTTACGCAA TCCTCGTACC GCGACGAAGC CTCCGTTGAA
AGTGCCGACG ACGAAACGTA CCGGACCGCG CGACGCAGCC CACGAACTCG GACGTCCCCA
CCGCGCGCAT TGGACGCCCA GTTGGCCGTC GAACCTCCAT CCAAGCGCTC CCGGACGGCA
GTGTCCCGGA GTACGACACG ACGCTCGTCC CCGACGTCGT CCCGTTCCAC CGCCTTGCCC
TGTGCGTTTG CGGTCCTCAT GGGAGCCCGA CCCGCTCCAC GCAAGGCTCG AACCGCGAAG
GACTCGGTCG CCAATGCCCC CTCACACCAC CCCTCCTCCG CCCCGGAACA CGGCCCGTTA
CTAGTCCCGC CAACACGGAC GCAGACGCAG ACACCAACTA CGTCCCACGC TGTCGAAGGC
AATCCCATGG CCGACGGATC GCGGTTCGTC AAGAAACCCT ACACGGCAGG GGACGACTTG
CCCGTACTGG CCCAACCGCA CGCCATGTTC GACGATCTCG TCGCCAATCT CACGGACCAC
GGCCGCAACA TACACGTTCT CTGGCCCTTG GTGGAAACCT TCCGCTCCCG CACCTTGCGC
GTCGCGACCA TGTGTTCCGG AACGGAAGCA CCCGTACTGG CCCTCGATCT GTTGCAAACC
TCGCTCCGAC ACGCCTTGCG GCGGCACGCC GCCGATGAAT TGCGGACGCG CGGCATTGAC
GTGGCCAACG TCCTGCGCAT CGAACACGTC TTTTCCTGCG AAATTGAACC ATTCAAACAG
GCCTACATTG AACGCAACTT TCGTCCGCCT CTCCTCTTTC GGGATATCCG GGAACTGGGA
CAGCCACAGG CCTACACTGC CTACGGCGCC TTGCGAGACG TCCCCAACCG ACCCGGTTGT
GTCGATGTGC TCGTGGCCGG AACGTCCTGT GTCGATTATT CCAATCTGAA CAACCAAAAG
GTGAGTCCCA ATGGGTTGTT TGTGCCGAGC GGTGCTGTGT AAGACAACGG GTGGGTCCAC
GGAGGGCTGT GTCCATGCGG TGCACTATGT GCACCGTTAT ACAACGTACG GAACCGTAGG
TTTCTGACGA ACGTCCTTTT GCTGTCGATA TCTTTGTAGA AACACATTGA CCAAAAGGGC
GAAAGTGGAC AAACATTCCA CGGAATGATG GACTGGGTTG ACCTGGCGCA ACCTCCCATT
GTCATTATCG AAAACGTCAG TGGAGCTCCC TGGGAAATCA AGGTGAAAAT GTTTGAAGAA
CGCGGGTACG CCGCCACCTT TTTGCGTATC GATACCAAAG AATACTACAT TCCCCAAACA
CGCAAACGCG GTTACTTGTT CGCGATTAAA GCTCAGACCA AGAACAAAGT AGTTGACCGT
CCCGCGCGAT GGACGGCCGC GGTCAAATCG TTGAAACGAC CAGCGTCGGC GGCGTTGGAC
GACTTCATGT TGCCCAACGA TGATCCGCGA GTTTTACGCG GCCGAGCTCG TCTCACGGCC
GAAAGCAGCG CCGGTGAAGG AGAGGGCCGC ACGCGGACCG TGGACTGGAC CAAGTGCGAA
ACCAATCATC AGCAAGCGCG CTCCATGGAG GAGTTGGGCG ATAAGCGACC TTTGACCAAC
TGGTCCGATT CCGGCAATAC GGCCATGCCG AGTTTCGGTT GGAACGAGTG GACTAACGCC
CAAGTACACC GAATCCACGA CTTGATGGAC ATCAATGCAC TACGCTTGGC AAAACTCATG
ATCGACTGCT CGCACAAAAC CATGGTGTGG AATTTGTCGC AAAATGTTCA CCGGGATACG
ATGGGTGTGG TACGTTGTTA CGTCTTCCGC TGCATTCGTA GTCGTATCTC GTTCCCGTTC
TTGTTGCTAA CCGGCCTTTC CTCTTTTTCC TACAGCTCGG TCTATCTCAG TGTCTCACAC
CAACTGGAGT GTTTTACGTG GCAAATCGCG GAGGGCCACT CGTTGGTGAG GAGCTTTTGA
TGCTACAGGG TATCCCCGCC GAAGATCTAC TGCTGACGAA AGAGTCGGAA GCGAATCTAA
AGGTAAGCCG TCGGTGGAAG TGCGTTGCCT TTTGCCGGTC CACGCTCGAT TTCACCGTTA
ACGTCTGTAC CTTTGTGCTT AGGATTTTGC CGGCAACGGA ATGAGTACAA CGGTTGTCGG
TACAGTAATG CTCTGTGCCC TGCTGCACGG TCACGACATT CTAAGAGGTG GCAACGAGCG
CGAGGCCTCG TCGGCTGTTG TACCAAGTCT GGTGCCCCGA TCCATCACAG CTCCCTCGGA
TGCGTCTATA TCGCTAGAAT TTGCACACTA CGCTAAGGAA AAACTCGAGC TTGGACCGAG
GTCTTCGCTC GGGCGCGAGG ACTGGTCGAA ACTACTGAGC GCAGCGTCGT CGTCCTCGAA
AAAGTGCATC ACCGAAGGAA AGTACGAATC TCTGCCGCCG GGAGAGCTAG TGACGTGCCA
GGAATGTGGC TTTACTTCGA GTAAGAGCAA TGCCGTTCCC CCACGAAAAT ATGAAGAGCA
TCATTTTGTG CCGAACGCAG GCAGCAATCC ACGCGTTCAT CCAGCCGATC TGCGTGAAGA
GCTTTTGAAA TTGCTGCCAA TGAGGGCGCA AGCACAGAAT TTTGATCTTG ACACTCTGTC
CCCATTGGAT ACTGTTTCGA AGAAGTTGTG GCAGGAATGG AAAGTTGCGG TCAAGGCTAC
TCTCTACGAG TCGGATGCTA CCGGGGCCAT ATTCCTCTTC AAGGAGATTA CCCGCACCCA
CATTTGGACG GCGCATTTTG GTTCCAGGAA TGGAGGACGG CTGGAGATGC GCATATCAAA
GTCCAAAATC ACTTGGTTAC TTTTTGCGAA GCCACCTTCG GAAAAGGGTG AGCTACGGGA
CATTTTGTCT CGCCCGGTGG CAAGACTGTG CGTTCGTGCA CCAAACGATT CAACGGTTCC
TGTCACGCTT CTAACAGGAT CGTGGGAGCT TTGTCTTCCG GAAACGATAG CTTGTACTAT
TCTCGTTGAA GGTAGTGGTG ATACAATTCC AAGTTGGCGC AACAGACTGG GACTCAAAGG
TGGCTTTGAA ACAGAACGTC AGTTCTCGCG GTTGAAAGTC ACTTTAGAGA GTTCGGTCCT
TCCTGATTTA AAGGCAGTCA TTGACGGCGT GTACGCTGCG CTACCTCAAT GTGGAGGCGC
CTGCGGCTCT TTGCGGAAGA AAGAAAAGAG GCGAGATACT GAGCCAGATC TATTCTTCTT
TTTGGAATCG GGTCGGAAGT CACTACCGGG CGACGACACT TACATCTTTT CCCCAACATG
TCAT
 
Protein sequence
MASPSNEDRS DQDAFVGEWI DKYFVELDEV YRGRIVSQSA HSTNTHTNSN TDTHTSALWK 
VYYPADPTKS YPDSDVEELD WDEVQTGIAL YRQRNGKEKE NGKTKTFATR TSPRTPSSNQ
ATSPRTFVSS ATSTSISTTS ASSTSSTTTT SSTTSLPKVR RTSRRRSVTQ SSYRDEASVE
SADDETYRTA RRSPRTRTSP PRALDAQLAV EPPSKRSRTA VSRSTTRRSS PTSSRSTALP
CAFAVLMGAR PAPRKARTAK DSVANAPSHH PSSAPEHGPL LVPPTRTQTQ TPTTSHAVEG
NPMADGSRFV KKPYTAGDDL PVLAQPHAMF DDLVANLTDH GRNIHVLWPL VETFRSRTLR
VATMCSGTEA PVLALDLLQT SLRHALRRHA ADELRTRGID VANVLRIEHV FSCEIEPFKQ
AYIERNFRPP LLFRDIRELG QPQAYTAYGA LRDVPNRPGC VDVLVAGTSC VDYSNLNNQK
KHIDQKGESG QTFHGMMDWV DLAQPPIVII ENVSGAPWEI KVKMFEERGY AATFLRIDTK
EYYIPQTRKR GYLFAIKAQT KNKVVDRPAR WTAAVKSLKR PASAALDDFM LPNDDPRVLR
GRARLTAESS AGEGEGRTRT VDWTKCETNH QQARSMEELG DKRPLTNWSD SGNTAMPSFG
WNEWTNAQVH RIHDLMDINA LRLAKLMIDC SHKTMVWNLS QNVHRDTMGV LGLSQCLTPT
GVFYVANRGG PLVGEELLML QGIPAEDLLL TKESEANLKD FAGNGMSTTV VGTVMLCALL
HGHDILRGGN EREASSAVVP SLVPRSITAP SDASISLEFA HYAKEKLELG PRSSLGREDW
SKLLSAASSS SKKCITEGKY ESLPPGELVT CQECGFTSKV VAGMESCGQG YSLRVGCYRG
HIPLQGDYPH PHLDGAFWFQ EWRTAGDAHI KVQNHLVTFC EATFGKG