Gene PHATRDRAFT_47538 details

Gene Information

Locus tag: PHATRDRAFT_47538
Symbol:
ID: 7202762
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011682
Strand:
Start bp: 49938
End bp: 55136
Gene Length: 5199 bp
Protein Length: 731 aa
Translation table:
GC content: 53%
IMG OID:
Product: predicted protein
Protein accession: XP_002181834
Protein GI: 219123027
COG category:
COG ID:
TIGRFAM ID:
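
The gene length in the record can be reproduced from the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive, which is consistent with the values listed (49938, 55136 and 5199 bp) but is not stated in the record itself. The function name feature_length is hypothetical and used only for illustration.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based, inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be greater than or equal to start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
gene_length = feature_length(49938, 55136)
print(gene_length)  # 5199, matching the listed gene length
```

Because the gene is marked as spliced, the 731 aa protein is expected to be encoded by fewer bases than the 5199 bp genomic span.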


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATTGTA CGTATAGCTT GATGCTATTG ATAGTTTGCA GATAGCTTGG TACGTGAAGA 
TATGGTCCTA CCGGAAGTAG AAACCACACC GCTGCTGCAG ATTCGTTTTC TACCCCAAGC
GTCGGACGAG CTAATTCACC TGGTCCAATC TCGTTTAGCG GAAAACGGCA TTGTCGTCTT
GTCGAGTCAC AAACTCATGG GAAAATCCAC GATTTTGAGA ATAACGGCCA AGATGGAAAC
GTTGGAGATA CAGGCTGAAA AGATTCACCT GATGAAGGAA ACGGTTGGCA GTCGTCGCCG
GGTCGTCGAT TACTTTCGCC GAGAGCACCG CAGTCGGTTT TGCGATCTCA CCAAACCGCC
CCAACGTGAT GCACAAGGTT TGTTTACGGC TGCTGAGTAT GCACTGTTAA TAAGGCATTT
GTTGGATCGC GTTCATGTAT TGAAAACAGG GCAAGTTTCA TCTCCCTTGT CCCAATTGTT
TGACACTAAT TATCGCGTAA AATATTTGGT GGACTTTGAC GACTCAAAAG AGGCTTCGAG
AGGGAGCTTG GCATTTTTAA CGGCTTCGCT CCGCCGCAAA CTTCACGAAC ATGGAATTCA
GAGTGCCTGT CTTATGCATG TTCTGATCAC GTACAATTTG GTCGATGCGG TTGTTCCAGT
GCCGGTTCCT GCCGTAAATC GTGAAATTTT TCGAAAAACG TGGTGGCCCT GGTCTCGTTT
GGATCTACCT ATTGAATTAA TCCAAGACTA CTACGGTTGG GAAATTGGTT TCTATTTTGC
CTGGATGGAA TTTTTGACTC GCTGGTTATT CTTCCCTGGT ATCCTCGGTC TGCTGGTATA
TATCATTCGC TGTTACCGGG GAGATACCAT TGACACGGAT GAATACACTC CTTTTTACGG
TCTCGTCACC TTTCTCTGGG CCGTATTGTT TTTAAGATTT TGGGAACGCC ACGAACACAG
GCTCGCCTAT CAGTGGGGTA CTTTCTCGCT GTCGCAATAC GAGCGTCAGA AGTTCTTTGC
CGTCCGGCCG GAGTTTCGGG GCTACCTACG GAAGTCCCCC GTAACGGGGG AAGTTGAGAC
ATATTATGAG CCGTTACAAA GAAGAATCAA GTATATTGGA AGTGCGCTTG TAACTTCGGT
TATGCTGGCG GTTGCCTTTT CTGTCATGAT ATTGTCTCTC AATTTACAAG GCTACATTCG
CCCGAAGTCA AACCCCACTC GTTGGACCAA AAACAGTCCA CATCCTTTCT TCATTGCTGA
TTTGGCATTC GTGTCCGAAC CGGGCCAAGT CTTCGACGCC TTGTCTCTAC GCGGCTATAT
TCCGGTAGTC GGGCACGTGA TATGTATTTT CTCTTTGAAC TTGCTGTACC GTCGAATAGC
CGAACGATTG ACAAGCTGGG AAAACCACGA AACGGAGAGT AGTCACCGTA ACTCACTCAT
TCTCAAGCGA TTCCTGTTTG AGGCCTTCGA CTGCTACGTC GCACTCTTCT ATTTGGCGTT
CTATGAGCGG GACGTTGAGC GCCTCCGTCT AGAGCTGATT GCTGTTTTCC AGATTGATAC
AATCCGGCGC GTTTTGCTAG AGTGCGTTAT CCCCATACTG ATCCAGCGAT TCAATGCGGC
GCATCACTTG AAACGAAAGC TGAATCCTAT GCAGTCCCTC CTGGTGATTC CCACCCACGA
CATATTAATG GACGAACTCG ATAAGGATAC CTACGATCAG TTTGATGATT ATATGGAGAT
TGTAATTCAG CTCGGATACG TCACCTTGTT TGCGTCAGCC TATCCGTTGG CATCCTTAAT
TAGCATCGCA GCTAATTGGG TGGAGATTCG TTCCGATTGT TTCAAGCTAA CCCAGGGTTG
TCAACGACCA GCTGTCTTTC GATCTTCTGG TTCGGGTATG TGGAAAACCT TGGCATCTTG
TATCATTTGG ACGAGCGCCT TGACGAATTG TCTCATCGCG GGTTTCACAT CTGACCAGTT
GGTGCACTAC CTGCCTTCAT TTTACGTTCA TGTGCAGGAG GGCTATACAG ATATGGGTCA
CGAAAAGGGT TGGTTGCTTG TGTTTTTGAT TTTTGGACTC GAGCGGATCC TGGTTTTAAC
CGGCTTGCTC GTGTATGCGA TTGTGCCAGC CGTACCAGAA GATGTAGTCG ATGAGCTAGA
GCGGCGTCAG TACATTCGAT CGCAGCAGGA AGCGTGGGAG CATTCACCCG AGAACAAAAA
GAACGACTAA GAAATTTGAA GTAATCTTTT TATTCTCTTA AACAACGAGA TCGATTGGCT
GTGAATTGCA CGTTTGATAA TATATTAGAG CGGGCGGAGG TGTCTAAAGT TTACCAGGAA
TCCGTATCGC CGTAGTTGAT GACGATGCAG TGTCTTCTGC AGTCGTTGAT ACTGCTCCTA
CCGCAGGTTG TTGTACTAGC CCGTACAAGA GCACGGAACC AGCTTCTCCC CGTTCAATAG
CAATGGATCC GACAGCCAAC GTGAAATTAC TGGCCATGGC AATTGCGACA CCGAAATCGT
CACCAGGCCT TTCTCCGGTC CAAGATACAG AATGCACTTG CCATAATGAA GAAACATCAG
TGGTGTTTAT TGTGCTGGTG CTGGGTGGGT ATCGGTACAA GCGAACCTCC CCAACTTTGC
TGTTGGCACC GGGAACTCCC ACTGCCAACA AACCTTGACT TAAAGACACA GCAGAACCAG
CTTCATCGTT GGCCGTGGGC GTTTCCTGGA CAATAGGATC ACCTAAAGCG TTCCAACCAT
TGGAGGCTTG GTTATACTCG TACACTACCG TCATGCCAGC ATTACGTTGA CCGTTGACGG
TCTTCCACGG AATGCCCACG GCAACACGGA ACGTCGCTGT ATCTCCGCCG ATGTCTCCGA
CGGTGGTAAG GGCGATGGAA GCGCCGTAGC GATTGTCGGA AGAAGAGGGT ACAAAATCGC
TGTTGACCAT GTCGAGGGGG CCGCCAAGTA CTGTCCAGGT TTGCGTGGAG GTGTCCCATT
GCCAGGCCCG GACATAGCCA GTGCTACCGG TGTTCCGAGG GGCACTAGCA ATCACTACGC
TTCCATCCGG CGAAAGATCA ACAGCAGAGC CCAACCAATC CAGATCTCCT GTCCCGGTCA
AAGGCGCACC GCTTCCCATC ATTTGCCATA CATTGCCGGA CGGGCTGTAT TCGTACACCC
CGACGTGTCC GGCGAACCGA AATTCCGGTG TAGAAAAGTA AGGTGCACCA ATTGCCACAC
GAAGACCGTT TTGCGACAAA GCTACACCCA CACCAGCGTA AGACATGGAA GAGAGTCCTA
TGATGCGGTT TCCCCTTCGC GCGTACTGGC CCGTCGCTTC GTTATAAAGG TATATTTGGA
TACCGCCTGC TTGCTCGCCA CCCGAGTCGT CGTTCGGTTC CGAAACAGCC AAAATGGATC
CATCCGCATT GAGCGCCACG GCTGAACCCA GGGCATCCTG GGCCGCGTCT CCCCGAAGAG
CGCTACCTCT TGGTATCCAC TGGTCCTGAA CTCGTTGGTA TACCTGTACA AATCCAGCAC
GAGCCAACAC GCCGGCCGCA CTGTCACTGT TGGCCACGTC CGGAGCGCCA ACAGCCACAA
CTTTACCGTC ACGCGACAAA GCCAATGCTT GTCCAAACAT ACCGTTCGGT GCCGGACCCG
TCAGTCTACC CAAGGGTGAC AATTGCAAAT TTTCTAAAGC TTGCGTAGGC GTTGCGGTGG
GAGCAGACGT CGTGGAAAGC GCGGAGGGTG CGATGGAGGG CTGTGCCGTA GACGAGACAG
TCGTGGGACG AACAGTCGGC CTGTTGGTTG GGGCGCCAGT ATCCGACGCT GCTGTAGGAG
TATCGTCTGT ACCGGCAGCA ACGTCGATCA CAGAAGGGGC TTGGGTCCCT CGCACGTACC
GAGTTGGCGC TGCGGTGGTG CCAGCCGCCA ACGCAATCGC TGGAGACGCC GTGGGTTCCG
TTGTCTGTAT ATTGGACCAA GAGTCGCCAC TAAGATTCGT TTGCGACGCA TCATCTCCCG
GGTCATTACT GTTTCTGGTG AATCCCGTCG TGACCGAAAC AATCATGGTC GCGATGGCCA
GGAGGAGTAT CAAGAGCAAG GCGTACGCTA ACTTTCCTTG TTTAGAGCGA AGGTTGAAGA
AACCGTAACA GCACGCGTAT GAGTTGGTGC TCTGTCGGTG CGACGTCCCG AAGCCTTCGG
CCGCGCTGTC GTCGCTCCCG CGGGCAGTGG GGCTAACTTC GTGAATTGAG CCCAATGCAC
CGAAATTGTA GTTGGAAGTG GCCTCGTCGC TGTCGTAAAT GGTAGCGTGA TAGGGTGGAT
GTGGGTGCGA TCGCATTTTG CGACTATTGT TGCTAATGAC GGGAGTATGC TCGGAAGCGG
CGTTGGAAAC CGTGGAAGCC CCATCGTCTA GAGTCGGTAA AGTGTCGTTG TCGTCGTCGG
TGTCATCGCC ATTGCAGGGT GCCGATTCGA CTTCGAAAGA CAGTTTGTGC GTATCGGGTA
TGACGGCTTG GGCCGCCGCA TCCCCGTAGG CATCGACATC CACGGGTCCC GTGATGCCAC
CATCCGCCTC AGAGCTGTTG AGGACAGCGT TAATGATAAA CTGCTCGTGT GGACTGGTGC
CAACGTCGGC GGTATGGTCG TACTCGGGAC TGTGTATACC GACATCTCGT TCGTAGGGTA
TCCGCTCGCG ACCTTCCTGC CCGTGACGCT GTTGTTGCTG ACGATAGCGA GCCAGCCACC
AAGCGGAGGC GATTCGTTCG TGTTGTTGCG ATTGCCGCGG TGACGTGGAG AAGTGTGCTG
TTGGTATCGG TCATGCTGCG GATGCGGCTA GTCGGCGAAG CGTTACCCGT GTCCGCGGAC
GACTCCACGC TGGTGTTTCC GTGCGGAGCG TGTCGGTCAC CATCCCTTTC GTCGTCGTCG
TCTCCGCTAC TGCCGCTCTC CACGCGCTGT CGGTAGAGAT CTCCGGGATG ACCTCCGGTG
CGGAAGAAGG GATTGGACGG TGTAAAGGAG GAAGTGGTGG TGTTGGCGGC GGCGGCGGTT
GTTGGAGGCG TCGCGGTAGC CAGAGGAAGA GGTGCAGGAG TCGTATCCGC GGTCGCGGCA
ACTCCGCGAT CTTCCTGTGG ACGGGCACGG CGCGAACGCA GGATCGAGGG ACGGCGAGGT
GGTGCCATGG TGCCAAAATG TTGCAATGGG ATACAGCAA
 
Protein sequence
MDYSLVREDM VLPEVETTPL LQIRFLPQAS DELIHLVQSR LAENGIVVLS SHKLMGKSTI 
LRITAKMETL EIQAEKIHLM KETVGSRRRV VDYFRREHRS RFCDLTKPPQ RDAQGLFTAA
EYALLIRHLL DRVHVLKTGQ VSSPLSQLFD TNYRVKYLVD FDDSKEASRG SLAFLTASLR
RKLHEHGIQS ACLMHVLITY NLVDAVVPVP VPAVNREIFR KTWWPWSRLD LPIELIQDYY
GWEIGFYFAW MEFLTRWLFF PGILGLLVYI IRCYRGDTID TDEYTPFYGL VTFLWAVLFL
RFWERHEHRL AYQWGTFSLS QYERQKFFAV RPEFRGYLRK SPVTGEVETY YEPLQRRIKY
IGSALVTSVM LAVAFSVMIL SLNLQGYIRP KSNPTRWTKN SPHPFFIADL AFVSEPGQVF
DALSLRGYIP VVGHVICIFS LNLLYRRIAE RLTSWENHET ESSHRNSLIL KRFLFEAFDC
YVALFYLAFY ERDVERLRLE LIAVFQIDTI RRVLLECVIP ILIQRFNAAH HLKRKLNPMQ
SLLVIPTHDI LMDELDKDTY DQFDDYMEIV IQLGYVTLFA SAYPLASLIS IAANWVEIRS
DCFKLTQGCQ RPAVFRSSGS GMWKTLASCI IWTSALTNCL IAGFTSDQLV HYLPSFYVHV
QEGYTDMGHE KGWLLVFLIF GLERILVLTG LLVYAIVPAV PEDVVDELER RQYIRSQQEA
WEHSPENKKN D
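
Both sequences are printed in space-separated blocks of 10 characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that step. The helper names join_blocks and gc_fraction are hypothetical, and the example uses only the first line of the gene sequence rather than the full record.

```python
def join_blocks(lines):
    """Concatenate sequence lines, dropping the whitespace between blocks."""
    return "".join("".join(line.split()) for line in lines)


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("sequence is empty")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence as printed in the record.
first_line = ["ATGGATTGTA CGTATAGCTT GATGCTATTG ATAGTTTGCA GATAGCTTGG TACGTGAAGA"]
seq = join_blocks(first_line)
print(len(seq))  # 60 bases: six blocks of 10
print(round(gc_fraction(seq), 2))
```

Applied to all lines of the gene sequence, len(join_blocks(...)) can be compared with the listed gene length, and gc_fraction with the listed GC content. The same join_blocks call works for the protein sequence.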