Gene PHATRDRAFT_49084 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49084 
Symbol 
ID7195440 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011688 
Strand
Start bp541825 
End bp544732 
Gene Length2908 bp 
Protein Length818 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183626 
Protein GI219126777 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0231566 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTGTTTCATC GTTTCTCTCT CCTGCCAAGT ACGAAAAGTT TATTCCACAC TGATGCGCGA 
CGAATGTGGC ATCGCTTCTA TACCTATTGT GAAAGCGTGT TGCCATCCGA TTTGAAAGCG
TCTACGGATT TTCGCAAGTG ATCGAATCCG GTCTTTTTGC GAACCCCGTG TGCGGAAAGG
CGACCATGTC GGACCCGACT TCATCATTTC AAAATCCCTT CGGAAACATG CCGAACTCCC
CCATCTTTAC AGCTGACCAA CTGAAAGCCT TACAGCAACA GTTTCAAATG AATCTTAACA
ACAATACTGG CTTACCGCAA AACCAAACCA ACACTGATTC TGAAATGGAA GCTCAGCCAT
CATCTTCTGT TGCTGCGAAG CACAGTACTG CGAAGCACAG TACCGGTCCA TCGTCAACAT
CTAGCAGTAG TGACGCGAAC CATTCAAATC ACCACGGACA AGCGTCGTCA ACACAGCAAC
AGTCACATCA ACAACCGTTC TTTGTTATGA CACAGATACC TATTCAAGCA ACGCCATTTG
CTTACTCAAG TAGCATGTCT TCCGATACGC ACGTGGTGCA GCCGTCTCCT CAGGTGCAGC
CGTCTCCTCA GCTGCAAGCC TTACAGGAAC AGCAAAGGCA ATTTCTGCAA AATTACCAAA
GCAATCCGTC CGCCTCTCCT ATGCCGCAAC CAACTCCCCA ATTTCAAGCT CTGCAACAGC
AAGTACGCCA ACAGCAAGAA CAACAAAAGA AAGAATTTCA ACTCTACCAA TCTCAGCAGG
CTTGGGAACC TCACTTTGAG AACCGCAGCG GCAACAGCGA CAACCACCAA CATCAGCAGC
AACACACGGA GACCAACCAT GCGAAGCAAT CGCAGGCATC ACACAATAAT AGGCAGCAGC
GTCCAAAAGA GATGCAGCAG CAGTACCAAC ACCAAAATTC CGGCATTCAG CCTGAGGCGG
AAACTTCGAG CAATGTGGTG CCCAGCACAA GCTCCAATAA TACCAATTTT ATGCTGCAAC
AGCTCATGAA GAATGCCTAT GCAGCACATC AACAACAGCA GCAGCAGAAT CCGAACCAAA
TGTACCCAAC GCAATCCCAG CAGCAGACAA TACAGCAAAC TTCGGACAAC ACATTAGGTA
CCATCTTGTC GAACAGCCAT ACCAAAAATG TGGGATCAGA AAGTGATTCC TCGAACAATC
GGCGGCTCTT ACTGGGATCC ACCGTTAAAC CAGTGCAGCG GGAGGAAAGC AATATGAACG
TGGATTTCTG GAAACAGTTC TGGGATGACG AAGAAAAAAC CTCGGCGACA TCTTCCGATC
TTAACAACAC CCTGTCCATG CAAAACTCCA AGCGCACGTT CAGCGACGCA ATGGTGAGTT
CTTTGTTATG TCTAATGTTG CATATAATAT TACGCATGCG CTCACAATCT GCTTCTTTTT
TATGTAGGGT GACGGCTCTG GAACATCGAT CGGTTCTGAT GACGGCACGA ATGCCAATCC
TCTTCTCCGG CAGCGCTGCG AACCACAGGA ACAGAAGTCT ACGTTCCAGT CACAAAGCGA
GCAACGACGG TGCCCGCCAT TTCAGCAGCA GAATTCCACG GAATCACGAA ACGTCGAATC
TGACGAAGAC GATGCGATTG CACCCACCCC GTTGAGCGAA ATTCGTGCCA AACATTTTCG
CATGCAGACT TCACAAGCAT CCACTGAACC GCCTTCCCCT CAGGTTCCGC ACGCTCAGTT
GCCTAAATCA AGCAACACTG GGTCATCTTC TTCCTCTATG CAATCATTTC ATCACCAACC
CATTCAGCCA CAGTTACAAC ATCAGTACCA GCAGCCAAGG CAGCAACCTA TTCTTCCGTC
TCCATCGCTG TCGGCGCCAT TTCTGCTTCC GTCAACTATG ATGCCGCCCT CATTATTAGC
TAGCGTTTAC ACGTCCTCGA CGAAAGCAAC TTCGGAAACT GGGCAACCAA AAAAGAAGCC
GGTGTGTGCT GTCTCCTCCA AGTCGGCTAG TGCTGCGCGT ACCGAAACAC CCCAGCAAGT
ATTGGAGCGC ATTTTGACGA GTCGAGGATA CGGAAGCGAT ATTCGAATCA AAGCAGAGCA
ATCCAACTAC GATGCAATGC CATCGCCTTT GCAATTGGCG TCGTTTGGTA CGGAGCTGGT
CAAGGCCATT CACACGTCGG ATGTGGACAA GCTATCCTCC TTGCTAGCTT GTGGTCTGTC
TCCGAATCCT TGCAACCAGT TCCGAGACTC CATTGTGGAT CTGGTATGCA AACGTGGTAG
CGCGGACATA TTTCGTTGCC TGGTCGATTA CGGTTGCGAT TTGCGCGTTT GCGACGGGTT
CGGGCGTACG CCGTTGCACC ATGCTTGCTG GGGCAGCGCT TTTCACCCCG AAATTGCCAA
CAGTATTCTC CGCAACGACG CGCAGCAAAT CCTGATGGAA GACAAGCGTG GTCAAACGCC
ACTGGAGTAC GTCCGGGAAG CGCAAGCCGG CGACTGGATA GACTTTCTGG AGAGTCACAA
GGACGAGTAT TTTCCGGCTG GCGGTGCGTT GCCGGCGGCA CGAGACATTC GGGACTCTCG
ACCGAACGGA TCGTTACCGA ATCCACTCAA CGCCTTGCCT TTGGCTTTGG CGGGGGCGTT
GTCGTCGGGG CAAATCACGC CAGAGAAAGT CAGCGCCATG AGTGCCGAAG AGCGGGGTCG
CTACAATCAG CGTTAATTGT ATAAGGCGGC TCAACACGTG TGCCTCTGGT GGTGCGGAGA
ACCAACCTGT ATGCGGAAGA ATGTGGCATT GTCAAACTTT CTTTTATCAG CAGACACTTC
TTTTTTCTCT TTGCTAGAAC TAGCCGATAG TAGTATACGC ATGGCTAATA TGGATAAAAA
TGTGTGTAAT AGTATAGTTA ACTGTAAG
 
Protein sequence
MSDPTSSFQN PFGNMPNSPI FTADQLKALQ QQFQMNLNNN TGLPQNQTNT DSEMEAQPSS 
SVAAKHSTAK HSTGPSSTSS SSDANHSNHH GQASSTQQQS HQQPFFVMTQ IPIQATPFAY
SSSMSSDTHV VQPSPQVQPS PQLQALQEQQ RQFLQNYQSN PSASPMPQPT PQFQALQQQV
RQQQEQQKKE FQLYQSQQAW EPHFENRSGN SDNHQHQQQH TETNHAKQSQ ASHNNRQQRP
KEMQQQYQHQ NSGIQPEAET SSNVVPSTSS NNTNFMLQQL MKNAYAAHQQ QQQQNPNQMY
PTQSQQQTIQ QTSDNTLGTI LSNSHTKNVG SESDSSNNRR LLLGSTVKPV QREESNMNVD
FWKQFWDDEE KTSATSSDLN NTLSMQNSKR TFSDAMGDGS GTSIGSDDGT NANPLLRQRC
EPQEQKSTFQ SQSEQRRCPP FQQQNSTESR NVESDEDDAI APTPLSEIRA KHFRMQTSQA
STEPPSPQVP HAQLPKSSNT GSSSSSMQSF HHQPIQPQLQ HQYQQPRQQP ILPSPSLSAP
FLLPSTMMPP SLLASVYTSS TKATSETGQP KKKPVCAVSS KSASAARTET PQQVLERILT
SRGYGSDIRI KAEQSNYDAM PSPLQLASFG TELVKAIHTS DVDKLSSLLA CGLSPNPCNQ
FRDSIVDLVC KRGSADIFRC LVDYGCDLRV CDGFGRTPLH HACWGSAFHP EIANSILRND
AQQILMEDKR GQTPLEYVRE AQAGDWIDFL ESHKDEYFPA GGALPAARDI RDSRPNGSLP
NPLNALPLAL AGALSSGQIT PEKVSAMSAE ERGRYNQR