Gene PHATRDRAFT_50489 details

Gene Information

Locus tag: PHATRDRAFT_50489
Symbol:
ID: 7199325
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011698
Strand:
Start bp: 196943
End bp: 199860
Gene Length: 2918 bp
Protein Length: 387 aa
Translation table:
GC content: 46%
IMG OID:
Product: predicted protein
Protein accession: XP_002185398
Protein GI: 219130492
COG category:
COG ID:
TIGRFAM ID:
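
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (the record does not state this, but the assumption reproduces the listed gene length), and the function names `gene_length` and `coding_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 196943
END_BP = 199860
PROTEIN_LENGTH_AA = 387


def gene_length(start_bp: int, end_bp: int) -> int:
    """Span length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode the protein, optionally plus one stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


length = gene_length(START_BP, END_BP)
cds = coding_length(PROTEIN_LENGTH_AA)

print(length)        # 2918, matching the listed gene length
print(cds)           # 1164
print(length - cds)  # 1754 positions in the span that do not encode the protein
```

Under these assumptions the reported span is considerably longer than the sequence needed to encode a 387 aa protein, which is at least consistent with the record flagging the gene as spliced; the remainder could be introns, flanking sequence, or both.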


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.558278
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ACCGCACCTA CAAGTTCGCG ATTTACAGTT AACATATAAA GAAATGAGGT AGTTTTGTCA 
TCTACCACTT CAAGCCAGAC TCTTTTGGAT TGTACAATCC AGTTAGTTAG CTAGATGCAT
TGAAAAACAA ACGAGACAAA CCAGAGTTAC TCATAGCAGA CCAAGAGATC GCCCGGGACG
AAATACTGCT CTGCCTTTGG CCCAACCGGG GTCTACAAGA CCCGCACGCA TAGATATCCC
GACATTTCCC TTTCCAGCAT CAACGGCAGG CCCGGCATCA TGACCGACTG GATATACCTT
GACTTTTCCC GCACCACGTG GAACCATAGA TTTTGCATTC ATGGAGCCGG GCTCTACAGG
GATTTGATAG ACAACAGACG GTAGCGCATT ACTACCCTTT ATGAGCGGGG GTAGCAAGAT
TTCAAACTCG ACGGTAAAAA GCCGAGAACG CACCGCTTCA TCGTACGATG CTTCCAAAAA
TTCTGATGTT TTCAGTTGGT GAGCGGCTGC CGATCCTTGA CGCCAGAACG CTCTAAGTTT
CTTGCCGTCA CTCATCTTTC CGATTGCCCG CATCCCGTAT TTCCCTTTCT TTTTGTGAAG
TTTAAGTTGA ACCGGGTTAG CAAGATATTT CTGTATGTGA CGCTGTAGCT TGCGGTCGAC
TCCTTCCAAT ACTGGGGGCT CAAGTTCCCA CTCGACATCC TGCATCAAAA ACATTTCACG
AGTCATGCCT GAGAGGGCTT CATTGGTATC CAGCGCCATT TCACCACGAA AGTCTGTATC
CGATTGACCC GAAGATCGGA GAAAGCGCTG AGCGCTGGCA AAGGGATAGA TAGCGAGCAG
CAAAAAACAA CAGAGAATAA CGGGCTTCAT CGTTAGTCTT AGGCTTAACC TTGTAAATCT
GAGAATGTCG TTGTTTGGAT TTCTAGAAGT GCTGATGGAT TTCAGGTGAA ACACTATCCA
TGTGATCGAA GGCGATCTCA CAGTCGAGAT GGCGGAAATT GGGAATGGTC GCGGAGAGTT
CCATAGTAAT TTGTCGGCAA GAAATCTTAC ATCGATGAAA CTGTAAGCTT CTTCCACGCG
GAGTTGCCGC CCTGGACGAC AAAGGTTCGG GATTCAACAA ATCCTATTAT GGGAGTCCGG
CTCCATAAAA GCAATCACTT GTCCGGGTTC TCACGTCAAA TGTTCGCGCC GTACCTCGGC
TTCCTTCGAT CAACCCTGCA CAAACTCCTC TTCATTGCCG TTGAAATAAA ATTGGAACGA
CAGCTTTGGA GGGAGATTGG AGTTGTCGTC GATGGAGGAT TTTGCATTTT CTTAAACAAA
CCCTTTTGAT ATCACAATGG CCATCGCAAA GAATCCCCAC CGGAGCCCTG AAATGTTCCG
CATCCATCGC TATTCTTTGT GGTCGTTCGA GGTGGGTGCA TCTGTACAAC CACTTTCCAA
GGGTAAACGA AAGAAGAGCC GCTTCATGAT TTCCCCTACA CCTACGCATA GCAACTTCTG
GAACTCCCAT GACTTCACAT TGGATAAAAA GATTGGACAA GGATGTTTTG GGAAGATTTA
TCGGGCCAAG TACCATCGAC CTATAGAATT GGCTCCCTCC AGGGATAGAA GCCAAAGTTC
TGTCCACATA AACGCGAAAA AGTGCTCCTT TGTTGCTATC AAACAATTCT CAAAAATAAA
GCTCATGGAA TCGAAAGACC GCGGAAGCCG TTCCCATGAG CTACTTGAAC GGGAAATTGG
CATTCACAGC CAGTAAGTTT GCGAGTGGAA CGGCAACCCC AAGGAAACAT TTTCCTCACG
TTTTTTTGAC AAATGGCAAT AGATTGCAGC ACAAGCATAT TTTGTCATTT TGGGGATACT
TTGATAGCTT GTCGCATGTG TCTTTGGTTT TAGAGTACGC ACCGTATGGT GACCTTCTGA
ACTATACCAC TCGGAATTTT CCATATTCCA GAGAACTCCG TCTCAAAGCC TCTAGCCATT
TTGTCCGGCA AATTGCTTGC GCACTGGACC ACCTCCAGGC ATGTCAAATC GCCCACCGAG
ACATTAAGCC CGAAAATATT GTCGTGGTTT CGCCCCGGCA AGTGAAATTG TGTGATTTCG
GATGGGCTGT TTCTTTCCAA AAAGCTGGCT ACCAAACGAC ACTGTGCGGT ACGTCCGAAT
ACGTTCCGCC TGAAATGCTA GCCTGCAACT GTAAATACCA AGCAGCATAC GTCGACTCAT
GGGCTCTTGG AGTATTGACG TATGAACTCG TTGAAGGCGA GTCACCCTTT GTCCTAGACG
CTTCTAAATG CAAAACGAAC CTTCCAAGGC AACAAATCCA TGGAAATGTT ACAACCGAAA
TGGTTTTCGA TAAGATTCGA AACTTTCCTG GATTTTTTCC GCGGCATGGT CCATCCCAGC
TCACCAGTGT GGAAATGCGT ACCTTTGCGG ATTTCGTGAC AGGCTTGATG CAAATCAACC
CTGAAAGTCG CTGGTGCCCA GTTGATGCGC TGGAGCATTC TTTTTTATCA CTATCTTCAT
TCCTCGTGGA GGGCCGCGAG CCTCCGAATA AAGAACACTC TCGACACAGC AGCTTCTCAA
AGCCTGCGTC TTATGTCCAT TTTATTAACA ATGCGTCGGT AGGTTGACTC TCATTGTCAC
TTACAGTTCT ATATAATGAA TTGTGAGGTA CGAGTAGCAT CTATTATATT CATCTTTGTT
TTGAATCTTC TCTCGTGCGT GAACGATGAA TTTTCGGATT CGAAATCCAT GTGGTTGGAT
GATTGAAGTG AGATGAAGCA TGAGGAAGAA CGGAGAACCT CATCAACACC TCTCCTTACA
TTAATTATAG AGCGGGAAAC GTCGACACAT GCCATGCTGC CCTCGACGTG ACACTCATTG
ACGCAAAAGC AAAGAATCTG TCATGCATTA ACGATTGA
 
Protein sequence
MAIAKNPHRS PEMFRIHRYS LWSFEVGASV QPLSKGKRKK SRFMISPTPT HSNFWNSHDF 
TLDKKIGQGC FGKIYRAKYH RPIELAPSRD RSQSSVHINA KKCSFVAIKQ FSKIKLMESK
DRGSRSHELL EREIGIHSQL QHKHILSFWG YFDSLSHVSL VLEYAPYGDL LNYTTRNFPY
SRELRLKASS HFVRQIACAL DHLQACQIAH RDIKPENIVV VSPRQVKLCD FGWAVSFQKA
GYQTTLCAYV DSWALGVLTY ELVEGESPFV LDASKCKTNL PRQQIHGNVT TEMVFDKIRN
FPGFFPRHGP SQLTSVEMRT FADFVTGLMQ INPESRWCPV DALEHSFLSL SSFLVEGREP
PNKEHSRHSS FSKPASYVHF INNASVG