Gene PHATRDRAFT_50441 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_50441 
Symbol 
ID7199304 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011698 
Strand
Start bp70709 
End bp73130 
Gene Length2422 bp 
Protein Length776 aa 
Translation table 
GC content59% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185372 
Protein GI219130438 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.712504 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CCCGCTTTCG TTACTCTCAC AGAGATACTC CATTCTCATT GTCATTGTCA CCCACACAGT 
CCATTCAATA GTACGTACGT AGCCAGCCCC CATGGACTTT TCGTCGATTC CTCGTGCCTT
GGAATCTCTC CAACGCCGTC AGCCTCCGCT CGACGAGGAG ACGGTGTTGC GTCCCACACG
AGCCCTCGTA GCGAACCCGT CCGCGGACGT CTTTTTATCC TCCATTACGT CACTCGTCTC
GTCTACATCG TCTCGTTGGG AACCACTCGC GGTGGGTCTT TACGTGGCGA CGGAAGCCCT
CACGCAGCAC GGAAAAGCCC TCGCCGCGTC GTCGTCGTCG TCGTCAACCC CCGACGCGGC
ACCGTCCGTG TATCTGGAAG GTCCACGGGT CCCATCCACC CGCGACGAAG GAACACCTCT
GGAATCGGTC GTACCCGTCC TCGACACAAA CCAGGCTTTG GTCCTTTGTC AGACCCTTCA
CGACGTCGCT TTGCGACATT TGGAACACGA CGAGCCCCGG GTCCGTACTC TCGTCGCCAA
GGCCGTGGGA GCCTACGCCA AACTTACCGT CGAACTCGAT GATGCACTCC CGCACAATCG
TCAAGCTCTC CACGATCGAC TCGTGCAATC CATTCGAACA CATATTCAAC AAGGTAGAGA
CGAACAACCT GATCCACAAA ACGATCCACA CACCAACACT CCCGACGATG GGGACGATGA
TGGGGACCGA TCCCCAAAGT ACAGTAAATC CTCCACGGGA GCTCTGGACG ACACCACGGG
ATGGCGTGCC TTGGAAACCA ATTGGCAGTG TCTCGCTTCC TTGATTCGGG CACTCGGTCC
GGCTTACGTC GTACACTTTG GGGTCCCCCA AACCGTTCTG GACGATTGTC AGTACAGTTG
TATCGAGCAC GTCAATCGCC ACGTCCGGGC GGCCGGGATT TCCGTGCTGG AACAGTGGCT
TTACGCGGCA GCAGCCGGAA GCCCTGTGCA ACAGGCCCTC TTGACGGAGT CCGACGGAGT
TCTGCGCAAA ACCTGCCGGG TTGTACTCAA ACACGGGCTC GCCGACAATT GGTCACAAGT
CCGGATGGCC GCCAGTGTCC TCTGTCGCGT TTTGTTCACC ACCCTACAGG CTCTACAGGC
ACCAGCGGAC GATTTGTATC CCGTACTCCT ACCCCGCATG TGTTTGAATC GATTCTATCT
CGCGCAAGGG GTCAAGCTCT ACAGTCACGA AACGTGGAAA CTCGTCTTTG TCGATTCCGG
CGTGTCCCTG GTGGCCGCTA ACTTGCCCGC CGTTTGCCGT TACTACGTTC AAATGTGCGA
CGCCGATAAT CACGTCGTCC GGGAAGCCGC TTGCCAGGCC GTCGCCGAAC TGGCCATCCG
CCTCGGGTCG GATCCGAACC ATCACGACGA GCTCCTGCCA CACATGGACC TACTGCTGCA
GGCCTTGCTT ATGTGTTTCC ACGACGAGTC CTGGCCCGTG CGTGACGAAG CCTGTTTGGC
GTGCGGCCTC CTCTGCAAAG CGTATCCCGA GTCCTGTCGT CCCGAACTCG GTAAACTTTG
GGAACGCTGG ACGGGACAGC TCACGGACCA GATTTGGTCC GTCCGCGAAG ACGCCGCGGT
AGCCTTGGGC GACGCCCTCG AAGCCTATGG TGCCGACTTC CTACAGGAAC TCCTAGCCCT
CGTTGACAAG CTTCTTCCGT CGGCTCGCAG CCAATCCGCC ATGACGCCGA CCGAATACAA
GGCCCGCCAA AACGACGCCG CTGCTCACAC CGACTCGCAA TTGTACAGTT GCGGGAGTTT
GGCGCCGAAG CTGCGCAAGG GCGGTGCCGG CCGCATCGGG TGTTCCTCGT GCGACGTTAA
TCGCGAAAAA TCGCCCTGGG AAGCGACAGA CGGCTGCGTC TACCTGATTC GCGAACTCGT
GGTGCGGTGT GCCTCGCCGG AGAGTCCGAC ACCCCTTGCG GACGAAATCC TGCTTCCCAT
GCTCCGGGAA CTAGCTGACG TGTGTCGGGT ACAGCACTTT CCGCAATCGG ACGATTTGCG
CACCACGTTG TGGAGAAATC TTCCTGGAAT GGCGGAAGCG CTCGGCAAAC AGCGTTTCAA
GCGGTTGTAC CTGGACGTCT TCCAGAATTT GTTGTTCTCG AGTTTGGATG CGCGGTCGTC
CTCGCAATTG TCACAGCACG CGGCGGGACA GTGTGCGGAA GAACTGGCCG ACCTGGTAGG
CCGGACTATT TTCCGGGCAC GTTTGGAAGA TGACCAGCGG GATACTTTGG ATCGCGTTTT
GCGCGAACGC GCTGCCATAC CGGCGGGGCC GGCAGACGGG GCCGTCTTTT CGCCCTTTGG
ACCGCCCGGA TTGCTCGATC ACATTCACAA GGGTACGGTG CATCCGGGTG TGGCGGGGAT
GACGCGAGGA ACGGCGCCGT GA
 
Protein sequence
MDFSSIPRAL ESLQRRQPPL DEETVLRPTR ALVANPSADV FLSSITSLVS STSSRWEPLA 
VGLYVATEAL TQHGKALAAS SSSSSTPDAA PSVYLEGPRV PSTRDEGTPL ESVVPVLDTN
QALVLCQTLH DVALRHLEHD EPRVRTLVAK AVGAYAKLTV ELDDALPHNR QALHDRLVQS
IRTHIQQGRD EQPDPQNDPH TNTPDDGDDD GDRSPKYSKS STGALDDTTG WRALETNWQC
LASLIRALGP AYVVHFGVPQ TVLDDCQYSC IEHVNRHVRA AGISVLEQWL YAAAAGSPVQ
QALLTESDGV LRKTCRVVLK HGLADNWSQV RMAASVLCRV LFTTLQALQA PADDLYPVLL
PRMCLNRFYL AQGVKLYSHE TWKLVFVDSG VSLVAANLPA VCRYYVQMCD ADNHVVREAA
CQAVAELAIR LGSDPNHHDE LLPHMDLLLQ ALLMCFHDES WPVRDEACLA CGLLCKAYPE
SCRPELGKLW ERWTGQLTDQ IWSVREDAAV ALGDALEAYG ADFLQELLAL VDKLLPSARS
QSAMTPTEYK ARQNDAAAHT DSQLYSCGSL APKLRKGGAG RIGCSSCDVN REKSPWEATD
GCVYLIRELV VRCASPESPT PLADEILLPM LRELADVCRV QHFPQSDDLR TTLWRNLPGM
AEALGKQRFK RLYLDVFQNL LFSSLDARSS SQLSQHAAGQ CAEELADLVG RTIFRARLED
DQRDTLDRVL RERAAIPAGP ADGAVFSPFG PPGLLDHIHK GTVHPGVAGM TRGTAP