Gene PHATRDRAFT_50470 details

Gene Information

Locus tag: PHATRDRAFT_50470
Symbol:
ID: 7199316
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011698
Strand:
Start bp: 151581
End bp: 153313
Gene Length: 1733 bp
Protein Length: 474 aa
Translation table:
GC content: 52%
IMG OID:
Product: predicted protein
Protein accession: XP_002185388
Protein GI: 219130471
COG category:
COG ID:
TIGRFAM ID:
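
The reported gene length is consistent with the start and end coordinates if those are read as 1-based and inclusive (an assumption; the page does not state the convention). A minimal sketch of that arithmetic, using the hypothetical helper name `gene_length`:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


# Coordinates as listed in the record.
length = gene_length(151581, 153313)
print(length)  # 1733

# A 474 aa protein needs 474 codons plus one stop codon.
coding_bases = 474 * 3 + 3
print(coding_bases)           # 1425
print(length - coding_bases)  # 308 bases outside the coding frame
```

Under the same assumption, the 308 bases not accounted for by the coding frame would be flanking sequence included in the gene record.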


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value: 0.904967
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CAAGAAAGGA ATGGCACCCC ACGGCCCTTT CTCCTGTACT ACTCTCTAGC TAGAAACGTC 
CCCAACACTG CTTCGTATAT ACCAAAGTTC GCTACAGTTA CAGGCGTTTG CCACAGCATG
GTAAAGGCAA AAAAGAGTAA AAAATCCAAG CCCCGAAAAG AGGCGGCTTC CAACAACGGT
ACAGTGGTCT ACACTCCCAG TAGCAGTCAC GTCTCCGCTG TATCCGGTTC CTACAAGGGC
ACCAAGGAAG ACCTCGATGC ACTCGATTGG TTGCGGAACG TCGAAAACCG CCGGAGTCAG
GGCGAGCCAG TTTTGCAGCT CGAGTATCAG GAAGACGATG TGCTGATGAC GACACTGCAG
GACTTTTTGG AAACGTCGGC GGCGACGCAC GAAGTGATTG ACGAAATATA TCAAATTTGG
CAGCAGACCG TCCTATCGAC CTTGCTTGCC AGCTCGGCTT TGTGGAAACA GGAAACCGTG
CAAGCGAATG ATCAGCGCGT ACTGAAAGAT TGTTTGAAAA CGGCGAGGTA TAAAATTAAC
TTTGCACTGC ATCACGTCCT GGGAGCGATT CTGACGTCGC CGGAACCCAA ATGGAAACGC
CTACAGCCCT ACGTGATTGA TGTGTGTTGG CAAACGCTGG CGTTGTTACC ACGACTGGCC
GAGTCGGTAC CACATCATCC CACGGATCTG AGATCTAACA TTCTGCACTT TGAGGGCAAA
GAATTCCTAG CATCGGTCGC CTGTGAAACC TGGGACACAT TGTACGACTT CGACATTTCC
TTAGATCCGA CCCGATTCGT ACCCATTCTA CGTAAACTCG CCATGTTGGA CATTTTAGAC
AAGGACTGGA ACATGCTCGG TGGGTGGGAG GACATACTGA AAGGGTTGGA GGCGAAATGT
TTCCACGGAT CGGTGCTGCG GTTACCCTGC GATAGTGTTC TGCACGAGCA CACGACGTTG
TTGCGGAACG CCCGCAACTC AATGCAGTGC AAATTCTTTT CCGTCCGCTT CATGCGGTAT
ACCGAAGACC AGGCACCACA GCGGTGGCGA TACTTTCCCA AATGCGCCGC CCCGGCCTGT
GCGCACGTGG AAACGCCCGA ATCGCCCCAT CCGCACCGGT GTGAAAGCTG TTGGTATTTT
CATTACTGCA GTCCGGCCTG TCAGGAATAC TGCGACGTGG TCCTGGGTCT GCATCCAAAA
TTCTGCCGTG ATACACCGGC TAATAAGGCG GCATCGTGTC AGCGTGAGAC CGAGGCGTAT
TTGGGATGGA GCGATCCCCA ATCCGGACAA CCACTGGTGT GTCACGCCTG CGGAGTGGTA
CAAGAAGAGG TCAGTGGTGC CGACAGCCTC GTGGATGCAC AGTACGCCAT CGTGAGCAAT
GGAATACCGA CATCCAGTAT GAAGAGGTGT TCCAAGTGTC AAAAGGTGTA TTATTGCAGT
CGACAGTGCC AAGAATGGGA TTGGCGTGTT GGGGGGCACA AGCGAGTTTG TCTCTTTGAG
GCTGCCCAAA AGCAACAGCA GCAAATAAAA GAGTTGAACT GAAAAGGGAA GCATGAGATC
AGCAATGTCG CAATGGCTCT TACATCAAAA ACGTTCAAAG CTTTTTACTT TTCCAAAGTC
TCGTATTTGG GAGCAAGTTG GATGCTCCTT TTGGCTATAG CCGTAGGGAG AGGCCCAGTG
AGATAGCTAG GTTGGTAGTT TCTTGTTTCT AAGATAGCCG AACGACACTC ATA
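
The gene sequence is laid out in groups of 10 bases, 60 per row, so the spaces and line breaks have to be removed before the sequence is measured. The sketch below shows one way to do that and to compute a GC fraction; the helper names `clean_sequence` and `gc_content` are illustrative and not part of the source database.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First row of the gene sequence as displayed in the record.
first_row = "CAAGAAAGGA ATGGCACCCC ACGGCCCTTT CTCCTGTACT ACTCTCTAGC TAGAAACGTC"
print(len(clean_sequence(first_row)))  # 60
print(round(gc_content(first_row), 3))
```

Passing the full 1733-base block to `gc_content` gives a value that can be compared with the GC content field of the record.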
 
Protein sequence
MVKAKKSKKS KPRKEAASNN GTVVYTPSSS HVSAVSGSYK GTKEDLDALD WLRNVENRRS 
QGEPVLQLEY QEDDVLMTTL QDFLETSAAT HEVIDEIYQI WQQTVLSTLL ASSALWKQET
VQANDQRVLK DCLKTARYKI NFALHHVLGA ILTSPEPKWK RLQPYVIDVC WQTLALLPRL
AESVPHHPTD LRSNILHFEG KEFLASVACE TWDTLYDFDI SLDPTRFVPI LRKLAMLDIL
DKDWNMLGGW EDILKGLEAK CFHGSVLRLP CDSVLHEHTT LLRNARNSMQ CKFFSVRFMR
YTEDQAPQRW RYFPKCAAPA CAHVETPESP HPHRCESCWY FHYCSPACQE YCDVVLGLHP
KFCRDTPANK AASCQRETEA YLGWSDPQSG QPLVCHACGV VQEEVSGADS LVDAQYAIVS
NGIPTSSMKR CSKCQKVYYC SRQCQEWDWR VGGHKRVCLF EAAQKQQQQI KELN
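
The protein sequence can be related back to the gene sequence by translating codons from the ATG that ends the second row of the gene sequence. The sketch below assumes the standard genetic code (the Translation table field of the record is empty, so this is an assumption) and uses a hypothetical helper named `translate`.

```python
from itertools import product

# Standard genetic code (assumed), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate from the first base, in frame, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# ATG from the end of the second row, followed by the first 30 bases of the third row.
cds_start = "ATG GTAAAGGCAA AAAAGAGTAA AAAATCCAAG"
print(translate(cds_start))  # MVKAKKSKKSK
```

The output matches the first 11 residues of the listed protein sequence.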