Gene PHATRDRAFT_49523 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49523 
Symbol 
ID7195855 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011690 
Strand
Start bp503291 
End bp505362 
Gene Length2072 bp 
Protein Length584 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184265 
Protein GI219128111 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.047501 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CAACAACAAC ACTTGCGGCT CATAATCCGC TGTACTCATA TTATTCATCT CAAAAACGCC 
ATGAAGAATA CGTCGGGTTG CTTCCTTGCC AACCCGTCCA GGCGTAGTCG CCTCGTTCGT
AGCTACTCGT CGGCGTTACG ATGCGGCGTG CTGGTCTGGC TCGCCGTCGT TCGCGACGCG
GTGCTCGCGC AGGAATGTGC CACCGACGGC ACCACGTGCG ATAAGCACTC GCGTTGTCCC
GTGTGGAAAG AAGAAGGCGA GTGCATGCGG AACGCCAAGT ACATGAAAGA ACACTGCCCG
GTATCCTGTC GGGACATTCA TCAGGAAGCC GTAACTATAG ACTGCATTGA TCTCCACGAA
CGCTGTCCCG TCTGGGCTGG CTTGGGAGAA TGCAAGAAGA ACCCCATCGA TATGAACCGG
TACTGTTCCA AATCCTGTAA GCAGTGCGAA GACGACAACG ACGGAAAAAA CAACGAAAAG
GAGAAACGTG GGGACCCAGA CAACAACAGC AAGACGGCCG ACGACGACGA CGATCCCCAA
TGTCAAGATG GGGACAAAAA TTGTCCCTAT TGGGCGAAAA ATGGCGAATG CCAAACCAAC
AAGATTTGGA TGACCAGTAA GTTTAGTTCC GAGTACATGT ACTATTCGTA ATAGTACCCT
TGTATATGAT AGTCTCACCT CCCAACGAAC ACTTGTACTC GCGCGAAATA ACTCCTTCAC
CACCGACTCA CTCCTTGCCT GACGAAAAAT GTTACAGCCA ACTGTCCCAA ATCGTGTCAA
ACGTGTGAAG AAATCAAGCC CAAAACACCT CGGCGCGCCT CACAGATGAA ACCGGCCGAA
GTCCAAGAGA TTTTGCGAGC GTCGGCCTCC TTTGGCGAGC CACAAACCGC AGAAGGATCC
CAAACCTCCG ATACAGTTGA CATTGTTCGA GCTTCGATAG ACTACATGAA TAGTGAGGAC
GTCCAGCAAC TACCATCGGC GATTGTGGCG TCCTGTCGCA ATCAGCACCA TCTGTGTTCC
TTCTGGGCCT TGATCGGAGA GGTACGGCAC GTTAGCAGCT CGGCGATACG GCTGCCCCGT
TACCCTATGA AACTCAACCA CTCTATCCCT TTGTCTTTTT TTTATATATT TTTTATTCAC
ACCCCACTGT TTTTTAGTGC GACGCCAACA AATCGTACAT GCGTACCAAC TGTGCCCCCG
CCTGTCAAAC CTGCCAACTT ATCGACATCG AAAACCGCTG TCCCCGTCTC GAACACGCCG
AACCCGCTCT CGTACCGGGT GATCTTAACA AGCTCTTTGA TCGCATTGTG CGGACGGCTC
CCGGCAACCG CACCTTGACC GAGGCCGAAC GACAGGAACT AATTGATCAA AAAATGCATT
TGTACACCGC GCACGTGCAT TCTCGTCCCA GTGCGAACCC CGTGGTTGAA GTTAGTACCG
TCCTCGACAA ATCGTTGCCA CCATGGGTCA TCACTCTCGA CAACTTCTTG ACGCTCGAGG
AATGCACCGA ACTCATCAAC ATTGGACACA AGCACGGCTA CAATCGCTCC AAGGATGTTG
GGAAAGTCAA GGTGGATGGC ACCCACGAAG CGGTGCAAAG CACGCGACGT ACTTCCGAAA
ACGCCTGGTG TTCCAATCAA AGTGGCTGTC GCGACGAAGC TCTCCCGCAG CTCTTGCACG
AGCGCATGGC GACGGTCATG CGCATCCCTG CTCAGAATAG TGAAGATTTT CAGCTTTTAA
AGTACGAAAA AGGGCAGTTT TACCGAACGC ACCATGACTT CATTCAGCAC CAGACGAAAC
GGCAGTGTGG ACCGCGGATT CTTACTTTCT TTCTGTATTT AAGTGACGTG ACGGCGGGCG
GTGGGACCAA CTTTCCTGAT CTCGACATTA CCGTTGAACC CAAAGCCGGT CGCGCATTGC
TGTGGCCCAG TGTGTACGAT TCCGATCCCA TGGCCAAGGA CGGACGCATG ATGCATCAGG
CGTTGGAGGT GGAAGACGGT GTCAAGTTTG CTGCCAATGG ATGGATTCAC TTGTACGACT
ACGTGACGCC CCAAAGCATT GGTTGCACTT GA
 
Protein sequence
MKNTSGCFLA NPSRRSRLVR SYSSALRCGV LVWLAVVRDA VLAQECATDG TTCDKHSRCP 
VWKEEGECMR NAKYMKEHCP VSCRDIHQEA VTIDCIDLHE RCPVWAGLGE CKKNPIDMNR
YCSKSCKQCE DDNDGKNNEK EKRGDPDNNS KTADDDDDPQ CQDGDKNCPY WAKNGECQTN
KIWMTTNCPK SCQTCEEIKP KTPRRASQMK PAEVQEILRA SASFGEPQTA EGSQTSDTVD
IVRASIDYMN SEDVQQLPSA IVASCRNQHH LCSFWALIGE CDANKSYMRT NCAPACQTCQ
LIDIENRCPR LEHAEPALVP GDLNKLFDRI VRTAPGNRTL TEAERQELID QKMHLYTAHV
HSRPSANPVV EVSTVLDKSL PPWVITLDNF LTLEECTELI NIGHKHGYNR SKDVGKVKVD
GTHEAVQSTR RTSENAWCSN QSGCRDEALP QLLHERMATV MRIPAQNSED FQLLKYEKGQ
FYRTHHDFIQ HQTKRQCGPR ILTFFLYLSD VTAGGGTNFP DLDITVEPKA GRALLWPSVY
DSDPMAKDGR MMHQALEVED GVKFAANGWI HLYDYVTPQS IGCT