Gene PHATRDRAFT_49838 details

Gene Information

Locus tag: PHATRDRAFT_49838
Symbol:
ID: 7198664
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011693
Strand:
Start bp: 41944
End bp: 43883
Gene Length: 1940 bp
Protein Length: 557 aa
Translation table:
GC content: 50%
IMG OID:
Product: predicted protein
Protein accession: XP_002184720
Protein GI: 219129068
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTCGAG GAAATTCTCG CAAGAAGATG CCTGTCGCTG CAACGACGAC GACGACACAA 
AGAAAGTGAC AAGAGAAGAA GAGCCTGAAG TAAAGTCAGA AGAGAACCGA CAGACCCCTT
GGACAAACAC CTCTTCCAGC GAGAACACCA AAAGTAAATC CGTAAAGACG CCTCCGCCTT
TATTGAACGA AAACTCTCCC CAAGAAAGCA AGTTAAATGC CAACGCCCCC CAAGAAGCTG
AGCCCGAGCC AAAAATCGCT CGTAAAGGTC AAGAGCATCT CGGCCGTTGG ACTGAGCCAG
AACATGATCG CTTTTTAGAA GGTCTCGCAA AACATGGACG TGAATGGAAG AAGGTGGCTG
CTTCCGTGCA GACTCGAACC GTCATGCAAG TGCGGACTCA CGCTCAGAAG TACTTTGCTT
TGCTGAATGC AGGCCAGACT ATGAACAAGT TTGCTACCAC AACACCACCT ACCCGCCAAG
AAGCCGCCAT CAAAGCAGAT CAGAAAGAAA AGGCCCTGAA AACTAAGATG GGCAAGGCTA
AGACTTTGGA CGGTGCCGCG GTGGCCAGAT CCGAACCTGC GGTAGCCGTT CCCAATTTGG
CGACCGCTTC TCAACCTTCG GTGTCGCAAC CCCATCCGAA CTTGCCTGGA TTGGGTAATC
CTGGATTGCC CAACGTCCAC AACTTGCTAG CCATTCAACA ACAGAAGCTG ATGTACCAAC
ATCGGGTCAT GGCTCATCAG AACGCCATGT TGCAGCAAAA GACCATACAC ACGATTCCCA
GTCGCGTGAT CGATGCGCCT GTACAGACGG CTCCTAACAC CACCGTGGCA AAGGCACCGT
CTGGTCCGCT TTTGCAGGGG AACGACATTG TCATTTTCCC CGACACATCT GTCAACCCTG
CATCTCATCG GCCCTCTAAC CAAGAATACG AAGACTTGCT ACTCATCAAT TGCTTTTACT
GGCACTCGTT ACCAGCCGGA TTTCAGTGGC CATACGTTCA GCGGTTGTAC ACCCTACTCA
AGTTGCGAGG ATTACGTATG GTTACTTTGT GGCAGGACGG AAGCGTTCAC GACTTTGTGC
ACGGTCCTTG GCAAATTGCC AAAGAAATGC GTGACGAGAC GTTCAAGCAG AAAGCTTTGG
CAAAGTATGG CTCGATGCAC GTAACGGGAA AAACGGTGGG ACTCTGCGTC GCCTACAAGG
ACAACGAGTG GAATGTGCTT GACGTTGAAA AGTATGCAAA ATGGGATTCC AATCTCGTAT
CCGGTGGTGG TTTGGTGCGT CTTCCTCAGG GTGTGACACT CGCTCCGGTC CACTACCAAA
CCGATGCCAC GTCGGAAGTC ACGGACAAGT CGACTCCACA AATGACCCCA CCACGAAAGT
CAAAGACTCC TCGACAAAAC GAGTCACTTC AACAGGTGCA CAAGGAATTT TCTAAGCGAT
CCTCTTCTTC AGCCAGTCTC GATACGGCTC CGGTGAACGC AAAGAAAGAC ACTAAACAAA
AGACTGAAAA AAATACTTCC GCTGCGAAAA AGTCTGCCGC CGCCAAAGTT AAGGTGGAGA
AGGAAAGTGA CAAGTCTCTC CTGAAGAAAG CGAAGAAAAG CAGAACTAGT GTTTCGTTCC
CGTTGAGATC CCAAGCTAAA GAGGCTAAAC CACGAAAGTC GATGTCGGCT TTGAAGAAAC
GGTCAGAATC TCCGACGGCA ACAGATCCAC CGCGCTCGGC CCGCCGTTCC AAGCGACAGC
GTTTGTAAGT GGGGAAAGGG CGGCTTTTCG CCTTGACCGT TCTAATGTGA TATGTTTGTG
ATGGGACCCT TTTGGGAAAT TGGCGGAGTG ATCTGAGCAA ACCAAAAGTT GCGGGGAAGG
GAATATACTA CGTGGTCAGT TCTGTATACG CAACGGGAGA ACGTAAAGAA AAGAATATAA
ATATGAAATG TGGGGTGTTG
 
Protein sequence
MPRGNSRKKM PVAATTTTTQ RNENTKSKSV KTPPPLLNEN SPQESKLNAN APQEAEPEPK 
IARKGQEHLG RWTEPEHDRF LEGLAKHGRE WKKVAASVQT RTVMQVRTHA QKYFALLNAG
QTMNKFATTT PPTRQEAAIK ADQKEKALKT KMGKAKTLDG AAVARSEPAV AVPNLATASQ
PSVSQPHPNL PGLGNPGLPN VHNLLAIQQQ KLMYQHRVMA HQNAMLQQKT IHTIPSRVID
APVQTAPNTT VAKAPSGPLL QGNDIVIFPD TSVNPASHRP SNQEYEDLLL INCFYWHSLP
AGFQWPYVQR LYTLLKLRGL RMVTLWQDGS VHDFVHGPWQ IAKEMRDETF KQKALAKYGS
MHVTGKTVGL CVAYKDNEWN VLDVEKYAKW DSNLVSGGGL VRLPQGVTLA PVHYQTDATS
EVTDKSTPQM TPPRKSKTPR QNESLQQVHK EFSKRSSSSA SLDTAPVNAK KDTKQKTEKN
TSAAKKSAAA KVKVEKESDK SLLKKAKKSR TSVSFPLRSQ AKEAKPRKSM SALKKRSESP
TATDPPRSAR RSKRQRL