Gene PHATRDRAFT_38549 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_38549 
Symbol 
ID7203496 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011684 
Strand
Start bp366573 
End bp370388 
Gene Length3816 bp 
Protein Length1128 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182671 
Protein GI219124774 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTTCAC ACGGTCAGGA AACTGTGGTG AGCTTCGACG AGCTGTCCCA GGCAAGTTCC 
GAAAGCGAAA GCAAAAAATC TTCCAGTAAT GGCGGACGGT CGGATGATGA AGCGAGACAA
ATCTTATCCA AGCAGGAATC TGAGGACGTT TTCCGTCTCC GTGTCACAGT AATTCTGGTT
TTGTTTGCAG CCGCCATGGC AGTCTCTGCT TCTATTTTTC TCATGATCTC TACCTCAGAA
CACCAAGAGT TTGAAAACCA ATTTGAAGGT ATCGCTGAAC GAATTGTTGA TGCCTTCGAA
GGAATCCCTC AACAGAGAAT TGGCGCTGTG AGTTCTTTGG CTATAGCGGC AAGCGCGCAT
GGCGTCGATC ATCATCAACA TTGGCCGTTT GTAGCCTTGT CTTCTTTTCA GGAGCGTTCA
ACCTCGATTC GAAACCAGGC CCAGGCTCTC TTTGTCGCTA TCGCACCATT GATCAGTCAG
ACTGATCGAA GCGAGTGGGA GAGCTTTGTG TCAAGCAATG CATCCAATTG GATTGACGAA
GGCTTTCTCT ACCAAGAGCA AATGGGGTTA AGCCAATTTG CTCCATCAGG CAACGCGTCT
ACCATTATCG CTGGAGCTCC TTTAATTTCA TCATTGAATG AAAATGTATC CTACGAAAAC
AGTTTGGGAG AAGGTCCTTT TCTTCCATAC TGGCAAACAT CTCCGATTCT AGAAGAAAGC
CTGCTAAATG TTGACATTCT GAATGATCAA GAAAAGGCGG AAAGTGCTCG TACCTGTCTT
GAGACCAGAT CCGCGGTCTT TGGAAGGCTT GAAAGCGCTC CGGCTGCAAG CAATGACACT
GAATCATCCT TCTATACGCT CTTCTATTCC AAGCTCTTAA GCATCACGGA GGGGGAGTAC
GTAAATTACA AGGGTGACCC TCTGAGCACC TTGTTTGTAC CCGTTTTCAG GTCTTTGAAA
AAAAAAAAGA TGAACCAGTT GCTGTAATCG TAGCAACAAT GCACTGGGCC TCTCTTTTCA
AAAACTTGCT TCCTACAAAT ATTGAAGGCA TTCACGTCGT GTTGGAAAAT CCGTGTTACG
GGTCCTTTAC ATATGAAATC GAAGGGAGAA ATGTCAATTC TATCGGCAAT GGTGTAAGTA
GACAACAATG AGAGGGACGA AAAGGAGATC TAGAGCCTCA TCGGTGTTCT ATACGTTTTC
TTGTACATTT CACAGGACCA CCATGAAGTA GCCTTTGAAA ATACGTCGAA AAACGCGAGC
CTTTTATCTG TGGAGAATAC TGTTGACGGA ACGCTGAATG GTCTGCCGTT GTATGAAGGT
GAATGCCCGT TTAGCATAAG CGTTTACCCA ACGGCAAAGT TCAGAAGCAA CTTTATCACC
CTGACTCCAC TAGTCGTCAC TCTTGCCGTT GCCCTGATTT TCGTCTTTAC TGTCTTCATG
TTCATCATGT ACGACCGCTT TGTTGAACGA CGGCAGAAAA TTGTATTACG AAAAGCAGTT
CAAACAAGTG CAATTGTCTC TTCAATGTTT CCGAAATCAG TCCAAGATCG TCTTTTAGAA
GCCAATGAAA AAGCTAGGAC TCAGGTATCG ACTGGTGCGA ACAGCCAGAT GAACTCTTTT
TTCAATGGAG CCGAGCGCAA CAACTACGAT CAGCACGATC CTATTGCCGA TCTCTTCCCA
AACTGCACTG TCCTTTTTGC GGATATCGCT GGCTTTACCG CATGGTCGTC TTCACGTGAC
CCTGCTCAAG TCTTTGTACT CCTTCAAGCA GTGTACCAGG CATTTGATGC TATCGCAAAC
CGTCGCAAGG TTTTCAAGGT TGAAACGATC GGTGATTCGT GTAAGTAATA CCTCCCTCTT
CCCTTTGTCA TGGTCGATTC ACTAACAGAA AGCTTCCGTG AACTTAGATG TGGCTGTCGC
TGGGCTTCCT GAAGTGCAGG AGAAGCATGC GGTGGTCATG GCAAGGTTTG CTTGGGAATG
CCTTATCAAA ATGCATCAGG TTACGAAGGA TCTGGAAGTC TCATTGGGGC CTGATACCGG
AGAACTTTCC ATGTATGTCT GCATTACTTC ACGTTGCTTT CAAAGAGTTT GTCAACTTTC
ACATGTTCAG TCTTTCTGTA TATTTTTAGG CGCGTGGGTC TTCACAGCGG TGCGGTCACA
GCAGGTGTTC TTCGAGGGGA CCGAGCTCGT TTCCAGCTCT TTGGTGATAC TGTCAACACT
GCTGCCCGAA TGGAGAGGTA CGGACATTGT GTATACAAAT AATCAAATTG TCAAAGCAGA
CTCTAAATCA AATTGATCAC ACTTCAACAG CACCGGCGTG CGAGGTAAAA TTCAGATTTC
ACAGTCCACT GCAGATCTCA TTATAGCTAG CGGGAAAGCG CATTGGATCA AACAGCGCCC
GGATTCTGTC GAGGCCAAGG GCAAAGGCAC TTTAACAACT TTTTGGCTTC AACCTCGTGT
CAAGGAGGGA TCAAGCAATT GCTCAAGCGA AACAGAAGAT GCTTCGAATC CACAATTCCA
GTATGAGATA GGTGACAAAC CCCTTAGAAA GAGCTACGCA TTGGTTGCCA AGCAAGACCG
CCTTATCGAT TGGATCGTCG AGCTCCTCGC TGAGTATGCA AGAAAGATTG TACGTGCGGA
TTTGCCGTAG GCTAAGACAA ACAGTAAATC ATGTCCACTT CTCACCTAAC ATTCTCTGCT
TCCTTGCAGA TTTCAAGGCG TGGTGTTCTT CAAACCAAGC CAGACCGGTG TTTCGACCTC
TTCTACGAAA CGCCCGAAGG TAAAACATGC CTGGACGAAG CCACAGGAGC TGTATCCCTT
CCACAATTCG ATCATGAAGC GGGTTCGAAA AACGCTGACG AATCTGTTGT TCACCTTGAC
GCTAACTTTG TTCAACAGCT TCGCGAGTAC GTCTCTATTA TCGCATCCAC GTACAGAGAG
AACGCCTTTC ACAATTTTGA GCATGCCTGT CATGTCGCAA TGTCGGTAAA CAAACTCCTC
AAGAAGATCA TTTCACCCTA TTGGAGACCC GACGAAATAA ATGGAGGTTC AGGCGGCCTG
GCGTTACGCC TCCACGAGTT TACACATGGG ATCTACTCAG ACCCGCTGAC ACTGTTTGGC
ATAGTGTTTT CGGCTTTAAT CCACGACGTA GACCATCGTG GTGTCTCCAA CGTCCAGCTC
ATAAAGGAGG AACGTGAAAT GGCAGAGGTC TACCGGGGCA AGAGTGTCGC TGAACAAAAC
TCGTTGGACA TTTCTTGGGG GCTCCTGATG TCGTCACAAT TCAAGGAGCT CCGTACCTGC
CTCTTCCACA ACCGAGATGA GATGATGCGA TTCCGTCAGG TGATCGTCAA CACGGTGCTA
GCCACAGACA TTTTTGACAG GGAACTCAAC GAGCTGCGTA CGAAGCGGTG GAGAATGGCA
TTTTACGAAA GCCATCCAGA CACAACTTTT GGAAACGATC TCAAAGCCAC CATATTAATA
GAGCATATTA TCCAGGCGTC CGATGTGTCG CACACAATGC AGCACTGGCA TGTATATCGC
AAATGGAACG AGCACCTGTT CCACGAAACG TACTTGGCGT ATACGGAAGG GCGTATGGCA
TCTGATCCGT CAACGTTTTG GTACGAGGGG GAACTTCGTT TTTTCGACAG TTACATCATT
CCGCTGGCCA ACAAGTTGCG CGATATTGGC GTCTTCGGTA TGTCGAGTGA CGAATACCTC
ATCTACGCAC TCAGCAATCG TCAAGAATGG GAGCAAAAAG GCCAGGAAAC AGTGGCGGAA
ATGATGAAAA AATACTCTTC GTATCGCAAG AGGTGA
 
Protein sequence
MSSHGQETVV SFDELSQASS ESESKKSSSN GGRSDDEARQ ILSKQESEDV FRLRVTVILV 
LFAAAMAVSA SIFLMISTSE HQEFENQFEG IAERIVDAFE GIPQQRIGAV SSLAIAASAH
GVDHHQHWPF VALSSFQERS TSIRNQAQAL FVAIAPLISQ TDRSEWESFV SSNASNWIDE
GFLYQEQMGL SQFAPSGNAS TIIAGAPLIS SLNENVSYEN SLGEGPFLPY WQTSPILEES
LLNVDILNDQ EKAESARTCL ETRSAVFGRL ESAPAASNDT ESSFYTLFYS KLLSITEGEY
VFEKKKDEPV AVIVATMHWA SLFKNLLPTN IEGIHVVLEN PCYGSFTYEI EGRNVNSIGN
GDHHEVAFEN TSKNASLLSV ENTVDGTLNG LPLYEGECPF SISVYPTAKF RSNFITLTPL
VVTLAVALIF VFTVFMFIMY DRFVERRQKI VLRKAVQTSA IVSSMFPKSV QDRLLEANEK
ARTQVSTGAN SQMNSFFNGA ERNNYDQHDP IADLFPNCTV LFADIAGFTA WSSSRDPAQV
FVLLQAVYQA FDAIANRRKV FKVETIGDSY VAVAGLPEVQ EKHAVVMARF AWECLIKMHQ
VTKDLEVSLG PDTGELSMRV GLHSGAVTAG VLRGDRARFQ LFGDTVNTAA RMESTGVRGK
IQISQSTADL IIASGKAHWI KQRPDSVEAK GKGTLTTFWL QPRVKEGSSN CSSETEDASN
PQFQYEIGDK PLRKSYALVA KQDRLIDWIV ELLAEYARKI ISRRGVLQTK PDRCFDLFYE
TPEGKTCLDE ATGAVSLPQF DHEAGSKNAD ESVVHLDANF VQQLREYVSI IASTYRENAF
HNFEHACHVA MSVNKLLKKI ISPYWRPDEI NGGSGGLALR LHEFTHGIYS DPLTLFGIVF
SALIHDVDHR GVSNVQLIKE EREMAEVYRG KSVAEQNSLD ISWGLLMSSQ FKELRTCLFH
NRDEMMRFRQ VIVNTVLATD IFDRELNELR TKRWRMAFYE SHPDTTFGND LKATILIEHI
IQASDVSHTM QHWHVYRKWN EHLFHETYLA YTEGRMASDP STFWYEGELR FFDSYIIPLA
NKLRDIGVFG MSSDEYLIYA LSNRQEWEQK GQETVAEMMK KYSSYRKR