Gene PHATRDRAFT_38879 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_38879 
Symbol 
ID7203608 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011685 
Strand
Start bp432176 
End bp434599 
Gene Length2424 bp 
Protein Length807 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182835 
Protein GI219125118 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.192146 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCAGCA GCTGTAGCAT TTCCCGAAAT GGATCCACTA GAACTTTGTC AACATCACCC 
AAGAAGCCCT TGAAGGCGGC TTCTTCTCCT AGAAGTAGTG GCGACTTCCA AGTCTTACGA
CAAGGAATCG AAAAAGCTCG GGATACTGTA CATATATCAA GAATGGAACT TGCCTCCTGC
CTTGATCGAC TCGGGGAGCA TTATGCCCGA CATCACGAAT TCGACGAAGC CATGGATGCA
TTTACCGAAG CGCTCCACGA GAAGCGAAGC GTCCTTTCGC ATATATTACC AGAGAACTTG
TGGTCGAGTA AGAGCGCCCT TTCCCCACCG TTGGCAGTCG ATTTCGAAGA TAAAACCGGT
GGAGACTCTT TTGACAACTT GACCGACGAA ATTATCATGA CTTTGCGTAG TCTTGGAAAC
GTCCACTCTC TTCGTGGGGA ACAAGACGAG GCCATGCGGT ATTTCACAGA AATTACCAAT
CTTCGAGCAA GGAAGACGGA GAAAAAAGCT GACAGCGGCG ACCAAGCCCT CTTTTCAGGA
CTAGGAATTG ACGAAGACAA CTCGGCACAA ATGGCTGAAA TCAATGAAGA TATGAAAGCT
CTAGGCGACA TGTTTCAAAT CGTTTCGTTT CGAGACCGTG AAAATGGTTT GCTCACGCAA
AGAAGTCGGA TGACTTCGAC ACTTAGAAGC TCCGCAAAAG AGAACACTTC TTGTTCGAGC
AATAAAAGGA GAAAAAGCGA CTCTTACTGC GGGATTATGC CCGTCATCGA AAGTGAGCCC
TTCAAACGAT CTTCTTCACT TTGTCTTTAT GCTACGAATA GTGACCTTAG TGAAGCTCTC
CGGATGTATA AAGCTGTTCT TGAGTCATAT ACTGGTCCCA AGCTAGAGCA GCACAAGGAC
ATTTTTAACT CTCTTGCCTT AAGAGTCGAT CTGCTGGCGG AAACTGGTCA GCAAGACGAC
TTGGGCTCGA CAAGCAAAAA CAGATTTGAT AAGAACCTAG ACCTTGCGGT GGAGATCTAT
CAGCACACTC ACACAGCACA AGTGGAAATG ATTACGACGG AAAGGTCGGG GTCGGGGTCG
AATCCCCAGG CATGTAAGGG TATTGCCTCG ACTTTAATTC GTATGGGAGG TCTCTACTTT
AAGCTTGGAC GTCGGGTTGA AGAGTTGAGC ATGTACAAAC AGGCGAAGGA CGTTTACTGT
CGAGCATTCG GAGACAAGCA CCCTTTCGTT GCTGGGGCAA GGAAAAATAT TGGCATGGTT
ATGGCCGAAA GAGGGGAATA CGACAACGCA ATGGATCAGT TCAAAAGAGC AAAAGAAATT
TATCTCGCTG TCAATAGAGG TGACGAAATC AGCAGAAACG TTGCCAGTGC CATATCTTGT
ATGGGAAATG TCAAAAATCG AATAGGAGAA CTTGACGAGG CCCTTGAACT GTACGTGGAG
GCCCTGCGAA TCTACAAGGC AATTCAGGCC AAGCCAACGG ACAATGAGTG CGACGATGTT
TGCACTTTGG ATGTGACAGC AACACTAAAG GTGATTGGGA TGGTACATTC AAGAAAGGGG
AACCTTGATA CTGCAATGTC AGTCTTCTTG GAGGCCCTGA CTCTGCTTCG AACGTATGGG
GATAATGCTA CAGCAAGCTG TAAAGAAACG ACCTCATCTG TCTTGACTAG GATGGCCAGC
ATCTACGCGA AGAAGGGTGA GCTGGACCAT GCGATGGATC GTTACAAAGA GGCTTACGAG
ATCTCTGTCC AGAATCACGG GACGACAAGC CATCAAGAAG TCGCTGGTAT TCTGCATTAT
ATTGGTGGTA TTTTTCACAA GCGATCAAAT TTTGACGAAG CAATGAACTG CTACCAAGAG
GCTATTCGCA TCTACCATGA AACACTCGGG CCTGGAAATG CAGCTGTAGC GGGAACCCTT
GTCATGGTGG GAAGCATCCA TTACAAACGC CGAAACCTGG ACTCTGCGAA AATGTTCTAT
CGGGAAGCTC TTCGACTAAA CAGGGATGCC TACGGCTTTC ACCACCCAGA TGTGGCTCCT
ATCCTCAAAA GTATTGGCAC AATCCTCACA AAGAAAGGAG AATACCAAGA GGCATATGAC
ATGTTTAGGG ATGTACTTTC GATCAAGTGC ACGATTCATG GTACCGGTCA TCCCGAGGTC
GCTAGTGCCT ACAAAAGCCT GGGGAATGTC CACTACAAGC TCGGTGAGCT TGCAGATGCG
GAACGACAAT ATCGACATGC TCTGAATATT TTTCGACGTA CTCGCGGAGA AGACCACGCC
GATACAATTG CTGCTAAAAC AACAATTGAT CATATACGCT ACTGGATGAA GGAGCGAGGC
CAGCGAAAGC ATGAGCAACG ACAAGCTCGG AGCCGCGCCT TGTCGGAGGG ACGAGATGAG
GAAATTGATA AACGCAGTTT CTGA
 
Protein sequence
MRSSCSISRN GSTRTLSTSP KKPLKAASSP RSSGDFQVLR QGIEKARDTV HISRMELASC 
LDRLGEHYAR HHEFDEAMDA FTEALHEKRS VLSHILPENL WSSKSALSPP LAVDFEDKTG
GDSFDNLTDE IIMTLRSLGN VHSLRGEQDE AMRYFTEITN LRARKTEKKA DSGDQALFSG
LGIDEDNSAQ MAEINEDMKA LGDMFQIVSF RDRENGLLTQ RSRMTSTLRS SAKENTSCSS
NKRRKSDSYC GIMPVIESEP FKRSSSLCLY ATNSDLSEAL RMYKAVLESY TGPKLEQHKD
IFNSLALRVD LLAETGQQDD LGSTSKNRFD KNLDLAVEIY QHTHTAQVEM ITTERSGSGS
NPQACKGIAS TLIRMGGLYF KLGRRVEELS MYKQAKDVYC RAFGDKHPFV AGARKNIGMV
MAERGEYDNA MDQFKRAKEI YLAVNRGDEI SRNVASAISC MGNVKNRIGE LDEALELYVE
ALRIYKAIQA KPTDNECDDV CTLDVTATLK VIGMVHSRKG NLDTAMSVFL EALTLLRTYG
DNATASCKET TSSVLTRMAS IYAKKGELDH AMDRYKEAYE ISVQNHGTTS HQEVAGILHY
IGGIFHKRSN FDEAMNCYQE AIRIYHETLG PGNAAVAGTL VMVGSIHYKR RNLDSAKMFY
REALRLNRDA YGFHHPDVAP ILKSIGTILT KKGEYQEAYD MFRDVLSIKC TIHGTGHPEV
ASAYKSLGNV HYKLGELADA ERQYRHALNI FRRTRGEDHA DTIAAKTTID HIRYWMKERG
QRKHEQRQAR SRALSEGRDE EIDKRSF