Gene PHATRDRAFT_38095 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_38095 
Symbol 
ID7203039 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011683 
Strand
Start bp61044 
End bp63377 
Gene Length2334 bp 
Protein Length777 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182151 
Protein GI219123686 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.167042 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAAAGG TGGCAATGCC GTGGACGGGA GCTGCTGTTT CCAAAGAACT ATACGAATGG 
GAGAAGGAAT TTCTTGCTAA TTCTATCGAG ACATCCCCAC TTTCCGCAGC ATCCAGACGA
ACCGGGAATT ATGGACTTAC TTCCAGTATC GGCAGTGTTG CCGAGGACGA TAAAGTATAC
AAACGTGACT CGTCGGCCAT CGCATGGCTA ACATTATCTG ATCAAAGCCC GCGGCTGCGA
TCCTTCTTCC GCACACTCGT GTTTGATCGG GAGACTTGCG AGTTCACACC ACTACAATCA
CACTTCTGGT CCGGTGTACA AGGCCTACTC ATCGGTACTT TAACTTTCGT TTGGAAGAAC
AGTATTGAGT TTGGAATCGA ATTTTTCTGG GTCATTTTGC CCAAGACGCT GCGCAATTGT
GGAGTCTTCA CCGATGACAA CGGTTGGCTT CCGATATGGC ACTACACCTG GATATTTTCC
GTGCTCACGG CTACCATTTT AGGCTACTTT GCTGACTTGT ACAAGGTTCC CGGCCAGGAC
TCGTACAACG ACAGCGTCCA CCAAACTGGA TTGGTTGACT TTCGCACCGC CCTGAGCGTC
TTGGTGTTGT CCACTTGCGG ATTGTGGTCC GGTTTTAGCT TGGGGCCCGA ACTTCCGTTG
GTAATTCTGG GTGGTCAATT TGGATCATAC ATTGGCTATA CGCTCAACCA AAGTGTCCTC
CATTGTCGCG TCATGACATT GGTAGGATCG TCCGCAGCCG TGGCTGGATT CTTCAAATTA
CCACTTGCCG GAGCCTTTTT CGTGTTGGAG ATCCTGCACC GAGACGGGCT ACAATACTAT
GAAGCCCTCC GTCCTGCATT GTTCGCTTCC GTTGTGGCAG TGGAAACCAA TCGATTTTTA
GCCCATAGAA ACGAGCACGT CTTTTTCCAG TACCCCGGCA CGGAAGAAGA AATGCCAAAC
TCCCTATATC TTGGTGTAAT ATTCCTCGCT CTTTTTGGAG CTCTAGCCGT TGGGATTCCG
TACATTATTG GCGTGAATTT TTGCAAAAAG CTAATTGACT CAACCTACGA CTGGCTCGAA
GACGAATTTG GACAAGACTC TGAAAGGAAG AGCTTGAATG AGCTGCGTCG GCTCAATAAT
TTCAAAAGTA CAGAACCAGA ACCTGAAAGC GAGTATCTTT GCGGCGTATT CTCGACAGAA
GCCTTGCAAG CAGCCGGGAA AGCGGGATTG GCAGGCCTGA TACTCGGGTG GATCGCAATC
TACCTTCCTC ACACCATGTT CTGGGGCGAA GCTCAGCTGC AGACCATAAT TGACCGAGGA
GCGACACCAC TTCCTATTGT TGGTAGCGAC TACCCCAGCG GTTTGGGCTC GCAGGGGTTC
TGCATGACGG ACACCCAAAG CTCCAGTCCG GAACCACTCA GCTTGACCTG CTTGGCTACC
ATTGGTGTCG TCAAAGTGGT CCACGTTGGG CTGAGCCTCG GCACACACAT TATCGGTGGA
CACTTTTGGG CTCCCTTGTT GGTGGCAGCC CCCGCCGCGC ACTTGCTCAC GGACGCCATG
GGCGGTGTGG CCCGGGCCTT GGGCCACGCG GGGGGTTTAG AAGCCTACAA AACGATTGTG
ATTCTGGTGG TCATGGGTGG CGCGCACGTG ACGGTGTTTC GCGCATACAC AGCCATTGCG
TTTATTCTGT TCTTGACGGT CGCGGGACAC TTGACGGAAC TCTTGACCTT TCTCATTACC
GCCCTCTCGG CGGTACAAGT GCTGACTACC GCCTGGATGG AACGCTTCGT CATGTACCGA
TCTCAAGGTC CACGTTGCGA CGTTGTGGCC GCTCCCGAGG TTGTCGAAAA GCCCGACCGC
TTTGACGATT TCGACGACGA CGATGAGGAA AGCGGCAACA GTGCCGCGGA TGCGAGTGTG
GAGTCGGAAA CGAACCCCGG AGACTATTTG CGGGTGGAAA AAGCTAAATT CTACGGGGGC
ACCACGGACT CTGTGGAACA ATGCTTTGGC GAAAGTGATC GTAGTATTCC CACCCCCACA
AAGAGTAGTG CCGGAAAAGG CAACACCCAC AAGGTACAGG GCAATCGACG TCAGTTGCGT
CGAATGACGT CCAGTCAACT GGATTTGTTG GAACAGCCCC GTCAAGCCGC GCTTCGGAAA
CAAAAGTCGT TGACACTGTC TGGTTCCGCT CGGAGCCTGT TATCGAGTGG AAGTTCAGGT
CGCAGCGTCA GTACTACTAG TAGGAGTATC AGTAGCAGCA TCAGTAGTAG CAGCTGCCTG
GTGACACAAT CACGAGACCA TTGCGGCGAC CCAACATGGA CCCTGTATGT TTAA
 
Protein sequence
MQKVAMPWTG AAVSKELYEW EKEFLANSIE TSPLSAASRR TGNYGLTSSI GSVAEDDKVY 
KRDSSAIAWL TLSDQSPRLR SFFRTLVFDR ETCEFTPLQS HFWSGVQGLL IGTLTFVWKN
SIEFGIEFFW VILPKTLRNC GVFTDDNGWL PIWHYTWIFS VLTATILGYF ADLYKVPGQD
SYNDSVHQTG LVDFRTALSV LVLSTCGLWS GFSLGPELPL VILGGQFGSY IGYTLNQSVL
HCRVMTLVGS SAAVAGFFKL PLAGAFFVLE ILHRDGLQYY EALRPALFAS VVAVETNRFL
AHRNEHVFFQ YPGTEEEMPN SLYLGVIFLA LFGALAVGIP YIIGVNFCKK LIDSTYDWLE
DEFGQDSERK SLNELRRLNN FKSTEPEPES EYLCGVFSTE ALQAAGKAGL AGLILGWIAI
YLPHTMFWGE AQLQTIIDRG ATPLPIVGSD YPSGLGSQGF CMTDTQSSSP EPLSLTCLAT
IGVVKVVHVG LSLGTHIIGG HFWAPLLVAA PAAHLLTDAM GGVARALGHA GGLEAYKTIV
ILVVMGGAHV TVFRAYTAIA FILFLTVAGH LTELLTFLIT ALSAVQVLTT AWMERFVMYR
SQGPRCDVVA APEVVEKPDR FDDFDDDDEE SGNSAADASV ESETNPGDYL RVEKAKFYGG
TTDSVEQCFG ESDRSIPTPT KSSAGKGNTH KVQGNRRQLR RMTSSQLDLL EQPRQAALRK
QKSLTLSGSA RSLLSSGSSG RSVSTTSRSI SSSISSSSCL VTQSRDHCGD PTWTLYV