Gene PHATRDRAFT_38794 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_38794 
Symbol 
ID7203569 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011685 
Strand
Start bp223716 
End bp225341 
Gene Length1626 bp 
Protein Length541 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182796 
Protein GI219125038 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCAAG CGGACGATAC AATGATCCCA AATCGATCGT CGACCCACTT TTACGATGAA 
GACGAAGACG ACGATGATGA GGAGGATGAC TTGGAGGAGC TTCAAGTTCT GCAGCCCTCC
GCGAGACAGG AACTCAGACG AGCCGATCGC TTATCGGTGC GTTTGCTGGC AATTCCGGAC
GAAGATGAGG ACGAGCACGA TCGCGTTTTG CGAAAGTCAC TGCGCCTACT GGACAACGAT
CTAAACGGCT CATCTGGCTT GTTCGACGAC GAAAGCAATG GCGCTGTGAT ACGTCAAAAT
TCGGGCGACA TCAATTTAGG AGGAGGACTG GTCCGTCGGA GCTCACGGGC TTCTCTACGC
TTGTCCGCAC GCCCCGGTGA AGACGGCAAA ACTGCAGGGC AACGCGTGTG CACGATGGTT
GGTGTTGCCG TAGCAGCGGT TGTTTTGCTT TTAGGAATCG CCGGATTCAT TGGTGTTACG
GTCGTTGGCC CACCCAATCA ACCAGTCGGA CCGTACCAGT TGGTAGAACG ACAGGAAGGA
AACGATTTCT TCCAGTTCTA TGACTTTTAC GAAGGCCGAG ACTCGGCCGG ATCTAACGGG
TTTTTGAATT ACGTATCGTA CGATAAGGCA ACCTTGCGGG AAATCGTCAA TGTCACCTAC
GAAGATGACG TTCTGGATAT ATACGCACAG CAACGCAGCA CACCGGAAGT CGGTTCGAAT
GAAGCGCAGA CCAAACAAGA ACCATTTATT TACATGGGAT CGGCTCCAAC GCCAGCTGGT
CCGCGAGATT CTATTCGCTT GGAAGGTAAT CGCCGCTTCA ATAGGGGCTT GTTCATCATT
GATATTCGCC ACATGCCCGT GGGATGCGGA GTCTGGCCCG CCTTTTGGCT CACGGACGAG
GCCAATTGGC CAGTCAACGG AGAAATCGAT ATTGTAGAAG GCGTAAATTA CCAGTCCGTG
GCGAAGACAG CCTTACATAC TACAAAAACA TGCATTATGG ACGACATTCC ACTTGGTACG
ATGACAGGAG GATGGGATTC AGCCCAAGGT ATCCCAAATG CCAAAACCGG TATCCCAGAT
ATGACAATGC GAGAAGCACG CAATTGCTTC GTGTACGATC CCCATCAGTG GCTGAATCAA
GGGTGTGTTG CAGTGGATAC GGAAGGAGGT TCGTTAGGAG TTCCGCTTAA TGCTAAAGGA
GGCGGTGTCT TTGCGTTGGA ATGGGACCCC ATCAACCGAC ACATTCGTAC CTGGGTATTC
TCTCCGCATT TAAATGTACC TGATAATCTC GTCGATTCTA TTCGAACGGC AAGTTTACCC
GACTCAGAAC GCATCGTGCC CGATCCAGAT GTTTGGCCGC TTCCGTACGG CTTTTTTGCA
ATTGGTGAAG GTACCAACTG CCCGGCATAC CATTTTCGGC ATATGCGACT TGTATTTAAT
ACGGCGTTTT GCGGCAGTGT GGCAGGAAAC CGGTTCCACA TTGATTGCAA AAAGCAAGTC
GCGGCCAACT TTAGTACCTG CACTGATTGG ATCAAAAGCG AGCCAGAAGA ATTGCAGGAA
GCTTATTGGA AAATTCGCGG GGTGTATGTT TACGAACGTG CGTGGGAGCG AACATGGAGT
GTTTAG
 
Protein sequence
MKQADDTMIP NRSSTHFYDE DEDDDDEEDD LEELQVLQPS ARQELRRADR LSVRLLAIPD 
EDEDEHDRVL RKSLRLLDND LNGSSGLFDD ESNGAVIRQN SGDINLGGGL VRRSSRASLR
LSARPGEDGK TAGQRVCTMV GVAVAAVVLL LGIAGFIGVT VVGPPNQPVG PYQLVERQEG
NDFFQFYDFY EGRDSAGSNG FLNYVSYDKA TLREIVNVTY EDDVLDIYAQ QRSTPEVGSN
EAQTKQEPFI YMGSAPTPAG PRDSIRLEGN RRFNRGLFII DIRHMPVGCG VWPAFWLTDE
ANWPVNGEID IVEGVNYQSV AKTALHTTKT CIMDDIPLGT MTGGWDSAQG IPNAKTGIPD
MTMREARNCF VYDPHQWLNQ GCVAVDTEGG SLGVPLNAKG GGVFALEWDP INRHIRTWVF
SPHLNVPDNL VDSIRTASLP DSERIVPDPD VWPLPYGFFA IGEGTNCPAY HFRHMRLVFN
TAFCGSVAGN RFHIDCKKQV AANFSTCTDW IKSEPEELQE AYWKIRGVYV YERAWERTWS
V