Gene PHATRDRAFT_49524 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49524 
Symbol 
ID7195856 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011690 
Strand
Start bp506303 
End bp509374 
Gene Length3072 bp 
Protein Length926 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184150 
Protein GI219127871 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.168314 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGATC CCCACGGCTT TGCTGTAGCT ACCGACACCG CAGCAATAAC AGCAGTATTA 
GCAGTACCGG CAGCAACAAC TACCGAAAAA ACCGACACTT TACAATTAAA CGTAGACTCA
GCCACCAACA CCACAACGTG TCGAAAGCCG CATCGTGACG GTAAAGACGA CGAGAACGAT
GACAAGCACG TCGATTTTAT CGTTGCTTCC GTCGACTCGC CGACGATGGA CTTGTCACAT
ATTCCGAATC ACGCGGCGCG GGTCGCGCAC GGTAGGACTG CCACTGCTGG AGCACGTATA
CACCACGGAG GCCATTCCGA TCAGTTCTCG CACACCAACA GCGAAGGAGG ACAGTCGTAC
CAAATTATGG AGTACGATTT GGCCGACAAC GAACACGGTT TCGGTCGACA CGTTTACGAC
TACAGCGACA AAGCCGACGA ACACAGTGGC ACCGACGAAG ACGACGATTT CGTCAATAAT
CCCATGCTAT TGGCGGCGAC GGAACCGCAA ATTGAGGACA ACACACCCGC AGCCGCTGCA
CTACGGGCCA ACGCCAACGT TAAGGTATCC ACCGTTGGCA AGGCCGCTCG TGCCGTACAC
ATGCAGTCGG TGCGCAAACT CATCAAACGA GCTACCGGTA CAACCCACAG TGGATTCGTA
CGTGGAAAAC CGCCCCGCTT GCCACCGGAT GTTCAGGAGG AAGAGCAGTC ACAAGGTGAG
GAAGTATCGC TGGAAGCCCA CGAGCAACAT AATCGTCGAG CAATACACGG ATCCTCCGCA
CTACATCCAC CGTTTCACCA TCCGCAACCA CTACAACCAC ATCCACACGC ACATAGTCTC
GCACACCCTC CACTTCCAAC ACAAGCACCA CAAGCCATTC CACCGACATC ACTGGAATCG
ACAGTCGTCC TGGGCGTCAG GGTACAACCA CATGATGTGG ACAGTGCCGA TGTCATGCCG
GATGCCGTGG TAAAAGGGAC CGTCGAAGCA GGGAAAAAAG GTGAGCACAG CCAGCGCTCC
TGTGTAGATT GCACCCACTC TGATCGTTCT CACAACATGT CTTTGCTCGA CAACTACAAC
AACAACCATA GTGCTGGTCT CCGTGGATCC CCACGACATG CTCAAATTCT TGCGATGGCG
GCGCAAGAAA AAAGCCCACG GACAGCAACC TCCAGTTGCC AAGTCCTACG TAAAGGGCAA
GGTCATCGAC GGAGAACACG AACTTTATAC ACTGTCGATT GCCGTAATGA TTGGAGTTAG
GACGTCTATT TCCTCCACGA ACGCTGTGAT TAACGGTACC ACGGCACCAC AAAATCAGCT
ATTTCCGGAT GCCACCGCAT CGCAGCCCAG TAAACGATGG GTCCAATCAA CCGATTTCAG
CGCATCCGAG AAGTACGAAT TTCGTCCCAA AGGTGGCGCC ACCCCACCGC ATAGACTGGC
GCATACCTTC AAATTCAAAG ACTACTCGCC CATCGTATTT GCCTACTTGC GACGCATGTA
CGGTGTCAAC GAATTCGATT TCTTACTATC CGTATGCGGC AACGCCAATT TTATCGAATT
CATTTCCAAC GCCAAATCTG GACAATTCTT TTTCTATTCT AGTGACGGAA AATACATGAT
CAAGACTATG ACCAATGCCG AAAGCAAATT TTTGCGGAGA AGTACGTGTT GTTATGACGA
GTCTGTACTA GATTCTCTGA GGAGCGCGAT GACAATGGTC TCACTTGTTT ATCTTTCAAC
AGTCCTCCCA AGTTATTTTC GACACTGCTG CGAGAATCCC AATACTCTCA TTACGCGTTT
TTTTGGAATG TACCGCGTCA AATTGTATCA TTTGCGACGA AACGTCAAGT TTGTCATTAT
GAACTCGGTA TATTACACGG ACAAGTATTT GCAAAATTTT TACGACTTGA AAGGTTCCGT
AGTCGGTCGA AACGCCAAAC CTGGGCAAGC GGTCAAAAAG GACAACGATT TGCGCCAGGG
ACTACCGGAG TCTGCCTTGT CGTTGCGACC GCCGGTGCGA ACCCACATGC GCGACCAAAT
TTCCGCTGAT TGTGAGTTTT TGCGGCAGAT GGAAATTATG GATTACTCTA TGTTGGTAGG
AGTTCATCAC GTGCCACCGG TGGAAGATCA CAGTCTCGCC ACGATTGGGT TTCGTGCGAG
CGCACGGACG TCGGCCCAGC GCATGCGCAA GGGATCCTTG GAAATGGATT CCGGTTGGAA
ACCGCTGCCG CAGGACCCGG TCACGAATGG CGCCTTGGTA GACGGGAGCG GCTCGGAAGA
AAAAATCACC GATTTTTCGG TTGTTTCTGA TCGTCGGTAT AATCGCAATG AGTCATTGGG
AGGTTTCTTT TTGGACGATG GGCTTGAGGA CGATGAAAGC AGCTACTTGA TGGGCAGCAG
TAGACGTTTC GAACCGCGTT CGCCTTCATT CCACGAAGAA ACGGAACGAA AACGTCAAGC
CACCATTGAG AAACTATACT GGCCTTTTCA TCGACTGTTC GATATTCATG GGTATCGCCT
TTTGGAACCC GTACAATGCA CCAAGTGCTT CGCTGCTCCT TGCAACTGCG ATTCCGACGC
CTCTCTACTC GAAGGCTACA AGATCCCCAT ATTCGTCAAG CCTCTGTCCG AACGTAAGGA
CGGTGGTCTT GAAATGGATA CGACCGGGCG CCAATTACCT ATGAAGCTGA AAGGTCCACA
CGGCGATCAG CTATACGAGG GTAAGATCTT TTACATGGGA ATCATCGACG TGTTACAAGA
ATATACTTCT CGGAAACGGG TTGAGTCCAG CTATCGGGCC TTGACCAGTA GTGGAAAATT
TGAAGCCAGC TGCGTCCCGC CCGACGTTTA CGGCGAACGC TTTGTACGAT TCTTTGACGA
ATATACTGTT GGCATGACAC AGTCGAAGGA GGGTACGAAA GGATCTATAG AGAAGACTAA
GAACACCCCC TGAAATAAAA CAGGCTGAAT CCCGAATGGC TTTACCTGCG GGCATTGTGC
AATCAGTCCC GGGGTCGGAT GTAGTAAACT TTGCTAACAA ATGAACAAAA TTAAGGCAAT
TTTATTACAA GG
 
Protein sequence
MTDPHGFAVA TDTAAITAVL AVPAATTTEK TDTLQLNVDS ATNTTTCRKP HRDGKDDEND 
DKHVDFIVAS VDSPTMDLSH IPNHAARVAH GRTATAGARI HHGGHSDQFS HTNSEGGQSY
QIMEYDLADN EHGFGRHVYD YSDKADEHSG TDEDDDFVNN PMLLAATEPQ IEDNTPAAAA
LRANANVKVS TVGKAARAVH MQSVRKLIKR ATGTTHSGFV RGKPPRLPPD VQEEEQSQGE
EVSLEAHEQH NRRAIHGSSA LHPPFHHPQP LQPHPHAHSL AHPPLPTQAP QAIPPTSLES
TVVLGVRVQP HDVDSADVMP DAVVKGTVEA GKKVLVSVDP HDMLKFLRWR RKKKAHGQQP
PVAKSYVKGK VIDGEHELYT LSIAVMIGVR TSISSTNAVI NGTTAPQNQL FPDATASQPS
KRWVQSTDFS ASEKYEFRPK GGATPPHRLA HTFKFKDYSP IVFAYLRRMY GVNEFDFLLS
VCGNANFIEF ISNAKSGQFF FYSSDGKYMI KTMTNAESKF LRRILPSYFR HCCENPNTLI
TRFFGMYRVK LYHLRRNVKF VIMNSVYYTD KYLQNFYDLK GSVVGRNAKP GQAVKKDNDL
RQGLPESALS LRPPVRTHMR DQISADCEFL RQMEIMDYSM LVGVHHVPPV EDHSLATIGF
RASARTSAQR MRKGSLEMDS GWKPLPQDPV TNGALVDGSG SEEKITDFSV VSDRRYNRNE
SLGGFFLDDG LEDDESSYLM GSSRRFEPRS PSFHEETERK RQATIEKLYW PFHRLFDIHG
YRLLEPVQCT KCFAAPCNCD SDASLLEGYK IPIFVKPLSE RKDGGLEMDT TGRQLPMKLK
GPHGDQLYEG KIFYMGIIDV LQEYTSRKRV ESSYRALTSS GKFEASCVPP DVYGERFVRF
FDEYTVGMTQ SKEGTKGSIE KTKNTP