Gene PHATRDRAFT_48922 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_48922 
Symbol 
ID7195349 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011688 
Strand
Start bp36395 
End bp39615 
Gene Length3221 bp 
Protein Length965 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183659 
Protein GI219126846 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTGCAGA ATTCACCGAC AGTACCCAAA GTAATCACTG GTAAAATAAT GGACGAATCA 
CTCTATGAGC TGGTTCTTGA GCACTTGAAC AATTATCAAC AAGCTATTGT TGTGGACAAC
CGCGGGGAGA ATGATGCGTT GCTCGATAGG CTGTCTCGTT CATTTGGGGA TAAGCTCAAT
GAACTCGTCG TGACGGGATC GCTCGAGAAA ATGGGCTTCC GAGATGCTTC ATCCAGTGAC
ATGATCCAGT CCGATAAGAA TCAGGATTCC GTGGAAAGCC AGTTGGGATG CATACTTGTA
TCCTTCGTAG ACACGGTTGT CTCCACCGCA GAGCTTTCGG CAACAACGAA AGACTCCGTT
TTCGCCTTGT TGGAGGCTAT TGCTTGCTTG GCTTCTACCT CTCTCGTTTC CTCACTTTCG
GTGTTGACTC GTGCAATTGA GTTCTCCAAC GTTCTATTAG AACGCGTTAG AAACGTTGCT
TGTCGACTGA TTGGGAATCT TGTCGGATTC TGGATGCAGA GTTCAGCGCA GCACCTTTTT
CACGGTCTTC TCGACCAGGC GTCACAGGCT GTACTGCCTC GCTTTACGGA CAAGACCCAG
TCAGTTCGGA ACGCGGCCAT TATCGCTGCT AAACAATTCT TCACCGGAAC CATTGATGAT
GCCGATTTGC GGACTGCCTT GGTGTGGAGT GTCCAACACG ACCCGTCCGT TACTAATCGC
CTGCAGGCTC TCGAAAGCCT GCCACTAAAC GGCCAAACAT TGGATATCAT TGTTGCCCGA
ATTGCCGACG TGAAACCTAA AGTCCGAGTT GCAGCCTTAC ATAAGCTTTC TACCGTTTCC
ACTTGGCAAT CACACGAACG AGCGGCGCTC GTGCGAGCTG GTCTTTCCAA GCGGTACGTG
CTACGACAGA CACGCAAGCT GATCATGACG TACCTTGCTT ATGTACTAAT CACTTGACCT
CCGCTACATC CTTTTGTCTT CATCAATCTA CAGCTGTACT GCCACGCGGG ACGCGACCGT
CAAAATGGTT TGCCAGGCTT GGATGAAAGC CGTCAAATAC GAACCACTGG AACTTCTTCG
TGGATTGGAC GTGGTGAATT TTGAAGAGGA GGCAGCTCAA GTTGTCAAAC TATTGCTAGA
CGCGACCAAG GACCCTACAG CGTACCTGGA AGAGTTGTCG ATGAGTCCAC CCGAAGTACG
TGCGTTTGTT GAGAATGTGA ACGCGTCGTC TTCCCTTGTG ATTGCAAATG CTGAACAAGT
TTTGACGCCA GAGGCGTTAC TGTGGGCTCG TGTAGCTGTC CAACACACCA AAACATCGCA
ATCAAACTCT CGAGCTGAGG CCATGCTTTC CCTCATCATC CCTGAAGTTC CCATTCTTTG
CTCTGTTGTC GAGAACCACG CCGCCCAACT CATGACAGTG CTTAGTGAGC AGTACGAAGA
TGAGGACGGC GGCTTGGTGG ATAGCCTCGT CACAATTTGC TTGCAGCTCT TACAGCTAGC
GACGTGCGTT TCAAAGTCTT CGATGGAAGA AGGATCGCGT CGAGTCTTTA CTGCCGTCAT
GAAACGCATG CTCAGTTCCG TCGTTACACC GGACGACCTT GTCGAAGGCT GCGTTCAAGC
CCTACACTCC GTTTGCATCC TCGAAAAAGA TCTCGTCGAC GCAACCAGCG AAATAATTGC
CGAACTCAGT CGTCTGAGCC AAGAGCACGC GGAGTTGCAA AAACAGCATC ACTTGCGCAT
ACTTGCAATT CTCAGCCTAG TCTTGGAACG TGTGTCTCCC GGATTTGGTT CAAATCCCGA
TGCTTTAACA TCCTGGGCGA CTCATATTAT CCCAGCTGTA ACCAGTGAGA ATCTATTGAT
CCGTCAACTC GGCGTTTCTT GCTTTGGCAA GCTCGGTTTG TTTACCCCTG TCGACACAAT
CTCGGAACAA TTCCTCCCAC TAATATTGCG CATGGCGTCA AACGAAGTGG AGACGGACGA
AATACGCGCT CAAGCCTTGC TTGCCCTCTC GGATTGGGCT ACGCTATTCC CAGTTGTCTT
AAAAACTCAA GAGTTAGACG GGAAGATGGT GTCTATTTCA GACGTTGTCC ATTATTGGCT
CGATAATCTT CCGAAGGGGT CGGGCAATCA TACGTCTTTG GCTTTTATAG CTGCAGAGGT
TGCAACGAAG CTCTTATGGT CGGGACGAAT TGTGGACAGC TCATGGTTGG CCAACCTTGT
GGTCATATTC TTTGACCCCA ATCTTTCGAC TGGGGAAATG GAGGAAGAGT ACCACGAAGA
GGAATCAAAG GAAATTGGGA GCCTCGTTCG TTGGCAACAG TTGCAGAGCG TGTTCTTTCC
CGCATACGTT CGTCGTGGTC CTGTTTATCA AGAGGCTCTG TTGAATTCCA TTTCTCACAT
CTTGCAGGTT TTCTCCTCTC GCTCGCAAAA AACAACACGT GGCAAATCGC TGCCCGTTGT
AAAAATGATT GATTTCGTCT GCGCCTTGGT AGAGGAAGGA GTAGCCAAAG CAGACTCGAC
AAAAAGTGCT ACGGAACTTG ACAATGCGAA AGACGAAGAA TGCATGGCAC TCACGTCTGT
CGGACTTTCT TCCGCAGTTC AGATTGCCAC TTTTCTATAC GCTAGTCACG ACAATCTCAA
TGCGGCGGGT CTACGAGCGC TTTGCAAATA CTTGGGCAAC ATCAAATTAG ATTTGAGACG
AGTTTGTTCA ATAGAACTGG TGAAGCTCAA GGGGTCCGTC GAAGACCTGA CCATGGCAGT
CTCTGACTCG ACGTGTCTTC GCGCGTTGGA CTTGCTTACG GAGAAACTGG CCGGTGTCGA
GATCCCTGAT AGCGAAGAAA GCTCGGATGG CGAAGAATCC CTAACGGAAG CAATGGGCGA
TCTGCAGGTA GGAAAAGAGA ACTCGATTCG GCAGGACAGT ACCGCATTGA AGGGGGATCC
TGCAGCCGTA ATCCCCGTAA CTTCCAACAA TCGAGCTACC CTTTCTTACG TAAACTAAGA
GTAAAGGGAA CTGTATCTGA AAGCACATTT AAGTCGTCTG TCAGAGTGAG TTGCTTTCGC
CGCTGTCCAC GGAAGAACTA TAAAACACCA TCAATCTACC GGACCGAGTT TGAAAACAAG
GTACATTGAC TGTGGATATG AGCAGACCGT CTTCTCGCTC GATTTGAGGC TGAATAAGGT
CAAGTGCGCC TAACAGTAAA TTACTTCGAG AAGATGGCTA A
 
Protein sequence
MVQNSPTVPK VITGKIMDES LYELVLEHLN NYQQAIVVDN RGENDALLDR LSRSFGDKLN 
ELVVTGSLEK MGFRDASSSD MIQSDKNQDS VESQLGCILV SFVDTVVSTA ELSATTKDSV
FALLEAIACL ASTSLVSSLS VLTRAIEFSN VLLERVRNVA CRLIGNLVGF WMQSSAQHLF
HGLLDQASQA VLPRFTDKTQ SVRNAAIIAA KQFFTGTIDD ADLRTALVWS VQHDPSVTNR
LQALESLPLN GQTLDIIVAR IADVKPKVRV AALHKLSTVS TWQSHERAAL VRAGLSKRCT
ATRDATVKMV CQAWMKAVKY EPLELLRGLD VVNFEEEAAQ VVKLLLDATK DPTAYLEELS
MSPPEVRAFV ENVNASSSLV IANAEQVLTP EALLWARVAV QHTKTSQSNS RAEAMLSLII
PEVPILCSVV ENHAAQLMTV LSEQYEDEDG GLVDSLVTIC LQLLQLATCV SKSSMEEGSR
RVFTAVMKRM LSSVVTPDDL VEGCVQALHS VCILEKDLVD ATSEIIAELS RLSQEHAELQ
KQHHLRILAI LSLVLERVSP GFGSNPDALT SWATHIIPAV TSENLLIRQL GVSCFGKLGL
FTPVDTISEQ FLPLILRMAS NEVETDEIRA QALLALSDWA TLFPVVLKTQ ELDGKMVSIS
DVVHYWLDNL PKGSGNHTSL AFIAAEVATK LLWSGRIVDS SWLANLVVIF FDPNLSTGEM
EEEYHEEESK EIGSLVRWQQ LQSVFFPAYV RRGPVYQEAL LNSISHILQV FSSRSQKTTR
GKSLPVVKMI DFVCALVEEG VAKADSTKSA TELDNAKDEE CMALTSVGLS SAVQIATFLY
ASHDNLNAAG LRALCKYLGN IKLDLRRVCS IELVKLKGSV EDLTMAVSDS TCLRALDLLT
EKLAGVEIPD SEESSDGEES LTEAMGDLQV GKENSIRQDS TALKGDPAAV IPVTSNNRAT
LSYVN