Gene PHATRDRAFT_42513 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_42513 
Symbol 
ID7196063 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp266059 
End bp270510 
Gene Length4452 bp 
Protein Length1379 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002176556 
Protein GI219109603 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCACGA ACCGTTCGAT TCGTGACGGG TCGCCGAACG TTGCTAACGA CCATCCCGGA 
GACACGGAAC GGAGGAAGAG TCCCATCTCG ACCCCTGCGG CTTCACCGTG TCACGACGAA
ACCGACGACG CTCCCGTGGC AACGCTAGAT ACCGCTCCAG AATTTGACGG CGGCACGAAT
GAGAGTTGGA TGTCTCCCGT TTTGGAAACA CCCAGAGCTC GAGCGTACTC TTCGGAAGAA
CGAGGTGCAA TATCCGCGTC GGTGGGACGG CCTCCCCTCA TCGTCACGAC TTTGGCCGAC
ACGGACAACA CGAGCGTCAC CGAGGGCACC GCACCCGTGT CTCCCGCGAC GTCGACTATT
CCCGTGCACC GACGTGTCGA CAGCAGCGAC GCCAATTCTC TGCCTTCGCT GTCCGGACCC
ATACAACCGG CCCATCCCTT CTTCAGTCCA TCCTTTTTCG AAGAATCCGA CGCCAGCGAT
GGTAGTATTG TGTTTCAACC ACCAACGCCG CAACGCGATA CTTCCAACCC AGATTCTTTG
CGATTCACTG GTCCGCAACA TGTTCGGCGG GACAGCCGTG GACGGGATGG GAGGGATCGC
TTTCCTAGTT TCGATAGTCT CGGGAGTTCC GGATCCATTC GCATCGTTCC TCCCAGTAAT
AGTAGTCAAA TCATGCGAGG AAAAGAAAAG TTCTCCGCAT CGTCAAACTC CTTACCCGCA
AATCCAGCTC CGGTGATACC CAGCCTGCCG ACTATTGGAC AACTGCCGTA CCACGAACGA
CGAAAACTCA TGGCACAACG TGCCAAAGAT CAGCAACAAC AACTACAGCT GCAACAACAA
CAACTACAGC TGCAACAACA TCGCCCGCAT ATACCTCCGA ATTTAGCACC CATGCCGCTG
CCGGCTCCGA CTTTGCAGCA TCTTCCACAC CAGCAGCAGC ACCAGAATCT TCCTCAGATT
ACACCACACA CCAAAGAGGT TGCTTCTCCG TATGGACAAA ATGTATCGTT TCCTCCCCAT
CAACCTCAGC ATCCACTACC GCAGGCTCAT CCTCACCACT ATATTCAAAA CTTGCCATCG
GGCTACGCAC CGCATGCCGG AATACCGTTG AGAGCATCGC ATGGGCCACC ACCGCCGCCC
CTTTATGCTT ACCCAGTACC TCAAGTACCG CTAGGCCCGT ATCCACCGCC ACCTCTTCAG
CAGTACTACG GGCACCTTCC ACCACCACCA TCGCAGCAGC AGCAACTGCA ATCACTCTGG
ACACGAAACA ACCCTCTCCA AACCGCGGAC AAGGTAGCAT CGGATCCTCG TCAGCAGCAA
CATTTATACC GAACGTCCGG CAATCACCAT CAAATACCTC TTCCCCCACG TGGAGGAAGT
CATTCGCGGA ACAATTCGTC GACAATTGGA CTGTCTTCGA ATATGTCCTC ATCGGATGGC
GATGTGGAAC GGAAGCCGTC GGCACGAGTG CGTCCCGAAA TTGCCCCAGC CCAGCCTCCA
TTACCACCCC CTCCCCCTCC ACCTGGTTTG CCGCCAAATT ACGCTCATAT GCGAACGGAC
AGCTCCGGAA GTCTCTCGTC CTTAGGTAGT TTCGACCGAC CAGCGAAGAT AGAACCACGA
AAAGCTAGTT TCTTGGAAAA ACTCAATCCG TGGTCACCGA AAGTACCAAA CGTAAACGAT
TATCATCGTA AAAATCAGCA GTTTTTGCGG CGGGCAAGTG TAGAACGCGG AAAACTTTGG
AGTACGAGCC CACAAACACC AGCTGGACGA CGGTACGTGC ATTACTTGGA GATCTCTGCG
CATCTTTGTT GACTTCCGTC GTTTACTCAT TTGCCTTTCA TAAATCCGAA AGAAAACCAT
CAGTTTCCAC CGAAGGACCA CCGCCCACAC GCGGATCACA CAAGCGGTTA AACTCCATAG
ATGGTGACGA CTGGGAGGAG AGACCTGAAT CTCTTGATAC CGTGCCCTAC GGCAGCAAGC
AGAATTCGGC CGATCCAAGT GACCAACTAA CAGTGAGTTC CGACGTCAAT TCCAGGTATG
TAGACATTTT ACTTTTTTAC CAATTCATTG TTGTTGGAGC GAGTACTCAT AGTCTGATGG
CTGTACAGTG AGGACAATGA TGTAGATTCT CTGGACTCGC CACAGTTTTA CGGGCATGGT
TCCCGACGTG GATATGACGG AGGTGCTGAG CCGAATGAGC GCTCAAATCT ACTGCCGCCT
CCTGGCTTGA ACACGAGTGA ATACTACGAC AAGTCAAATG CATCTGAGTC TCGTGCTGGC
TTAAGCAGCG AAAGCTCCCA ACGCCGACAG AGCCGTCGTA CGAATTGGCC AAACGAATTT
CAGTCGGCTT CAACGGAACG CAGGGGAAAT ACGACGGGAG CCAAAGAAAT GGAATATCTG
GTACGCCTTG GCCTTGTGTG CCTGCCTTGA TAAATGAAGT TTTGGATTTC ACTCAACACC
ATATATTCCT TTGATAGACC GAGAAAGAAA GACGAAAGCT GAATCGAAAG AAGAAGAAAA
AGAAGAAGAA ACACGAACGA CACTCGAGAA AAGCTCGTGT GCAGAGTGGT TCCTCGGAGG
AAGAATCCAG CAGCTCTGTG TCGGCATCAT CAGCGTCTTC TCACGAATAT CAACGATGGA
TGAAGAAGCG TGCACGGATG CTGGAGAAAG AAAGATCTCG ATTGATCAAG CAATGGAGAG
CTGAAGCATT TGCGGAAGAA CGGTCTACGC AGCAGCACAG TCGTTGGTAC CGTCGTTTCA
GCCGCTATCA AAAAGAACAG TTTGGCGAAT GGGTCAGCCA GCTTTTCCGA TTCTTTATTT
GGCTAGAATC CTTTGTTGCC AATCTTCCGC TTACGATTGG TGCAATTGCT CTAGCAGTCG
CCAACCTTGG TGTTGACTGG TTCAAATTTG CTGAAGAAAA CATGGATTCC TGCGAACCGG
TGCACTTTCA TTCATCTCAG TGCACATTCC CCGAATTTCC TGGTTGTTTT TATTGCGACA
CTAGTGCTAG AATGTACAAA GTTGCGCTGA ATTTCCATTT TGCTTGTTCG ATTATCGCAG
GAGTCATAGC GTCGACTTTT ATTGCCAAAC TCATTTTGGC CCGCCGTGTG GTATTCGACG
AACTGAGTTC TCCAACTACG GCAACACCAG CGGGTTTGCT TTGCATGACT CTGAATGTGG
TTTTTGCCGG ACGAGGACTG ATAGGACAGG TAGTGGTCTC GCTAGCTGGC TTTATTCATC
TCTGTCTTGC AATCTGGTTC ATTTACATGG CGCTAGCATA CCGCATCATG CCTGAACCAA
GTTGGTTCCC AAACACGGTT TCTATTGGAC TTTCGGCAGT GAAAATATGG TTGTACTATC
CAATGGCTGG GCATTTCCTT ATGGCGGTGT GTACTACTTG TTTTTGTTGC TACTAAACAG
AAAGGGCGTT ACAAATTTTC TCATGGCGCT GGCTCTTTTT TGCTTCAGAT ATCCCTCTCG
TTGAACTTTT TCTTTTTCCC GATCAGTCTT ATTCGTGTTG CCATGAATAG AAAAATTTCG
GCAACAGTAG GGTGGATGCA AATGTCCGCC CCAAACATAA GTCTTTATGC AATGACGCTC
ATGGCCCAGC CTTCCTTTAA GGAAGAACAC CCAGATATCA ATCGGTTTCA AGTAGTCCAT
CGCATGGTAT ATCTACCTTG CATGCATTTT TTCTTTGGCC TTTGCATAGT GGGAATGCTA
GCTAGCGTCC ACAGCTTGTT GGTTCGATGG ACTGAGTTCC GAAAGATTCC ATTTTCTCCA
GCTCATGCTG CTTTTTGTGT TCCGACCTTA TCTCACGCGA ACGCTATCCA GGCGTACCGA
GCAGCCGTCA ATTCATTTTC AAAGGTGCCT GTTGGAAGTC CGTTTCGCAG CTTCCTTTAT
GTTTACTGGG TCTTTGTTCT CATAGCTGGA ACGTTCCTGA CACTTTGGAT TGCGACGAAA
TTTATGTGGA GCTTACCAGG TTGGACTCAT ATTGATACGG CAGGCGAAAT GGAACCGCCA
GCCCCATACG AAACAGCCAT GACGTCATCT AACCTAATTA CGACCGGAGA AAGCTTGGTG
CAGCCGTTCA TCAGCCCAGC GATTCTACAG GCCAATGAAA CGGGTGCTTT GGTAGTTTCC
CGCGACCAAT ATGGAGCTCA AGTCTACCGA CGAACGCGAA TGGTGACTGC GCTCGGTTTC
GAACCGATTA TGAACCAACT GCAAATGGAC GTAGAGCGCG AACTACTTTT GGACTGGGTC
GGAAAGAATC CTCCGCGACG GCGACACCGG ACACTAAGCG TACCGGGAAT TGACTTTACA
TACGGAGCCA CGGGCGCTTT CGGTGCGGGC AACGCCGGTG TGTACGGAAT GGACGAGGGA
ACAGGGTCAC CGTGGTTTTC TCGTCCGAGA GCGAACACCA GCTCTCCCAA TGTGAGTCAT
CGGTATACTT AA
 
Protein sequence
MSTNRSIRDG SPNVANDHPG DTERRKSPIS TPAASPCHDE TDDAPVATLD TAPEFDGGTN 
ESWMSPVLET PRARAYSSEE RGAISASVGR PPLIVTTLAD TDNTSVTEGT APVSPATSTI
PVHRRVDSSD ANSLPSLSGP IQPAHPFFSP SFFEESDASD GSIVFQPPTP QRDTSNPDSL
RFTGPQHVRR DSRGRDGRDR FPSFDSLGSS GSIRIVPPSN SSQIMRGKEK FSASSNSLPA
NPAPVIPSLP TIGQLPYHER RKLMAQRAKD QQQQLQLQQQ QLQLQQHRPH IPPNLAPMPL
PAPTLQHLPH QQQHQNLPQI TPHTKEVASP YGQNVSFPPH QPQHPLPQAH PHHYIQNLPS
GYAPHAGIPL RASHGPPPPP LYAYPVPQVP LGPYPPPPLQ QYYGHLPPPP SQQQQLQSLW
TRNNPLQTAD KVASDPRQQQ HLYRTSGNHH QIPLPPRGGS HSRNNSSTIG LSSNMSSSDG
DVERKPSARV RPEIAPAQPP LPPPPPPPGL PPNYAHMRTD SSGSLSSLGS FDRPAKIEPR
KASFLEKLNP WSPKVPNVND YHRKNQQFLR RASVERGKLW STSPQTPAGR RKPSVSTEGP
PPTRGSHKRL NSIDGDDWEE RPESLDTVPY GSKQNSADPS DQLTVSSDVN SSEDNDVDSL
DSPQFYGHGS RRGYDGGAEP NERSNLLPPP GLNTSEYYDK SNASESRAGL SSESSQRRQS
RRTNWPNEFQ SASTERRGNT TGAKEMEYLT EKERRKLNRK KKKKKKKHER HSRKARVQSG
SSEEESSSSV SASSASSHEY QRWMKKRARM LEKERSRLIK QWRAEAFAEE RSTQQHSRWY
RRFSRYQKEQ FGEWVSQLFR FFIWLESFVA NLPLTIGAIA LAVANLGVDW FKFAEENMDS
CEPVHFHSSQ CTFPEFPGCF YCDTSARMYK VALNFHFACS IIAGVIASTF IAKLILARRV
VFDELSSPTT ATPAGLLCMT LNVVFAGRGL IGQVVVSLAG FIHLCLAIWF IYMALAYRIM
PEPSWFPNTV SIGLSAVKIW LYYPMAGHFL MAISLSLNFF FFPISLIRVA MNRKISATVG
WMQMSAPNIS LYAMTLMAQP SFKEEHPDIN RFQVVHRMVY LPCMHFFFGL CIVGMLASVH
SLLVRWTEFR KIPFSPAHAA FCVPTLSHAN AIQAYRAAVN SFSKVPVGSP FRSFLYVYWV
FVLIAGTFLT LWIATKFMWS LPGWTHIDTA GEMEPPAPYE TAMTSSNLIT TGESLVQPFI
SPAILQANET GALVVSRDQY GAQVYRRTRM VTALGFEPIM NQLQMDVERE LLLDWVGKNP
PRRRHRTLSV PGIDFTYGAT GAFGAGNAGV YGMDEGTGSP WFSRPRANTS SPNVSHRYT