Gene PHATRDRAFT_44493 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_44493 
Symbol 
ID7197718 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011672 
Strand
Start bp726186 
End bp730242 
Gene Length4057 bp 
Protein Length1247 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002178290 
Protein GI219114989 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
AAAGCCTTAT TTCATCTTGC CATCTTTATT GGTCTTTGCA GCTGATCGTA TTCACAGTAA 
ACCTCTTAGC ATTCCTTGCT GACAACATTT TCTTTCAGCC TGGACGAATT CAAATCTGCT
GTGTAGACTT TCTACGTAAC TGTACGGGGA ACGTCTTGTC ACTGGTCTTC TTTGCATTTC
CTATTTTGCA CCTATCATTG ACTTGCTCTC TAAGGTCGGA TCCAGCCAAC CATTGCAACG
ATGTCGGTTA ACGTGAAGAC TGTGTCCCAA GCCGAACCGA CGACTTCTCC AAAGTCAGAA
CCGCCCATCA CGTCATTTAC ACTAAAGGTT TCGGCCAAGG ATGCAGCGCG CTTTTTCGCC
ATGTCCAAAG CTATCAATGA GCGACTACCA GAAGTATCAC CAGATACAGG AGATACTTGT
GAATTCAGTC TTCACGCACA ACTCTACGAA GCCACGCAAA AGTGCCCTGA CGAGCTGGTG
AAAGGATTTT TGGATCGAAT GATTGCTTTC CTCATCCCTG TCAAGGAAAT AAAGGCTGGC
TTGGTGCTTT TGCATGATGC TAACATGAAC GATAAGCTTC TACATGTTGC GGAAGCGCTG
AGCCCAAGTG CTACCACTAG TGGTTTGGAT AAGGGACAGC TGACTTCATT GTTTCGGTCG
CTGTTGACGG CGATCTCATG CTGTATTGAC CAATCGACCG AGGCAGTCGA CTTGGAAGAT
ACGCTAATGG AAGATAAACC CCAAGGACTA GAACCGCCGC CGAAAAAGAC CAAATTAGAC
ATAAAAATTG TCGAAGAGAG CGATCAAGAG TTCCGACCAT GTCAGTCCCC TTCTTTTGAC
TGTTCGTTGG CAACGCTGCG AGACGAGGAC GACACAGCTA CCAATACTGT TCGTCGAGAG
ATTGACGAAA TATCTAGTTT CGCTGCAGGA GAAGTTCTCT CAGACGGAGT GGAACGAGCG
ACTTTTGATC TGTTGCGATC GTGGTATGAT GAAAAAGGGA AGTCGGTTGT TCCTTGGCTA
GAACTATTGG ATGCAACTAA GTGGATGTCT ACCGAACCTC CAAGCCCACC CTGCAGAAAG
ACGGCCGAAT GCGACCCCTC CATGGAAACG CACCAAGCTC TAAGAGACGA TATACTGCAA
GTCGACGAGA ATGAAAGAAG TGTCTCTTCT GGCTCTAGAG ATATCCGCTG CTCTGTTGAA
GAAGAACTGC CTGTCCCACC TACAGCCGCG CATTCTCCGA TTGATTCACT GAGCGATGGT
GACGGCAGTC GAATTCTCGT TTCGTTTGAT TTTAGCGGCA CTGGCCATAC AACCCCACTT
TGCATCAATG TGTCGGAAAA CAATATCGTT GCTCTTCGTC AGTTGGTCCA TCGCACTGGT
ATTGTGCACT GTCAAGCCGT GGAAATGTGT CGTCGTTTGC TTCATATGGC TTCCCAGCGC
CAAGAAGGTA ATGAAACTAT CCTGGCGCTT CACTGTCATG ATTTTTCTCG ATCGATTGAT
CAGTTGTGGT CTCCAGAGGT GTTGAAGAAT ATTTCAAAAG AAGAGAGGGA ATCTTTTACC
TTGTCGCTCA CTTCAATTTT GTCTTGCTAC CAAGAGACAA AGCCATCTCT TTCGTCTGAA
GAAGTCGACC TTCAGGAATT TGCCGTGGGC TTCTGCTTCT TTTGCGCGGG CAACAAGAGT
GCAAAGCTTG CAACAGGCTT TGAGATGTTG GATGATACAA GACATGGATA TTTGACCGAA
CAGCAGTTAC TACGATACTT GAGCTCCTAC CTCATGATGC TTTCAGCGAT ATCGCTCTTA
CATCCGCTTT CCAAAGGGCA TCATTCAAAC AAGCTGACCC CTCAGCGTCG AAAAGCGATG
CGCACTGCAG TTGACAACGG TGCAAAATGG ACAATCGGTC ACTTCTTGAA GCACATGCAT
GATCGTGAAG GTGGAGAACA CCCAAATGCA TATTCCTTCG AGTCATTTGC ACTGTGGTAC
AGTGTCGGAG GATACAATGT TGCACCCTGG CTAGAACTTC TTGACCTCAA CAAACTTTTC
GTATTGATCG CTCCGGACGT GGAAAGCTCC TTGCATACCA AATCTTCTGC ATCAATGGCA
CACACATCAG GGGGACGGCG TCCAGCTGGT CAGCGCGATC GAATGTCTAC TCTACGCAGA
CACCACTCCC GACGGAATCC AGGAACTCAC CCGGAAGTGT TGTTCACCTT CCCTCTAGCC
CGCAGCCGCT CGTTAGTCGT TTTGAAGGAA GATGCGAGAT ACGTTCGTGA TGTCGTCCAA
GAACTCGGCC TCCTGTCTTT CAGCCCCGAT TTCGTGTGGT CCAGTTTGAG CAAAATTGTG
ACACGACCTA GCACTCCCAC AGAAGGCTAT GGCGTCGACA TGCAGACCTT TGTACAATGC
ATGATCGATG TTTGTAATAA ATCAAGTCGC AAACGTTCTG CGTCAGGAGC TGAGTCTACT
ATGGAAGAGC TTCTGTGTAA TTTTTATCAA TGTTTCAATT TGGATCAAAA AAAGCTGGTC
GCCGTAGACG AGCTTATGGG TGGCCTTACT CTGCTTTGCG GGGGAAAGAA AAGCGTCAAA
CTTGCCTTCG CTTTTGGAAT TTTCGATACA CGCCCGGGAG TTCACGGCAA GAGTGCGGAA
TCGGTTGTAC ACTCTTTGGA TGGTCACGAT CTCTTCGTGT TTCTACGCTC AATCCTGATA
GTTGCATTCT CATGCTGTCG CCAAAGCTTA GATATGGATG ATTCTGTTGT AGGGCAATGT
ATCTCTGATA CTGCCAACAT GCTTTGCAAC GACGTCATGA CTCACCAGGG GAAGCAGCGC
TTCTGTGACC GCCTCAACTT TGATGAATTT GGGCTATGGT ATAACGAAGG AGGATTTGAG
CGAGCTCCAT GGTTGGAGCT ATTAGATCTT AAGAAGTGGG TGCTTGCCGA CAATTTCGAC
GCCACCCTTG AGAAGCGTGT TGTCGAGTCA CAATTACAAG TGATCCCTGT AAGCATTGCG
ACAGATTCTT CAATTCCACC GCCTCCTCCC GAGGATGCAT TAGACGGTAG CTTTTTCGAA
GAGAACGGAA TTATGGCAAT GGACAGTGTA TGTATCCGTT TTAAAACTGA TTCAATTCTT
GCATATCTTT TTAATCCGTG TTCTCTTTCT CAAATTTCAG ATGGATGAGA TGGATATGAT
TTTGATGCAG TCGTCTACAG ATCGCGAAAG TGACCAGCGT TCGCCGGCAT ACGGACCTCT
CCCTGAGTCT TCGTCTCACT CGCCAGGATC AAAACTGTGC TCTCCCGATC CAAGAGGGAA
TCCACTCAAG TTTCATCTTT TGACAAATGA AGAGCAAGGA GGGTATAATG TTTCTCTCAG
TCACATCCGC ATCACACATC TGAAAAGCGT GCTGGAGGAC AACTGTTTGC ACGGACTGGA
CTGTGAAAAT GTATGCAACG CCATTCTGAA TAAAGCGACG AAGAAGAATA AAGCGATATC
AAAAAAGGGT TTCGATGCAG CAGTTGCAAG TGTCATGGGA AGTCAGAGAG GAAGGCCTGA
GACGCAACAA GTGTTGTCTA ACCTTCTTTC GGGAATTTTC GATGCGTTTG ATCGATTAGG
ATCAGGCACT CCGAGCGCAG TAGAAATTGC GTGTGGCTTT ACTGTCCTTT GCCACGGCAA
GAAAAGCGAC AAACTCGAGT TTGCCTTTGA AGTCCTGGAC ACGAAGAAGA AGGGCAAACT
GAGCCGCTCG GATATTTTGA CTTACCTGCG CTCTTTCCTG ACTGTTCTCA TGAGTATTGC
ATTTTCGCCA GCCCTCAAGA AGGATATTCG AGACGACAAG ATATCCACCA TGAAAGGATT
TGGCTGTAAT CAAACAACCG CTGCCGTCAA ACACGCCGTG AACGCTGGCG CTGAGTGGGC
CGTAACTGCA GCATTCGATG GAAAAAGAGA AGGCGACATG TCCGTTATGA GCTTCAATGA
ATTCGCGGAC TGGTACACTA CCGTCGGCTA CAGTAGTATT CCATGGCTTG AGCTGTTGGA
TCTACAGAAG TGGGTTTTCA CCAATGATGC TACCTAA
 
Protein sequence
MSVNVKTVSQ AEPTTSPKSE PPITSFTLKV SAKDAARFFA MSKAINERLP EVSPDTGDTC 
EFSLHAQLYE ATQKCPDELV KGFLDRMIAF LIPVKEIKAG LVLLHDANMN DKLLHVAEAL
SPSATTSGLD KGQLTSLFRS LLTAISCCID QSTEAVDLED TLMEDKPQGL EPPPKKTKLD
IKIVEESDQE FRPCQSPSFD CSLATLRDED DTATNTVRRE IDEISSFAAG EVLSDGVERA
TFDLLRSWYD EKGKSVVPWL ELLDATKWMS TEPPSPPCRK TAECDPSMET HQALRDDILQ
VDENERSVSS GSRDIRCSVE EELPVPPTAA HSPIDSLSDG DGSRILVSFD FSGTGHTTPL
CINVSENNIV ALRQLVHRTG IVHCQAVEMC RRLLHMASQR QEGNETILAL HCHDFSRSID
QLWSPEVLKN ISKEERESFT LSLTSILSCY QETKPSLSSE EVDLQEFAVG FCFFCAGNKS
AKLATGFEML DDTRHGYLTE QQLLRYLSSY LMMLSAISLL HPLSKGHHSN KLTPQRRKAM
RTAVDNGAKW TIGHFLKHMH DREGGEHPNA YSFESFALWY SVGGYNVAPW LELLDLNKLF
VLIAPDVESS LHTKSSASMA HTSGGRRPAG QRDRMSTLRR HHSRRNPGTH PEVLFTFPLA
RSRSLVVLKE DARYVRDVVQ ELGLLSFSPD FVWSSLSKIV TRPSTPTEGY GVDMQTFVQC
MIDVCNKSSR KRSASGAEST MEELLCNFYQ CFNLDQKKLV AVDELMGGLT LLCGGKKSVK
LAFAFGIFDT RPGVHGKSAE SVVHSLDGHD LFVFLRSILI VAFSCCRQSL DMDDSVVGQC
ISDTANMLCN DVMTHQGKQR FCDRLNFDEF GLWYNEGGFE RAPWLELLDL KKWVLADNFD
ATLEKRVVES QLQVIPVSIA TDSSIPPPPP EDALDGSFFE ENGIMAMDSM DEMDMILMQS
STDRESDQRS PAYGPLPESS SHSPGSKLCS PDPRGNPLKF HLLTNEEQGG YNVSLSHIRI
THLKSVLEDN CLHGLDCENV CNAILNKATK KNKAISKKGF DAAVASVMGS QRGRPETQQV
LSNLLSGIFD AFDRLGSGTP SAVEIACGFT VLCHGKKSDK LEFAFEVLDT KKKGKLSRSD
ILTYLRSFLT VLMSIAFSPA LKKDIRDDKI STMKGFGCNQ TTAAVKHAVN AGAEWAVTAA
FDGKREGDMS VMSFNEFADW YTTVGYSSIP WLELLDLQKW VFTNDAT