Gene PHATRDRAFT_44839 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_44839 
Symbol 
ID7199556 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011673 
Strand
Start bp393597 
End bp397124 
Gene Length3528 bp 
Protein Length928 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002178992 
Protein GI219116394 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0126188 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCATCCG TCGATGGGGT TGAGCATCTG TCCTGCTCGC TGAGACCACC CACATTGGAT 
GATATTCAGA GTGCGTTTGT TGAGACGTCC GTCTCGGATA CTATTGACAA CAACGATGAA
ACCCAGGCGA GTACTTTAGA CCAGCCCGCG AACGGAGATT GCACTGCTTG TCCGGGGATA
AACAGCATCG ACAGTATGGA GCGCGGCGAA AGTGTGGAAG AAATTTTTAG AGCGTCACTA
AATTTCGACC GATCTAGTTC TTTTTGCTTT TCAGGAAGTC TACCATTATT TGGTGCCTCT
CTTTATCCGA ATGTTGCGCG GATTGAAGAT GCTGATGGAA TATCGACAAC AGCACAACTT
GAACAAGGAA AACGAGAGCA GTCGGAAAAT ATCAGAGTGG AATCAACCTA TGCTGAAGCC
TCGCTGAACA CTCTCACTGA GGAAGCAGTT GTACTCGGCT CGGCTAGAAT GTCCGATGAT
CCCTGGGACG AAATGACCGT CGAAGACAGA GAAATTCTTG CCATGAAGAG GGCAGCTTAC
GCCGGAGAAA GGAGCGATCC ACTGGTGGCA CAGAACATCA AGAAGCATGG ACAAGAAGCG
ACAGTCGTCG CTATATCGGA GTATGACGTG CATCCGAATG AAATTGTCCA AGATGCCGTT
AGAGCGGAAC TGGTTGGACA GGACTTCACC AGGACAGTCA GCAATGCAAT GGACAACGAA
AACGTGGAAA GTACAGAAGC GGATGAATTC ACAGTCAGTG CAGTCTTTTC AGCATTGAGA
ACGGTCGACA ATGAAGCTAC CGAGGCAACT GTGATTGACA GTGCCCCCCT CGCTTCGGAA
GAAGATATCC CTTCATGGAA GCACACGGAA GAAGCCCAGG TTCTCGTTCT GGATACGTCC
TTGCCTTTGA TTGACGTTAA ACCAGCAGCT GTTGACTTCT CCGAGTACAG CAACAGTGGA
ATAGAGCCTA TCGCCGACCA AGAAGCGGAA GTTCTTGGGA TTCAGGAAGA GATACATCCG
TCCGAGGTTT CAGAGAACGA AACGGAAGCT GAGCTCGTGG GAACGGACCA CAATTTTGCC
TTCACGCTGG CTAATGAGTC ACATCTTGAC CGTAATACAG CAAGAAGCGA CGAGCCAAAT
GAATTGCGGC AAGTAGATTT TGTAGTTGAG ACTGTAGATG GCGGTTCTGA CGGTGAAGAG
CCTATTAGTT TTGGTAATGT TCGCGCAGCT CTTCTGGTTC CAACTATTTT GAACGCTGAA
ACCGTCGATT CCATTCCAAC GGTAGCACCT TTCACACGCT CACTGTCTAT GGAAGGCACA
GATACTTCCT TTAGAGATGA TACATCGGCA CAGACGGGGA ATGGGTCCGC GTTCCCGTTG
CCTCTCCCAC CTCCTAGACA AACTGCTGAA CCCAGTACCA GAAATTCTTC TGTTGGGGAA
GAAGACAACT CACGTCCTGA TTGGCTTCGA GATTCTCCTG AGCAGCTTCC TTTGGGGAGC
GAAACGGTAC AAAGAGAAGC TTCGAACGCC AGTGGCAGAA GTGCCGGCAG TACAGGGAAT
CAGATAGTCC AGAGGACGTC CTCACAACTA CAAGTGGTAA TGTCATATGG CGCATAGCAT
CAGAATATCC CCTTACTCTA CCACCGAACT CACTGAAGTT TTTTTAATTG TCGAAAAGCT
TTCCAGCAGC ATAGCTCGCG GTACTAACAA GGCCTTTGAG GCCTTGTTCG GTGATGCAAA
ACCTCCGTTT GTCAGTCGAA GGCGTCTTGC TGATGGGGAA GCTGCGAGGT ACATCGTTTC
CCGGACACTT CTTCCCGCTT CCGTTATCTT CAGCCAGCCC ACAAAAATGG TGCGTTCTCC
AGCGCTCGTG TTGGATGCTC TATCGTAGCT TCGTCTCGGG TTGCTAATAC GTAGTTCGTT
GTTCTTTTAA AAGTGGATCG CAACTCTGCA GACGAATCAA AAGGCTCTGG ATAGTAACGA
TGTCATGGAG GCGTCTAAAT CTTTGCGGGC TTTTAGTTTG CCGTCCGAAC GCCAGGCAAA
ATGTTTGGCC CAAGCTTGGA CGCCTCCGCG GATGGAGCCG TTCGCCAACC ACCCACTGTG
CAATACCTGC CAGTCCAAGT TTGCTGTCTT CCGTCGAGCC TGTCACTGCC GAAATTGTGG
TGTTTGCGTC TGCAAAGACT GTACCGTGAC TTGGCCGGCA AAGATGGTAC CGGAGACTTA
CAACATAAAG AAAACAGCGA CAGTAAACAT ATGCAAGGCC TGTGATTGGC TTTGCAACAG
CTTTCGTCTT GCTCTTTTGG AGGGTGACCA GGACAAGGCA GTTGCTCTTT ACGCTACTGG
GAACATCAAC ATCGTGTGCC CTTTTGGAAA CGTACGTAGT CAGCCTACGG AATCCGGCTG
CATTGCAGTG CCACCACTCA TTTTCATTTC TTCTAGGTCA AAGGAGAGTT GTTCTACCCT
GTTCACGCAT GCGTTCTCGG TGAGTCACTT TCGATCTTGC GATGGTTAGT CGACGAGAAT
TGCTGCCCCA TAAAATCCGT TCGAGTCAAT GGAAGGACGA AGGATGGAAT CTGTAACTAT
ACGGCAATTG TGACATCAAA GGGCCGGTCG TTGCTGGGAA TCGCAATGGA AAACAACTTG
ATACCAATCG TTCGATATCT GGTTGTAGAG AAGGGCCTCT CTTTGGCAGA GGAGAAGTCC
CTAACCCGCG AAACGCTTGT CCAAAACCTT CAGCTTGCTC TAAGGGCCAT ACCAAACGCC
ACAACATCAA CCGAGGCCAT CGAAATGGAT GTGTCGGAAG CATTGTATCA CGACGCCACA
GTCAATGACA GCGCCGAGGC GGATTTGGGG GACACGACGA GCCCGCTTTC TTCAGAGTAT
CAAAACGTTG TACCAGTGCC TATTCCTGAC CGCGAACAGG GTAGCGAGAG AGATTTGCCA
GGTGGGCGTA CTCTAAGTGA AGAAGCTCGC AATTTCGGAG CAATTAGCCG ACCCGGTAGA
GGTTCTTTTT CGTACGATGG TCGGCAGGAT GAAAATGAAT GTAGGTGTAC TTACTGAAAC
CCGTCTTAGC CTCAGAAGAT TCTCATTTAT CCAACTTGTT TGTCCACAGG TATCATTTGC
TTTGACGCCA ATATCGAGTA CGTCCTTCAC TCTCGCCTGA ATATGTTCGT CTCTACCTAG
CGAGACTTCG TTGCTCACCT TTAATTGTTA CTCTTCGCTC TGAAGTTGCG TGGCTACCCC
TTGCGGTCAT CAAGTCTGTT GTCTGGACTG CAGCGAGCAC TTGTCCCGTT GTCCAGTTTG
CGCCATGCCG ACCTCCTTCA TGCGAGTATT TAAAGTATGA AAAGAAGAAA TTAAGACCGA
AGGTACGCTT GCACTTCGTG TTGTTGGCCA TGCCATACCG AAACGACCCC ATCAGCAAAA
ACACTTAGTT CAGAAGGATC TGTTGCGTAT AGTTTTGTCA CACGTAAGGC TTTTATGTAG
ATTTTTATGG TAGTGTATGA GCTTGTCTTT TGGTCTAGTA GTAAGTAG
 
Protein sequence
MPSVDGVEHL SCSLRPPTLD DIQSAFVETS VSDTIDNNDE TQASTLDQPA NGDCTACPGI 
NSIDSMERGE SVEEIFRASL NFDRSSSFCF SGSLPLFGAS LYPNVARIED ADGISTTAQL
EQGKREQSEN IRVESTYAEA SLNTLTEEAV VLGSARMSDD PWDEMTVEDR EILAMKRAAY
AGERSDPLVA QNIKKHGQEA TVVAISEYDV HPNEIVQDAV RAELVGQDFT RTVSNAMDNE
NVESTEADEF TVSAVFSALR TVDNEATEAT VIDSAPLASE EDIPSWKHTE EAQVLVLDTS
LPLIDVKPAA VDFSEYSNSG IEPIADQEAE VLGIQEEIHP SEVSENETEA ELVGTDHNFA
FTLANESHLD RNTARSDEPN ELRQVDFVVE TVDGGSDGEE PISFGNVRAA LLVPTILNAE
TVDSIPTVAP FTRSLSMEGT DTSFRDDTSA QTGNGSAFPL PLPPPRQTAE PSTRNSSVGE
EDNSRPDWLR DSPEQLPLGS ETVQREASNA SGRSAGSTGN QIVQRTSSQL QVVIIARGTN
KAFEALFGDA KPPFVSRRRL ADGEAARYIV SRTLLPASVI FSQPTKMTNQ KALDSNDVME
ASKSLRAFSL PSERQAKCLA QAWTPPRMEP FANHPLFRLA LLEGDQDKAV ALYATGNINI
VCPFGNVKGE LFYPVHACVL GESLSILRWL VDENCCPIKS VRVNGRTKDG ICNYTAIVTS
KGRSLLGIAM ENNLIPIVRY LVVEKGLSLA EEKSLTRETL VQNLQLALRA IPNATTSTEA
IEMDVSEALY HDATVNDSAE ADLGDTTSPL SSEYQNVVPV PIPDREQGSE RDLPGGRTLS
EEARNFGAIS RPGRGSFSYD GRQDENEFAW LPLAVIKSVV WTAASTCPVV QFAPCRPPSC
EYLKYEKKKL RPKIFMVVYE LVFWSSSK