Gene PHATRDRAFT_44574 details


Gene Information

Locus tag: PHATRDRAFT_44574
Symbol:
ID: 7198085
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011672
Strand:
Start bp: 944634
End bp: 947670
Gene Length: 3037 bp
Protein Length: 556 aa
Translation table:
GC content: 47%
IMG OID:
Product: predicted protein
Protein accession: XP_002178609
Protein GI: 219115627
COG category:
COG ID:
TIGRFAM ID:
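
The Gene Length value agrees with the Start bp and End bp values if the coordinates are read as 1-based and inclusive. The record does not state its coordinate convention, so that reading is an assumption, and the helper name `span_length` is hypothetical. A minimal sketch of the check:

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    # Both endpoints belong to the feature, hence the +1.
    return end_bp - start_bp + 1


# Start bp and End bp as listed in this record.
gene_length = span_length(944634, 947670)
print(gene_length)  # 3037
```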


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GAACCACAAG GATTCACAGT CAATCACTGT TCGAAATGGC TCCATCTGTC TCCGTCTACC 
ATGGGGCTGC TCCTATTTAT GTAATATGGC GAGTTATGAT GGCCAATCGC ATGCCGATTA
TGGACTGCGA CGAAGTCTAT AATTATTGGG AACCCCTGCA CTTTATTCTC TACGGATCGG
GTCAACAAAC GTGGGAGTAC GCGCACGAAT ACGCGCTGCG AACGTACGCC TACGTGGTTC
CGTTGCAATG GCTTGCCCCG GTTCTCAGTC GATTTGTGTC GCCCTACGCA TGGTTGTTAT
CGGATTACGT TGTTGATACC ACCCTTGATA CCAGCAAACT ATCCTTGTTT CTTTCGTTTC
GGGCTTTTCT TGCAGGTACC ATGGCGCTTT GCGAACTGGC TTGGCTCTAC GCATTGCACT
CCCGACTCTC CAAAGACAGT TCGCTTGTTG TAGGCTGGAC GGGAGTGCTG TTGTTGACTG
CTGCAGGCAT GAACCACGCC GCCGGAGCAT ACTTACCAAG CTCGACCTTT ATGATGGCGT
GGCTCGCTGC GAGTGCGGTT TTTCTGTTGG AACGTCATTT TTTATTCGCA GCAATTGCGA
TTATATGCAC ACTTTCAACT GGATGGCCCT TTGGAGTCGT TGTGCTCGTG CCCTTGGGCA
TTCGTGTTCT AAGAAAGGAA TACAGAGCAC GTGGCGTGTG GAGGTTGTTG CTCTGGAGCG
TTGGCGTTAC GGCTGCGGTC GAAGCTGCCG TCTTGATGAT CGACCACAAA TTTTACGGGG
TGTGGCTATC TCCGACGTGG AACATATTCA AGTACAACGC CGCGGGAGGC GGCGACGAAC
TCTACGGAAT TGAACCAACA TCCTACTACA TCAAAAATCT GTTTCTGAAT CTGAACCTCC
TAGCCCCGAT GGGTATCATC GGCCTTCCCG TTCTCGTCTT TTCGCGAAAT CAACCTGCCA
AAGCAGACCT AGTGACAATG ATAGTCACGC TCTACACATG GCTTGCTATT ACTGTCCCTC
GTCCGCACAA AGAAGAACGG TTCTTGTTCC CAATTTATCC GGTGCTGGTA CTATCTTCTG
TTTTGACGGT TGACCACACT CTCAATTTCA TTGGCCGAAT TGTTGCGGGA TTTTCTCGTC
ACAAGACTCT CGTGCGTAAC CAGCGGATAG CTTTGCACTG CCTCGTATGG CTTCCCGTCG
TCGCGATGAG TTTGTGTCGC GTGGCTGCCT TACACAAGTA TTATACAGCT CCGTTGCAAG
TGTATGCAGC ACTAGTTTCC AGAATGGACC CACTCTCCAA CCAACTGGTC TGTAGCTGTG
GTGAATGGTA TCGATTTCCG AGCTCATTCT ATTTACCAAA GAACCATGAC CTCGGCTTTC
TCCCTTCGTC CTTCGGTGGG CAGCTACCAC AAGCATTTTC TGTACACGGA TCGCTACCTA
AAAGTCTGAA TCTTTTGCAG CCTTTCAACG ATCAAAACCA GCAAGAAATG TCGCGGTACG
CTACCTTGGA TCAGTGCAAT TATATTGTGG ATCTAGAAGG AAGCGATTGT GCTCCTTCCG
GCGCCGAGGT CGTTGCTCGT GCTCCGTTTT TGGATGGTGG ACGATCCTCA ATGATTCATC
GTATGTGGTA CCTTCCGATA TTGCACGATG CCGCCATCAA ATCCGGGAGC GTGCAATACG
AACACTATGT TTTGTACAAG ACTTGAGGTA TGTTGGTTTC CGACGAGACT AGTCTACAGC
ACTTGTTTCG AAAAGTAGGA AAGCTCGGTT GCTACGGTTT TTGGAAGAAT TTCAAGTATT
GAAATTCTCC ATCCGAGTGG TTCTCTTTCG AAAAAAGCCG TGGTGTGTTT TTTGGATGAG
AAATCTGTCG GTAGAGACGA TGATGAAAAC ACTTTATCTA TTGGAGCTGT CTAAATGAAT
GCGTGTTGAA AAGCAAAGGG GTTTATCTGA GTCTCACTTG TTTTAGTACA ATATCTCCTG
TAAACGAGCA TTTTCCCTTG CTGACAAATC GCCTTTCTTC GCAGGCTGTT ATTGAACGCG
ACGACTTATA TTACGGAACG AGTTTCTTTT CCAAGTAGGT CAAAGGTACG TGAGGAAAGC
AGATGCCACC CCTTCGCATG TCCGGAAAGT GATGCAAAAA TGATGAAGTC CAATTGCTGA
CCGAACACAT TTTGTGGAAA TAAATTCACA GCTATTTCAA ATTGCCTGAT TCCTTTTATG
CTATTAATAC GCACGGCACG CTTTTTCCTT TCTCTCACTC GTGCTTCTCT GTTTAGATTT
GCGTAAAGGG CCAAACGATC CGAACGCATC GCACACTTTC CACACTTGAG CTTGGAGCCT
AATGTGGAAT GGTATCCAAT GCTTTTGTTA AACCTTCGGC AGCGTCTGAA AACACGTAGG
CATTACAAGA TTTTTAAAGC GCTTCGCACA CGAGAATTAT TCCTATCAGG AAAGGGTTTA
TTCAATTGGC ACGACTTCCC TAGCGAGATC TTTGAGGGAA GTCATATCTG TCTCGTACAA
ATGTACTTTC GGTTCTGTTT TGTTGTCGAG TACATCTTCT TGGTACAAAC CGACCACACG
AACGCCCTTG GTCATTCCAG GAACCAGAGT AATCATTAAC TTGTGTAGTG ATGGTACGGG
ACGTTTTTCC TCTTTTGCCT TTTCGATCTC AAACTGTTCG TCGTAATCAG CATACCCCAG
GACTAAACCG TCTTGGTCCA CTGCGTAGGC TACTTTCAAG CTCATTCCAT CCTTCGTTAC
CGGTATTATG CTCGACGTGT TCACAATGAA GGGCTTCTTC TCTAAGATAA CAAATCGTTT
CCAGTAGGGA ACGAACTTGT CCAACGACCT CCACCCAGCG ACGTCTTCAA TAGCCTTTTT
AAATGAACCT TCGTCCTTCA AAAAATCAAA CCACTCCTTC CAATCATTTT TCGAATTCAA
GGCTTTTGTT TCACTGAAAC GGAAATCAAT TTGAATATTG CGTACGTTGA CTTGATTGAC
ATCAAGAGCG ATAAGGACAT AAAGCTCGGC ATTCTCC
 
Protein sequence
MAPSVSVYHG AAPIYVIWRV MMANRMPIMD CDEVYNYWEP LHFILYGSGQ QTWEYAHEYA 
LRTYAYVVPL QWLAPVLSRF VSPYAWLLSD YVVDTTLDTS KLSLFLSFRA FLAGTMALCE
LAWLYALHSR LSKDSSLVVG WTGVLLLTAA GMNHAAGAYL PSSTFMMAWL AASAVFLLER
HFLFAAIAII CTLSTGWPFG VVVLVPLGIR VLRKEYRARG VWRLLLWSVG VTAAVEAAVL
MIDHKFYGVW LSPTWNIFKY NAAGGGDELY GIEPTSYYIK NLFLNLNLLA PMGIIGLPVL
VFSRNQPAKA DLVTMIVTLY TWLAITVPRP HKEERFLFPI YPVLVLSSVL TVDHTLNFIG
RIVAGFSRHK TLVRNQRIAL HCLVWLPVVA MSLCRVAALH KYYTAPLQVY AALVSRMDPL
SNQLVCSCGE WYRFPSSFYL PKNHDLGFLP SSFGGQLPQA FSVHGSLPKS LNLLQPFNDQ
NQQEMSRYAT LDQCNYIVDL EGSDCAPSGA EVVARAPFLD GGRSSMIHRM WYLPILHDAA
IKSGSVQYEH YVLYKT
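
The sequences above are printed in space-separated blocks of ten residues, so they need to be joined before lengths or base composition can be compared with the Gene Length, Protein Length, and GC content fields. The following is a minimal sketch of that clean-up. The function names `clean_sequence` and `gc_percent` are hypothetical, and the sketch assumes that GC content means the share of G and C bases over the full gene sequence, which the record does not define.

```python
def clean_sequence(text: str) -> str:
    """Join residue blocks by dropping all whitespace and upper-casing."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string (assumed definition)."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in dna)
    return 100.0 * gc_count / len(dna)


# Final line of the gene sequence and of the protein sequence in this record.
gene_tail = clean_sequence("ATCAAGAGCG ATAAGGACAT AAAGCTCGGC ATTCTCC")
protein_tail = clean_sequence("IKSGSVQYEH YVLYKT")

print(len(gene_tail))     # 37
print(len(protein_tail))  # 16
print(round(gc_percent(gene_tail), 1))
```

Applying `clean_sequence` to the whole Gene sequence or Protein sequence block and taking `len` of the result gives a value that can be compared directly with the lengths listed under Gene Information.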