Gene PHATRDRAFT_40608 details


Gene Information

Locus tag: PHATRDRAFT_40608
Symbol:
ID: 7198384
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011692
Strand:
Start bp: 435360
End bp: 438320
Gene Length: 2961 bp
Protein Length: 911 aa
Translation table:
GC content: 53%
IMG OID:
Product: predicted protein
Protein accession: XP_002184615
Protein GI: 219128847
COG category:
COG ID:
TIGRFAM ID:
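
The coordinate and length fields above are related by simple arithmetic. The following is a minimal Python sketch of that check, assuming the Start bp and End bp values are 1-based and inclusive (an assumption, although it agrees with the listed Gene Length). The helper names `span_length` and `coding_length` are hypothetical and exist only for this illustration.

```python
# Values copied from the Gene Information fields above.
START_BP = 435360
END_BP = 438320
GENE_LENGTH_BP = 2961
PROTEIN_LENGTH_AA = 911


def span_length(start: int, end: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting one stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


gene_len = span_length(START_BP, END_BP)
cds_len = coding_length(PROTEIN_LENGTH_AA)

print(gene_len)            # 2961, matches the listed Gene Length
print(cds_len)             # 2736 (911 codons plus one stop codon)
print(gene_len - cds_len)  # 225 bp not accounted for by coding sequence
```

The 225 bp difference is what one would expect for a gene marked as spliced, since the genomic span would include intron sequence. The record itself does not list exon boundaries, so this is only a consistency check.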


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.158773
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCACCA AGTCTACTCC CAAAGACCTC ATTGACTCGT TTCCGCACAG CAAACTCACA 
CCGATTGCTA CCGCGACAAC CGAACCCGAT TACTTGTCGC TCCATCAGCT GCAGTATGAA
ATCAACGACA ATGCGGAGAC CCTTTCCTCC ACTCTTGGAG ACGGCCAACA CGGTCACCTT
TTTCTCGTAA TTTCCGAAAC CGAGTACCTC GAAATGACCG ACGGCGTTCC ATGCATTCCT
CCTGTGCAAC CGCCTTTCGA CCCAGTTCAC GCTGCCAACG CCACAGCTCC TCAAATTATC
GAAGCTAACC ATCAGAACGA CAAACGTCAA AAGCTTTTTG ACCTTTACCA CAACGCCATT
AAAATCAACT CCTTGAAGCC ATTCCCATTG AATACATTGA ATCTCTCGGT CATCCTACAC
GAGGCTTTAA CAAAGTCTCT CCCCTCGAAA TCCTCTCTCA TCTCTGGGAA AATTTTGGTA
AAATTCAGGC TTCGGATCTC ATCGCTAACG ACGAACGCAT GAAAGCCGCC TGGCATCCAC
CAACGCCTAT CCAGCAACTT TTCCAGCAGC TTGACAAAGG CAATCAGTTT ATCATCGCGT
CTGGCCAAGT CATGGACGAA CGAATTATCG CTCGCATCGG CTACCAGATC ATCGAAAAAA
CCGGGCTCTT TGATCTTGCT TCTCGCGACT GGCATTATAA AGATGAAGCC GATAAAACTT
TGGCAAATTT CAAAAAACAT TTCCAGAAGG CCAACAAGGA TCTCGCCCTC ACCGCCACCA
GCAGCTCTGC AGGTTATCAC ACCGCAAATC AGAGTACTGT CACCAAGGGA AAATCGTATT
GCTGGACACA CGGCATCGTT CACAACACGA AGCACACCAG TGCGACATGT GAAAAACAGG
CCCCGGGGCA CAAAACTGGC GCTACCTTGC ACGACAAACA AGGCGGGTCG ACCAAGACCT
ATCAATACAC GCCACCGGTG CCCAAATAGG AAAGGGGGAC GGCCAAACTG TTGAGTGTGC
CGCTGAATAC TTATCATAAC AAAAATAAGT CTTCAGTTGC ACCAAACACT CCTCCGTTAG
CTTCCTCCCC GCCATTTTTT CCTCCCGACG CCATTGCAGA CACTGGCTGT ACCGGACATT
TTTTGAGCAC CAACATTGCT CACATACACT GCCAACCGAC AGTCCCCGGC ATCAACGTGG
TCCTCCCTGA TGGTCGCACA ATCACTTCGA GTCACATCAC CGAACTCAAC ATTCCCTCGC
TTCCGCCGGG AGCTCGTACC GCCCATATCT TTCCTGGTCT CTCGAATGGA TCCCTCATTT
CCATCGGCCA ACTTTGTGAC CACGGCTGTA CCGCCACGTT CACATCCGAC TCAGTCCGCA
TTGAGCTCAA TAACACTGTC GTTCTCCGCG GCGGCCGTTC TCCTTACACC CGATTGTGGA
CCCTCGACTC CCCTGTAACG CCAAATCCTC CCGCCACTGA ATTGCATGCG CCTTTGCACG
ACAAAAATTT TGCGAATCAC CTCGGAGACC ACTCAGGGAC CCTTGCCGAC CGCATTGCCT
TTGTTCATGC ATCCTTATTC TCGCCACAAC TTTCGACATG GTGCAAGGCC ATTGACAAAG
GCCGCCTCAC AACCTTTCCG GACATCACGT CTGCACAGGT AAAACGGCAC CCCCCACAGT
CCGTCCCTAT GGTCAAGGGA CACCTTGACC AGCAACGGTC CAACCTACGC TCAACCAAGC
CCAAGGTCAC CCTGTCTGCC TCTGTTGATC CTGATGACAT CAATTTCGAC ACCAATCCTG
TCGTACAAGA CCCTCCAGCC GCCAGGACGC AGTTTTTGTA CGCCGATTTC GCCGAAGTCA
CCGGAAAAAT TTTTACTGAC CCTACCGGCC GTTTCGTTAC CACTTCAAGC TCCGGCAATG
CATACATGCT AGTGGTTTAT GACTACGATA GCAATTTTAT TCATGTCAAA GCCATGAAGA
ACCGCACCGG TCCCGAGATT TTGAGCGCCT ACAAGCGTGC TCACGCCATG CTGTCCTCCA
AAGGTTTGCG CCCCCAACTC CAACGCTTAG ACAACGAAGC CTCAACTGCG TTACAACAAT
TCATGTCCTC TGTTGACATT GATTTTCAAT TAGCTCCTCC GCACGTGCAC CGTCGGAACG
CCGCCGAACG GGCAATCCGC ACGTTCAAAA ACCACTTCAT TGCAGGTTTG TGCAGCACCA
ACAAGAACTT TCCGCTCCAC CTTTGGGATT GCTTACTCCC ACAAGCCATC ATGACTCTCA
ACCTTCTTCG AGGGTCTCGT ATCAACCCAA ATCTGTCGTC CTGGGCCCAA CTCCACGGCT
CGTTCGACTA CAATCATACC CCTTTGGCTC CCCCGGGCAT CCGCGTGCTT GTACACGAAA
AACCGTCAAT TCGCAGAACT TGGGCCCCCC ACGCAGCCGA CGGTTGGTAC GTTGGCCCCG
CCATGAATCA TTACCGATGC TATCGCGTCT GGGTCAAGGA GACCACCAGC GAACGCATTT
CGGACACTCT GACCTGGTTT CCCAGCCAAG TCAAAATGCC CAGCACCTCG TCTCGCGACA
CAATTGTCGC CGCCGCTCAC GATCTTGCCC ATGCTCTGGC ACATCCCTCT CCTGCGTCGC
CTTTATCACC TCTTTCGGTC AACGAACGCG AAGCCCTCTC GCAACTTTCA GATATTTTTT
CGAAAGCCGC TAACCCAGTT GACTCGTCCC TCCCAGTTGC TCCCACGGCA ACCCTAAGTC
CGCCAACTTC ATCTACTTCT TCACCTTGTC AAGTCCGCTT CCGAGACCCG GTCACTGAAT
CACTTCCGAG GGTGCCGACC GCCACAGCCG CCCTTCCGCA GTCACTTCCG AGGGTGCCTC
CCCCGGACTC CGAGGCTGAG ACATACAAGC TTGTCACCTG CAACCCTCGC CAAGCACGTC
GTAGGGCCGC TCGCAAACTG A
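
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be joined before any computation. The following is a minimal Python sketch of how the blocks could be joined and a GC percentage computed. The function names `join_blocks` and `gc_percent` are hypothetical, and the method the database used to derive its listed GC content (including rounding) is not stated in the record.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases among all bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used only as a small demonstration.
first_line = "ATGACCACCA AGTCTACTCC CAAAGACCTC ATTGACTCGT TTCCGCACAG CAAACTCACA"
seq = join_blocks(first_line)

print(len(seq))                   # 60
print(round(gc_percent(seq), 1))  # 48.3 (29 of 60 bases are G or C)
```

Passing the complete gene sequence to `join_blocks` and then `gc_percent` would give a figure to compare against the GC content field in the Gene Information table.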
 
Protein sequence
MTTKSTPKDL IDSFPHSKLT PIATATTEPD YLSLHQLQYE INDNAETLSS TLGDGQHGHL 
FLVISETEYL EMTDGVPCIP PVQPPFDPVH AANATAPQII EANHQNDKRQ KLFDLYHNAI
KINSLKPFPL NTLNLSVILH EALTKSLPSK SSLISGKILQ LFQQLDKGNQ FIIASGQVMD
ERIIARIGYQ IIEKTGLFDL ASRDWHYKDE ADKTLANFKK HFQKANKDLA LTATSSSAGY
HTANQSTVTK GKSYCWTHGI VHNTKHTSAT CEKQAPGHKT GATLHDKQGG STKTYQYTPP
SSVAPNTPPL ASSPPFFPPD AIADTGCTGH FLSTNIAHIH CQPTVPGINV VLPDGRTITS
SHITELNIPS LPPGARTAHI FPGLSNGSLI SIGQLCDHGC TATFTSDSVR IELNNTVVLR
GGRSPYTRLW TLDSPVTPNP PATELHAPLH DKNFANHLGD HSGTLADRIA FVHASLFSPQ
LSTWCKAIDK GRLTTFPDIT SAQVKRHPPQ SVPMVKGHLD QQRSNLRSTK PKVTLSASVD
PDDINFDTNP VVQDPPAART QFLYADFAEV TGKIFTDPTG RFVTTSSSGN AYMLVVYDYD
SNFIHVKAMK NRTGPEILSA YKRAHAMLSS KGLRPQLQRL DNEASTALQQ FMSSVDIDFQ
LAPPHVHRRN AAERAIRTFK NHFIAGLCST NKNFPLHLWD CLLPQAIMTL NLLRGSRINP
NLSSWAQLHG SFDYNHTPLA PPGIRVLVHE KPSIRRTWAP HAADGWYVGP AMNHYRCYRV
WVKETTSERI SDTLTWFPSQ VKMPSTSSRD TIVAAAHDLA HALAHPSPAS PLSPLSVNER
EALSQLSDIF SKAANPSASE TRSLNHFRGC RPPQPPFRSH FRGCLPRTPR LRHTSLSPAT
LAKHVVGPLA N