Gene PHATRDRAFT_44995 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_44995 
Symbol 
ID7199513 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011673 
Strand
Start bp926228 
End bp930154 
Gene Length3927 bp 
Protein Length1217 aa 
Translation table 
GC content53% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179092 
Protein GI219116594 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00833272 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GAGACAGGTC AGATATCCTT TCGTGTAGTG TTGCCTCACA AGTCAGATTC TAAACGGCAA 
CCATGATGAA GCTGGCGTTC TTGGCAACCC TCGCAACGTC CCTGGCGCTG GCGTACGGCA
ATGATGGTTC AATGGTCGGT ATGTGCAGCA ACAGCTTTGT CGAGTTCGAT TGAGTGAAGA
CCATAGGTAC TGGAATACTG ACATGAGAAA TTCCATCTCG CTCCTATTTG TCCAGCAGGC
TCTAACGAAG AGCGAAACTT AGCCGCGATG ACTTGCACAA CCAAGAATTT GGACTTTAGC
GAGTTTGCCA CCGGCACATA CTTGAGTAAC TTGGAGGCTG ACTATGGGGT GACTATCACT
GCCGTTTCCC GTACAAGCAA GGGCTACACA CCCAACGGAG CCGCCCGTGT TTTCGACACG
TCAAAGCCCA CCGGTAAGAC GGGACAGTCA ATGTGCGGAA GAAACGACGG GGATTCAGAT
CTCGGATCAC CCAACTCCGC TTGTCCCGGA GGTGGACCAG GTCATGGACC TGGAGGTGCA
CCAAAACTCG CCAACGGACA GAACAATCCT TATAAGAACT GCTCGCCCCA AGGCAAAGTA
CTCATCATTC AAGAACGGAA CAAAAACTGT CCCGACGACA GTGCGGACGG CGGTACCATC
CGCTTCGACT TCTCCAAAAC AGTGGACCTC GAGTCCGTGA CGTCCTTGGA TATCGACGAA
GGTAACAACA CTCCGGAGAT CACCGTCTCG TACGGCAACG GCCAGGAGGC TTTTTATAAG
CTGCAGGCTA CGGGCGACAA CGGTGTTTTC ACGCAAATGA TCAACAAGAG TGACGTCAAG
TGGTTCCAGA TCAAGTTCTA CGGCTCAGGA TCCGTATCAG GCTTCAAGTG GGATGAGTGT
GTCACAGCCC CAACGAAAGC CCCCACGAAA AGCCCAACAA AGGCTCCGAT ACCGGCCCAA
ACCAGAGATG ATACTTGTCC AATAAAGAAC TTGGACTTTA GTGAATTTGC CACCGGAACC
TACTTGAGTA ACTTGGAGGC TGACTATGGG GTGACTATCA CTGCCGTTTC CCGTACAAGC
AAGGGCTACA CACCCAACGG AGCCGCCCGT GTTTTCGACA CGTCAAAGCC CACCGGTGCA
ACGGGGCAAT CAATGTGCTC CTCTGGTGAC GGTGACTCTG ATCTCGGATC ACCCAACTCC
GCTTGTCCCG GAGGTGGACC AGGTCACGGA CCTGGAGGTG CGCCAAAACT CTCAAACGGT
CAAAACAATC CTTACAAGAA CTGCTCGCCC CAAGGCAAAG TACTCATCAT TCAAGAAGGC
AACAAAAATT GTCCCGACGA CAGTGCGGAC GGTGGTACTA TCCGCTTCGA CTTCTCCAAA
ACAGTGGACC TCGAGTCCGT GACGTCCTTG GATATCGACG AAGGTAACAA CACTCCGGAG
ATCACCGTCT CTTACGGCAA TGGCGAGCAG GCTTTTTATA AGTTACAGGC TACGGGCGAC
AACGGTGTAT TCACCCAAAT GATTAACAAG AGTGACGTCC AGTGGTTCCA AATCAAGTTC
TACGGCTCTG GATCCGTATC AGGCTTCAAA TGGGCCGAAT GTGTCACAGC CCCAACTAAA
GCTCCAGTAA AAGCCCCAAC CAAAGCTCCG ACAAATGCTC CCACAAAAGC ACCTGTTAGA
GCTCCAACCA AGGCTCCAAC CAAAGCTCCG ACCAAGGCTC CAACCAAGGC TCCAACCAAA
GCTCCAGTCA AGGCTCCAAC GAAAGCTCCA GTCAAAGCTC CGACCAAGCC GCCAGTCACG
GCAGCACCAT CCGAGTGCGT GGACGGTATG GACGTGGTAC TTGTCAATAA GTCCACCGGT
CCCGAGAGCA CCATTGATGG CAAGAGCCCA ATCAAAATTG TTAGCGGTGA CGGCCAGTCC
GTCTCTTTTG AGGTGCACCA ATACTGGAAG AGTGGAGCTA GTAGTATCAG CTGGATAGCG
ACTCAGTTCC GTACCAACGA TGGCAACACG GCGTCTGATG CATGGGAATG CGAAAAGATC
GAAGAAGTCT CCTGGGGCAG GGTGAAGGAG TACACTGCTG AGTGTGTTGG AGGAGCAGCA
ACCGTGACCT TGTGGGTACA CGACGGACAG TTCCAAAATA CTCAAAACCT GAACAGCCTG
GTCCCTGCAA GGTGCAATCC TTCAAACGAC CAGTTTCGCA AAAAGATCAT GTATAATTAC
ACGTTGCCTT GCTCCTCGAT ATGCGCTCCA TCCCCGACTA AAGCTCCTGT CAAAGCCCCG
ACCAAGGCGC CAACCGGTAC ACGAGATGAA ATATGTGTCG ACGAAGTCCT CGACTTTACT
GACTTTACTA CAGGCGAGTA CGTCCACGAC CTGGTACGAG CTCGCGGCGT TACAGTGACA
GCAATTGCAT CCGGAAGCGA CGGATACACG CCCGGCGGTG CGGCTCGCAT TTTCGACACT
CGCTACCCTT CCGGCAGCAC TGGACAAGCG CTCTGCGCCC AGAACGAAGG TGAAACAACT
CTCGGGTCAC CCAACCTTTC GTGCCCCGGC GGTGGACCCG GATCGGGTAA CGGAGGCAAA
GTCAACACGC CCTTCGCCAA CTGCGACGCT CGCGGTAAAG GTCTCATCAT TCAAGAAGGA
AACGTGGCCT GTCCTGAGCA CGCTGGACAA GGCGGGCAAA TCGTGTTTGA GTTTGCGGTA
CCGGTTGAGC TCAACTACAT CGATTTGCTG GTTAGCACCG ATTCCAGCCC CGTAATTACG
GTGTACTACG GCGTAGACCA ATCCATGTCG TTTGATATGC CGATGATGGG TGCCAATGGC
TACCATCGGC AAGTGATCGA TCGATCGCAG GTTTACAAGG TCGAGGTGGG CTTCTGCAGT
GGAGGTACCG TTACTGCCAT AGATTACGTT CGTTGCGAGC CGGAGGGTCC GCCAACGAAA
GCTCCAGTGA TAGCTCCAAC AAAGGCTCCT GTCAAAGCCC CGACCAAGGC ACCCATTGGT
ACCCGAGACG AAATATGTGT TGACGAAGTC CTCGACTTTA CTGACTTTTC TACAGGCGAG
TACGTCCATG ACCTGGTACG ATCTCGCGGC GTTACAGTGA CAGCAATTGC ATCCGGTAGC
GATGGCTACA CCCCAGGCGG TGCGGCTCGC ATTTTCGACA CTCGCTACCC TTCCGGTAGC
ACTGGACAAG CGCTCTGCGC CCAGAACGAA GGTGAAACAA CTCTCGGGTC ACCCAACCTT
TCGTGCCCCG GCGGTGGACC CGGATCGGGT AACGGAGGCA AAGTCAACAC GCCCTTCGCC
AACTGCGAGG CTCGTGGTAA AGGTCTCATC ATTCAAGAAG GAAACGTGGC CTGTCCTGAG
CACGCTGGAC AAGGCGGGCA AATCGTGTTT GAGTTTGCGG TACCAGTTGA GCTCAACTAC
ATCGATTTGC TGGTAAGCAC CGACTCCAGT CCGGTAATTA CAGTGTACTA CGGCGTAGAC
CAATCCATGT CGTTTGATAT GCCGATGATG GGTGCCAATG GCTACCATCG GCAAGTGATC
GATCGATCGC AGGTGTACAA GGTCGAGGTG GGCTTCTGTA GTGGAGGTAC CGTTACTGCC
ATAGATTACG TTCGTTGCGA GCCTGAAGAG GAGTGTCCGC CGAGTAGCGG TTCAGTCAAA
CCGCTCCCTC CGATCGAAGT GCATCTTCCC CCGCCGAACA GCAAGCACAT GGTTTTTGAC
TTTGTCGTTC TAAAGAATCA AGAATCGTGT CCTCCGGAAT GGCTTGGTCG CAGGGAACGT
CGCGCTTTGG TAGATACCCG AGGACGACGT TGAGTCGATA ATGAAGCAAC GTGGTCTTCT
CTCAAAATCG TCCACGACAA ACTCTTCATA TATTCATTGC GTTGAACCAT AGGGAAACGA
TAATTACTTG TGGCTCATTA TTGGATT
 
Protein sequence
MMKLAFLATL ATSLALAYGN DGSMVAGSNE ERNLAAMTCT TKNLDFSEFA TGTYLSNLEA 
DYGVTITAVS RTSKGYTPNG AARVFDTSKP TGKTGQSMCG RNDGDSDLGS PNSACPGGGP
GHGPGGAPKL ANGQNNPYKN CSPQGKVLII QERNKNCPDD SADGGTIRFD FSKTVDLESV
TSLDIDEGNN TPEITVSYGN GQEAFYKLQA TGDNGVFTQM INKSDVKWFQ IKFYGSGSVS
GFKWDECVTA PTKAPTKSPT KAPIPAQTRD DTCPIKNLDF SEFATGTYLS NLEADYGVTI
TAVSRTSKGY TPNGAARVFD TSKPTGATGQ SMCSSGDGDS DLGSPNSACP GGGPGHGPGG
APKLSNGQNN PYKNCSPQGK VLIIQEGNKN CPDDSADGGT IRFDFSKTVD LESVTSLDID
EGNNTPEITV SYGNGEQAFY KLQATGDNGV FTQMINKSDV QWFQIKFYGS GSVSGFKWAE
CVTAPTKAPV KAPTKAPTNA PTKAPVRAPT KAPTKAPTKA PTKAPTKAPV KAPTKAPVKA
PTKPPVTAAP SECVDGMDVV LVNKSTGPES TIDGKSPIKI VSGDGQSVSF EVHQYWKSGA
SSISWIATQF RTNDGNTASD AWECEKIEEV SWGRVKEYTA ECVGGAATVT LWVHDGQFQN
TQNLNSLVPA RCNPSNDQFR KKIMYNYTLP CSSICAPSPT KAPVKAPTKA PTGTRDEICV
DEVLDFTDFT TGEYVHDLVR ARGVTVTAIA SGSDGYTPGG AARIFDTRYP SGSTGQALCA
QNEGETTLGS PNLSCPGGGP GSGNGGKVNT PFANCDARGK GLIIQEGNVA CPEHAGQGGQ
IVFEFAVPVE LNYIDLLVST DSSPVITVYY GVDQSMSFDM PMMGANGYHR QVIDRSQVYK
VEVGFCSGGT VTAIDYVRCE PEGPPTKAPV IAPTKAPVKA PTKAPIGTRD EICVDEVLDF
TDFSTGEYVH DLVRSRGVTV TAIASGSDGY TPGGAARIFD TRYPSGSTGQ ALCAQNEGET
TLGSPNLSCP GGGPGSGNGG KVNTPFANCE ARGKGLIIQE GNVACPEHAG QGGQIVFEFA
VPVELNYIDL LVSTDSSPVI TVYYGVDQSM SFDMPMMGAN GYHRQVIDRS QVYKVEVGFC
SGGTVTAIDY VRCEPEEECP PSSGSVKPLP PIEVHLPPPN SKHMVFDFVV LKNQESCPPE
WLGRRERRAL VDTRGRR