Gene PHATRDRAFT_42537 details

Gene Information

Locus tag: PHATRDRAFT_42537
Symbol:
ID: 7196086
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011669
Strand:
Start bp: 352279
End bp: 355509
Gene Length: 3231 bp
Protein Length: 1030 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002176566
Protein GI: 219109623
COG category:
COG ID:
TIGRFAM ID:
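
The listed gene length agrees with the start and end coordinates if both are read as 1-based and inclusive. That convention is an assumption here, since the record does not state it. A minimal sketch of the arithmetic follows; span_length is a hypothetical helper name, not part of any database tool.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
gene_length = span_length(352279, 355509)
print(gene_length)  # 3231

# A 1030 aa protein plus a stop codon needs (1030 + 1) * 3 = 3093 coding bases,
# fewer than the 3231 bp span, which is consistent with the gene being spliced.
coding_bases = (1030 + 1) * 3
print(gene_length - coding_bases)  # 138
```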


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CCCGAATACA TAAGTAGTAA ATTTTCAAGA ACCTCTGTAT TGATTTGGCT TCACAGGCCA 
CCATGGCACC TGCCACTCGG CAAATGACGG GCGGAGCGGT CTATGCGCAC CTTTTGGATA
ACGTGCTTCT TCTTCCCCAA GGGCACCCTA TTTGTCTCAG TTTTGCACAA CAAGGATACG
AATCGGCTGA TGACCTCCTA TGTATTTTTG AGAATGAACT TGAAACTCTT GAATTCATTC
CTCTTGCCCC TGCTGACGGC CCCGAAACTA CGGCACCGGT TGCCTTACTC ATGGCACATT
GACAGATCAT CCGTCATTTC CTCCGGTGGC AAGCGTCCCT TGAGCGTCAA AAGGGAACTC
CTTTGAAGAA CTCCGAGCTT GCAGCCCTGA ACAACAAAGA CTTTGTCCTG TACCGCCGAT
CCGCTCTCGG CCAGGTCTCT TCGACTGTTG CTCCAATAGT CACAAACCCC AATGCTGCAA
TTCCCACCGC TAAAACTCGA TCTGCTGTGG AAGATTTCAA GCGTGGGATC AAACGAGACA
AAACCCATTA CCCCGTGCTC AAAGACGACC GGTACTGGGA TAATTTCTAT CGATCCTTCG
TCGTCACTGC CGTCTCCCAT AACGTTGAGA AGGTACTTGA CCCATCATAC TTGCCTACTG
ATCCACTGGA AAAGTCGTTG TTTGAAGAAC AAAACAAGTT TGTATACTCA GCCTTGGAGC
ATACACTTCA GACGGACATG GGCATAAATA TCGTTCGAGA ACATAGTTTT GATTTCAATG
CCCAGGAAGT TTTCCGTAAA GTGGTCAAGC ACTATACAGA GTCCGCCTCT GCAAAGATCA
GCTCCTCTAC CACTCTAGGA TACCTGACCA CGGCAAAGTA TAGCTCATCA TGGACTGGCA
CAGCGGAGGG ATTTATCCTA CACTGGAAGA ATCATTTGCG TATATATAAT GATACCGTCC
CTACGGGTGA GCAGCTCCCA CAACAGCTTT GTCTCAGTCT ATTGGAGAAT GCTGTCCATG
ATATACCCGA ACTTCGTCAG GTTAAAATCA CGGCAACTTT AGACTTAGCA AAAGGTGGCA
GCCCTATTAG TTATGACGGT TATCTCAGTC TACTACTTGC ATCAGCATCG CTCTATGACA
ATGGCAACAA CCTATCTAAT GCTTGTAGCA ACAAGAACAA ACGTCATGTT TATTCTACTG
ACTTAGTCTA CCATCCAACT GACTTTGACA GCGATCTAGA CGTAAGTTAC GATATAGATG
TGTCACCCAC AGCAATCTAT GAAGCCAATG CCCATGCACG CAACTCCGGT AATAGTGGCA
ATCGCAGTCG CAACGCAGCT AGCCCCAGAG ACCGACCTTA TATTCCCCGG GAAATGTGGA
ATCAACTCTC AGAGGATGCA AAAGCCATTC TCCAAGGCTT GTCTGCTCCT AGTAAGAGTA
CATTACCGGC CGCGCAACCT TTTTCACAGG TGCTACAAGC CAATACGCAT AGCCATGGTA
GCAGCGAAAC CGCGGACACT TTCCATGATT GCGCACCGGA GACTGAGTTG TTGGCTCACC
TTACTGACCG CGTCAGTCGT ATGAACGATG GTGATATTCG TAAAGTCCTT GCAGCATCAC
GTGACAACGT CTCCCCACAA CCAGGAGCGA GACCCAAATT CATGCAATCC AATATGCTAC
GTTATCAAGT CTCTCGGCAT AATGTCAACG GTACCACTGC AGCTCTTGTC AATCGTGGTG
CTAATGGCGG ACTTGCCGGG GCGGACGTCA TGGTGCTCAA CAAAACAGGA CGTTCCGCCA
ATATAACTGG TATTAATGAT CACACATTGT CCGATTTGGA TATTGTCACC GCTGCAGGAT
GTGTTGAATC CCATACCGGT CCTATCATTG TAATTATGCA TCAGTATGCG TATCTTGGCA
CTGGTAAGAC TATACATTCC AGTGCGCAAC TCGAGCATTT CCATAACAAC GTTGAAGACC
GTTCACGTAC AGTTGGTGGA GACCAGCGCA TTGTGACCTT AGATGATTAT ATCATCCCCT
TGCACATCCG CCAAGGTCTT CCATATATGG ATATGAGGTG CCCAACAGAT GCCGAATTTA
CCTCTCTCCC GCATGTGATA TTGACCTCTG ATGTCGATTG GGACCCGTCA GTCCTTGACA
ACGAGATTGA TCTGGCCACC GATTGGTACG ACACTGTACA GGATTTACCC CACTACCATA
TGTCGAACCG CGTTTTGACC ACATGGGCAA ATATCTCCAT CGTCATATTT CGCTTTGTGA
CACTCGCCAC CATGCCGTTG ACTGTATCCT TCAATGTCAG CAGCATGAAA TTCAGCGTAA
TGACCATGAC TACGAAACCC TCCGTCCTTG TCTTGGTTGG GTATCCGCCG ATACCGTTCG
TAAGACTATA CAGGCCACCA CCCAGTATGC ACGAGAGGTA TACCACGCAC CGTTACGCAA
GCATTATAAG TCGCGCTTCC CGGCCTTAAA TGTCCATCGG CGTAACGAGC CAGTTGCCAC
CAATACCATT TGGTCAGATA CTCCTGCTGT TGATAGTGGT GCCAAATTTG CGCAACTTTT
CGTGGGCCGC CGATCCCTTG TCACTGATGT TTATCCCATG AAAACCGACA AAGAATTTGT
TAACGCTCTC GAAGACCATA TTCGGTTTCG CGGCGCTATG GACAAGCTCA TCAGCGACCG
TGCACAGGTC GAGATTAGTA AAAAGGTCAT GGATATCACC CGTGCTTACA ACATTGACCA
GTGGCAAAGC GAACCACACC ACCAACACCA AAATTTTGCT GAACGTCGCA TTGCCACTAT
CGAGGCTAAC ACCAACAACA TTCTCAATCA CACCGGTGCC CCTGACTCCA CATGGCTTCT
TTGTGTCACG TACGTGTGCT ATGTATTCAA TCATCTCGCC CATGAATCCT TGCACAACCG
TACACCCTTA GAAGTCCTTA CTGGTTCCAC TCCTGATATC AGTGTTCTTC TTCAGTTCCA
TTTTTGGGAA CCCGTCTATT ATCGACTCGA AGATGCGACC TTTCCGTCTG ATGGTACTGA
ACAAACGGGA CGTTTCGTAG GCATTGCTGA CTCCGTTGGC GATGCTCTTA CTTATAAGAT
CCTCAACGAT ACTTCTAATA GAATCCTCTA TCGTTCCAGC GTGCGCTCTG CAAACCTTCC
CGGTGAAACC AACCTACGCC TTACATCACA GGATGGGGAG AATGGCCCTA A
 
Protein sequence
MAPATRQMTG GAVYAHLLDN VLLLPQGHPI CLSFAQQGYE SADDLLCIFE NELETLEFIP 
LAPADGPETT APIIRHFLRW QASLERQKGT PLKNSELAAL NNKDFVLYRR SALGQVSSTV
APIVTNPNAA IPTAKTRSAV EDFKRGIKRD KTHYPVLKDD RYWDNFYRSF VVTAVSHNVE
KVLDPSYLPT DPLEKSLFEE QNKFVYSALE HTLQTDMGIN IVREHSFDFN AQEVFRKVVK
HYTESASAKI SSSTTLGYLT TAKYSSSWTG TAEGFILHWK NHLRIYNDTV PTGEQLPQQL
CLSLLENAVH DIPELRQVKI TATLDLAKGG SPISYDGYLS LLLASASLYD NGNNLSNACS
NKNKRHVYST DLVYHPTDFD SDLDVSYDID VSPTAIYEAN AHARNSGNSG NRSRNAASPR
DRPYIPREMW NQLSEDAKAI LQGLSAPSKS TLPAAQPFSQ VLQANTHSHG SSETADTFHD
CAPETELLAH LTDRVSRMND GDIRKVLAAS RDNVSPQPGA RPKFMQSNML RYQVSRHNVN
GTTAALVNRG ANGGLAGADV MVLNKTGRSA NITGINDHTL SDLDIVTAAG CVESHTGPII
VIMHQYAYLG TGKTIHSSAQ LEHFHNNVED RSRTVGGDQR IVTLDDYIIP LHIRQGLPYM
DMRCPTDAEF TSLPHVILTS DVDWDPSVLD NEIDLATDWY DTVQDLPHYH MSNRVLTTWA
NISIVIFRFV TLATMPLTHE IQRNDHDYET LRPCLGWVSA DTVRKTIQAT TQYAREVYHA
PLRKHYKSRF PALNVHRRNE PVATNTIWSD TPAVDSGAKF AQLFVGRRSL VTDVYPMKTD
KEFVNALEDH IRFRGAMDKL ISDRAQVEIS KKVMDITRAY NIDQWQSEPH HQHQNFAERR
IATIEANTNN ILNHTGAPDS TWLLCVTYVC YVFNHLAHES LHNRTPLEVL TGSTPDISVL
LQFHFWEPVY YRLEDATFPS DGTEQTGRFV GIADSVGDAL TYKILNDTSN RILYRSSVRS
ANLPGWGEWP
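
The sequences above are printed in groups of ten characters, so the spaces and line breaks have to be removed before a length or GC content can be checked against the record. The sketch below shows one way to do that. The function names clean_sequence and gc_fraction are hypothetical, and it is an assumption that GC content means the share of G and C among all bases in the gene sequence.

```python
def clean_sequence(block: str) -> str:
    """Join a sequence printed in whitespace-separated groups into one string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases among all bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence shown above; paste the full block to check
# the whole gene against the listed length and GC content.
first_line = "CCCGAATACA TAAGTAGTAA ATTTTCAAGA ACCTCTGTAT TGATTTGGCT TCACAGGCCA"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(100 * gc_fraction(seq)))  # GC percentage of this one line only
```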