Gene PHATRDRAFT_42495 details


Gene Information

Locus tag: PHATRDRAFT_42495
Symbol:
ID: 7196679
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011669
Strand:
Start bp: 214447
End bp: 218323
Gene Length: 3877 bp
Protein Length: 1233 aa
Translation table:
GC content: 54%
IMG OID:
Product: predicted protein
Protein accession: XP_002177043
Protein GI: 219110583
COG category:
COG ID:
TIGRFAM ID:
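
The coordinate and length fields in this record are related by simple arithmetic. The sketch below checks that relationship. It assumes that Start bp and End bp are 1-based, inclusive coordinates, which the record does not state, and that the protein is followed by a single stop codon. The function name `span_length` is hypothetical and used only for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, ASSUMING 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Values copied from the record above.
gene_length = span_length(214447, 218323)
print(gene_length)  # 3877, matching the "Gene Length" field

# ASSUMPTION: unspliced gene (the record says "Is gene spliced: No")
# with one stop codon after the 1233 aa protein.
coding_bases = (1233 + 1) * 3
print(coding_bases)                # 3702
print(gene_length - coding_bases)  # 175 bases not accounted for by the CDS
```

Under these assumptions the listed gene span is 175 bp longer than the coding region alone, which suggests the span includes some sequence outside the CDS.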


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GAATTATATA TACCTTACCG AACAGAAACA TATACCTTCC CGAAAACAAA CCTACCGTTG 
TCCTCCTACT CTACCATCGC ACCCACTCTG CGCCCAAAAG AGAACATTTT ACAAACCGTT
GACATTGCCA ACAGCACGCA CTCTTGCGTC CGGAACGAAC GTACGTGTTG TTACAATGGT
TCGGACCCGT TCCAATTCGA AATCAACACC CCCATCGCAA CCCAACGAAC CCCATGACGA
CAACGACGAT GGTTCCTCTC GAAACGAATC GGACGATGAT ATTCCCAAAC CCGTCAGTTC
CGACAGCGAA AGTGACGACA CGCAGCCTCC GACAACACTT CCGGGAAACA CGTCTGTGCT
TCCGGGCACC GGCACGTCCC GTCTCTTTAG TCCCTACCGC AGTCTCGGCG TCGTCTCCAC
CGGGGCCCCC TTTCACTTGA TACCGCACGA TCACTCCGGC AACGCCATGG TCTGTGTACC
CATCGGTGAT CGTTTCCAAT TACTCCGCAC GGACCGACTC CATCCCGTCC TCGTCAGTCA
AGCCGTGCCG CACGATGTAC AGCACGTCGT CACCGACGCC ACCCTCAGTA TTACCGTCGC
CGCGCACGGC GACCGCCACG TCACGCTCCT ACACCGCACC CGACCCCTCG CGACCCGCGC
GTTGGCCGCT AGTACACGCT GGAGGACGGT GCAGCTCTTG CCCCTGGGAC GGACACCGGT
ACCGATGCGT GGCGAAAAAC AAGGTACCAT GGAGAACGCC GCCATTGTCG CCGCGATTCT
GCAACGCGCA CCGACCGTAC GGGATGATGT TCCCCTCGTT GGACAAGACG ATGATGACGA
TGACAACGAT GACAGTGAAT CGCTGGACAG TGACGACAAT GACGACGAAA CGACGTGCCT
GGGGCAAGTG GTTGTTTTGC TGGCCACCCG AGACTCGATT GCGATTCAGA GACGGATTCG
CTTGACCGCT TTGCCCAACT TTCGACCCAC GACCGCGTTG CATCCCTCCA CATATCTCAA
CAAGATCGTT CTGGGCGGAA CCAATCAACC CAACCAGGAT ACTCCCTCTT CTCCCGCCAT
GTGCTTGTTG AACGTGCGAT CGGCCCGAAT CATTCACGTA TTTGCTGGTC TTCCCTCGGA
AACCGAAGAA TCGTCCGTCA CTACCTTGGA ACAGTCTCCC GCCGTCGATA CGATTGCCGT
CGGAACCTCG TCAGGGAACG TTCACTTGAT CAATTTGCGT CACGATCAGA AACTGTTTAC
GCTCCGTCAC AAAACCCACG ACGGAAATAA CGTCGGCATT TCTTCCATCT CCTTTCGGAC
GGATCATTCC GCCTTGCAGT ACGATATCGC TCCAATGGCA GTGGGACGAG TGGATGGACA
CATTACCATC TGGGACTTGA CGGCACCTAC CGAGATAGAG TCGGGTCGCA CCGTGCTGCA
CGAAATGGAA CACGTGCACG TTGGCGGTGT TGCCAAAATC CAATACCTTC CCCAGGAACC
CTTGCTGGTC TCCATTGGAC TCGCGTCCAA TCGTATCGCG ATGCATATTT TCGATAGTCC
CGACCATTCC GCTCGCTTGT TGCGCTCCCG CCAAGGCCAT ACCGGTCCAC CCACACGGAT
ACGTTACCTA CATCCCGGAT CAGGCGCCGG AGGCGGCGTC CTCGTGAACG CGTCTGACGG
AACGGACGCG AGTGCCTGTC AAATTCTTTC CTCTGGTGGA CCCGATCGGA CGTTGCGCGT
CTTTTCTACC GCCCGTACCG TGCTCGACAA AGAATACAGT CAGGGGGCAG GGCTGGAAAA
GAAAGCCCGC AAATTCGGTA TGGACACGGT CGCGGAACTG CTCCTACCAC CCACCATTGG
CTTGGCCACG GCGGAATCGC GCGCTCGTGA TTGGGGTGAC TTGGTGACAA TTCACCGGGA
TCACGCTTTT GCCTACGTGT GGAGCACTAA GCGGGGCGCC CAGTCTGGGC CAATCTTGCG
GCAGTCGACG TGGAACGTAA GTGCCATGAA GATTCCACCA CCGCCGCACA CGCACGCCAC
GGCGGTCGCC ATGTCTGCCT GTGGAAACTT CGCTTTGGTG GGGACACGAG GTGGAACGAT
CTACAAGTAT AACGTCCAGA GTGGGAATGC TCGGGGAATG TATCCGATTC AAGCAAAGGA
AGATAGCAAG CCGAGAAGAC AGGTTGTAGC GGGAGATCTC GGGCGTACCA TGAAATCGTT
GGAAAATAGC ATGAAAGTCA GCAATCGGGC TGCCAATGTG GACAAGAAAG AACTCGATGC
CGAGCAAGAA GCGAAACGGG AAAGTCGTAT TTTGGCAAAG TTGCAAGCAG CATCCCACAC
GGGTCATTCC GTAACAGGAT TAGCGGTAGA TTCGGTAAAC AAGGTCCTCA TATCGGTAGG
GGCGGATGCA AAACTTATAT TGTGGAACTT TGCATCGCAT GCTCCGCACA AGAAAAGCCC
GTACACTTTA CCATGTCCCG CCACTCGAAT GTGCCACGTG CGTGATTCAG ACTTGGCAGC
CATTGCTTTG GAGGACTACT CTGTTGTGTT GTTCGATTGT GCAGCCCTGT CGATTGTTCG
TCGTTTTGGA GCTACTGGTG GTCACGTTGG ACCAATTAGT GACCTAGGAT TTAGTCCAGA
CGGCCGCAGT CTCTTTACTG CATCGCTGGA TTCTTCTTTG CGGGTATGGG ATGTACCTAC
CAACACGTGC GTCGACTGGC TCAGCTTCTC GACGGCTCCG ACGTCTTTGA CGATCAGTCC
AACGGGTGAG TTTCTAGCGA CAACACATAA GGGAAAACTC GGTATCAGTG TTTGGAGCGA
CCGAACCTAC TATCAAACCG TAAACATTGA CGGCACACCA CTAAAGGAAC CTGCGCGCAT
GGACGAACCG GTGCCCATGG CCGATGATGC TCCTTCACAA ACATACACAG CAAGTAAACC
GAGCGAGGAA AGAGTGCCTA CCGGTAACGA AAGTGAAGTG GACGGGGTTG ACGATAAAGG
CCCCGCCCTA CCCAAAGAAG CTGGACTCAT CACGCTCTCT GGACTGCCAC CTGCCCACTG
GAAGAATCTG TTTCACCTGG AACTCGTTAA GGAGCGCAAC AAGCCCAAGG AAGCCCCCCA
AAAGCCTCCT TCGGCACCGT TTTTCCTCCA ATGGCGTTCG GGTGAGTCGA TCAGCGAAAC
TGTTGCCAAT CCTCAGTTGG CTTCCAAATC CAAACAAAAC GTAAGTGAGG AAGAGGAGTG
GGTTGCGGCT TGGACTGATA ATGATGATGA CGAAGCAAAA GTCGATGTGC CTGAAGCAAG
TGGACTAGTC AAACGAGACC ACGAGAAAGT TGAAAAGGAA GCTGCGTCCT CCAAAAGACG
AAAAGTTACA CGCTATCGTT CAGCACTGGC GTCTCTTCTG GAACAATGTA ACAACAGGGC
TTCTGGATCG AATCAGAAAC GCTTCCAGTT GGTTACCGAC CATATAGGGA AGCTTGGACC
ATCCGCGATT GACGTCGAAC TTTCCACTCT TTGCAGTGGA TTGCATGATT TGGAAGAGGG
CCTTCCATTG TTACAGCTCA CCTGTTACTG GTTGTTGGAA GCTTTACAGT CACGCGAACG
GTACGATGCC GTTAATGCGT ACTTACATCG CTTTCTGCAC CTGCATGCAT CGGTTATTGT
TGGTATCGAC GAATTTTACC GTGGGGATGA TCAACCGCTA CGCGTGAAAC ATTCCGAGCA
AGAACGAATT GAGTTGGAGA CACAACGTGA CCAACGGATC CGCCTACTTG AGTCCATCAC
AGAGCTACAC GATGCGCAAA AATCGGCTTC AGAAGCGCTC CAGAACAAGA TGCAAAATAC
GCTCTGCCTC TTGCGGCACT TTTCTCGGAT GATCTAG
 
Protein sequence
MVRTRSNSKS TPPSQPNEPH DDNDDGSSRN ESDDDIPKPV SSDSESDDTQ PPTTLPGNTS 
VLPGTGTSRL FSPYRSLGVV STGAPFHLIP HDHSGNAMVC VPIGDRFQLL RTDRLHPVLV
SQAVPHDVQH VVTDATLSIT VAAHGDRHVT LLHRTRPLAT RALAASTRWR TVQLLPLGRT
PVPMRGEKQG TMENAAIVAA ILQRAPTVRD DVPLVGQDDD DDDNDDSESL DSDDNDDETT
CLGQVVVLLA TRDSIAIQRR IRLTALPNFR PTTALHPSTY LNKIVLGGTN QPNQDTPSSP
AMCLLNVRSA RIIHVFAGLP SETEESSVTT LEQSPAVDTI AVGTSSGNVH LINLRHDQKL
FTLRHKTHDG NNVGISSISF RTDHSALQYD IAPMAVGRVD GHITIWDLTA PTEIESGRTV
LHEMEHVHVG GVAKIQYLPQ EPLLVSIGLA SNRIAMHIFD SPDHSARLLR SRQGHTGPPT
RIRYLHPGSG AGGGVLVNAS DGTDASACQI LSSGGPDRTL RVFSTARTVL DKEYSQGAGL
EKKARKFGMD TVAELLLPPT IGLATAESRA RDWGDLVTIH RDHAFAYVWS TKRGAQSGPI
LRQSTWNVSA MKIPPPPHTH ATAVAMSACG NFALVGTRGG TIYKYNVQSG NARGMYPIQA
KEDSKPRRQV VAGDLGRTMK SLENSMKVSN RAANVDKKEL DAEQEAKRES RILAKLQAAS
HTGHSVTGLA VDSVNKVLIS VGADAKLILW NFASHAPHKK SPYTLPCPAT RMCHVRDSDL
AAIALEDYSV VLFDCAALSI VRRFGATGGH VGPISDLGFS PDGRSLFTAS LDSSLRVWDV
PTNTCVDWLS FSTAPTSLTI SPTGEFLATT HKGKLGISVW SDRTYYQTVN IDGTPLKEPA
RMDEPVPMAD DAPSQTYTAS KPSEERVPTG NESEVDGVDD KGPALPKEAG LITLSGLPPA
HWKNLFHLEL VKERNKPKEA PQKPPSAPFF LQWRSGESIS ETVANPQLAS KSKQNVSEEE
EWVAAWTDND DDEAKVDVPE ASGLVKRDHE KVEKEAASSK RRKVTRYRSA LASLLEQCNN
RASGSNQKRF QLVTDHIGKL GPSAIDVELS TLCSGLHDLE EGLPLLQLTC YWLLEALQSR
ERYDAVNAYL HRFLHLHASV IVGIDEFYRG DDQPLRVKHS EQERIELETQ RDQRIRLLES
ITELHDAQKS ASEALQNKMQ NTLCLLRHFS RMI
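
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch of how the blocks could be joined, the GC fraction computed, and a translation started at the first ATG. It assumes the standard genetic code, because the Translation table field of this record is empty. Taking the first ATG as the start codon is a simplification for illustration, not a general gene-finding method. All function names are hypothetical.

```python
from itertools import product

# Standard genetic code (ASSUMPTION: the record leaves "Translation table" empty).
# Codons are enumerated in TCAG order: TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate_from_first_atg(dna: str) -> str:
    """Translate from the first ATG until a stop codon or the end of the input."""
    start = dna.find("ATG")
    if start == -1:
        return ""
    protein = []
    for i in range(start, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Short excerpt copied from the gene sequence above (it spans a line break).
excerpt = """GTACGTGTTG TTACAATGGT
TCGGACCCGT TCCAATTCGA AATCAACACC"""

dna = clean_sequence(excerpt)
print(len(dna))                       # 50
print(translate_from_first_atg(dna))  # MVRTRSNSKST
```

The translated excerpt agrees with the first eleven residues of the protein sequence listed above. Passing the full gene sequence text to `clean_sequence` and then to `gc_fraction` would allow a comparison with the GC content field of the record.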