Gene PHATRDRAFT_31525 details


Gene Information

Locus tag: PHATRDRAFT_31525
Symbol:
ID: 7196692
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011669
Strand:
Start bp: 284490
End bp: 287886
Gene Length: 3397 bp
Protein Length: 1104 aa
Translation table:
GC content: 47%
IMG OID:
Product: predicted protein
Protein accession: XP_002177057
Protein GI: 219110611
COG category:
COG ID:
TIGRFAM ID:
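
The length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive (which matches the reported 3397 bp) and that the stop codon is counted inside the gene span; the function names are illustrative and not part of any database API. The 82 bp left over after subtracting the coding length is consistent with the gene being flagged as spliced, though the record itself does not list intron positions.

```python
# Values copied from the record; helper names are hypothetical.
START_BP = 284490
END_BP = 287886
PROTEIN_LENGTH_AA = 1104


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a span given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode the protein, optionally with a stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


span = gene_length(START_BP, END_BP)
cds = coding_length(PROTEIN_LENGTH_AA)
non_coding = span - cds  # bases in the span not accounted for by codons

print(span, cds, non_coding)  # 3397 3315 82
```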


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACATTGT CAAACTACCC CCGCCATCCG TCGCCGCTGG GTGGATTGAT GGTGCCTAAA 
TTCAACACAA TGGTTTTGTA TTGTACAATT TTAGTCATAT TTATTGCGCT GAATATTGTT
TTGGCGGGGA AGGGGCCGGT TTCGCAACGA CGCGTCGCGT CCTCGGCGCC TCTTTTTCAA
TTTGACTTGC AACCGACAGG ATTTCTAAAA GAGGCTCGTC AACGCCGCAG ACTACAGAGC
AACTCGTCTT TCGTCATACC GGATGAATGG CTGCGACCGG AACGAAGTGC CGTTTATGCC
GGTATTCCTG ATGCCCGTGA TCCACGTTAC CGACACAACC GTCAATTGCA AGAGTTTCAT
GACCTCCGAC ACTTGAGTCG ATACGAGCAG GCTTATCGGA CCCAAAATAA TATTGATTTA
CGAGAAAAAT GGGATGAAGA ATACTCTTTT GAAGACGAAC AAAAGGAGAT TCCCAAAAGC
GCAGAGCTTC GCAACCAGAC TGCAGAAAAT CGATTGCGTT CACACTCCGA CAAAAGGAGA
AGGACACAGG AGGCTGCTCC CGTTGCTGGT GGACAATACA ACAATTATCA GGCCGTACCC
TTGGCACAAG GCTATGGAAC GCACTACGTT AATGTCTGGG TGGGATCTCC CTTTCCGCAA
CGAAAGACGG TCATCGTCGA TACCGGCTCG CACTACACTG CTTTTCCCTG TAATGGATGT
CAAAATTGCG GTTCGACGCA TCATACCGAT CCTTACTTTG AACCAAAAAA GAGTGCATCG
TTTCATCAAC TCCAGTGTGA CGAATGCCGG GATGGTATCA CATGTCAAGA CGGGGAGTGT
AGGTTCAGCC AGTCCTATAC GGAAGGCAGC TCATGGGACG CCGTACAAGT TTTAGATCGA
TTCTATTGCA GTGGTTCCGA TATTATCGAC TCCGTTTCTT TAGAAGACCA ACGAAACTCA
ATCGACTTTA TGTTTGGATG TCAAAAGAGT ATGACTGGCT TGTTCATTAC GCAGCTTGCG
GATGGAATAA TGGGAATGTC AGCACACCAG GCGACCTTAC CGAAACAGTT GTACGACAGA
CACATGATTG AACACAATAT TTTCTCCATG TGTTATCGTC GAGAGCTGGG CACAAGCAAG
CGTGGTGTCA TGGCAGGTAG TATGACAATT GGAGGAATAT CCACCAACCT CGATACCAGC
CCCATGGTGT ATGCCAAGAA CATGGCCAAA ATTGGATGGT ATACAGTCTA CGTCAAGAAC
ATTTATATTA GACAAGGAGG CGGACAATCT GCCAAGAGTG TGGATCCTGA TCACCGTACG
ATCAAAGTAA AGATGAATCC TGCTGTTCTT AATAGTGGTA AAGGTGTCAT TGTGGATTCG
GGAACAACCG ATACTTACCT CAACAAAGAT GTAGCTCCGG AGTTTAATAT GGCTTGGCGT
CAGGCAACTG GTCAGTCGTA CTCTCATCTA CCGATGAGAC TCTCGCCGGA ACAAATTCTG
GAGCTGCCTA CTGTTCTCGT GCAATGTCAT GCCTACAGAG AAAATTTGGA TCCGTCAATC
GAGGGTTATG AAGATATTCC TGGGTACGCT GGTCGTTTGG ACCCATCGTC CCCCAATGAT
TTGCTCATTG CGATTCCAGC CACCAGCTAC ATGGACTTTT CTCCGATCAC GTCCATGTAC
ACCAGCCGAA TCTATTTCAG TGAAACATCT GGCGGAGTCT TGGGAAGTAA CACTATGCAA
GGGCACAACG TTGTCTTCGA CTGGGAAAAC GGCCGTGTTG GGTTTGCAGA GAGTTCTTGC
ACGTACGACA AGAAATCCGT GCCGGAAGTT GCACAGGATA ACGGATACTC AAAGGACTGC
ACAGTGCACG CTCCAATTCT ATCTACGCCA TGTATTGATA CCGTGCACCG GGAAATTTGC
GAACACGCCT CCTCGAATAT TGCTCTCCTG GGCAACGAAA CTTGGACTGG TATTGTCGAG
AGTGCTGGCA GCAAAGAGGG TGTTCAGTGT ACTGAAGTAG CAAGAGAATC TTCATCAAAG
AGTGTGTTCC AGAACTCTGA CGTTGACTGC AATGGCAAGG GAACTTGCGA AGAGAAGAGA
TCGTGTCAAC TTACATGCGC CGAAGCCATA GTAGCTGCGA ATGTGTCAAA AGCGCCTATT
TCTGAAAGCA TCAGATATGA CTGCGGAGAT TCTTTGTGGA GCACATGCGA TCATGGGTGC
GAGCAGACTC GGATTGTATC AGCCGCTCAT ACAGACGGAA TCTGCCACGA AGAGCACAGA
TTCTCCCGAC CTTGCCATAT CGAAGCTTGC GCCCGTTCAG ACCCTTGTCT TGTCCCATTT
CTCATTCATA CAGTTGTTGG CCTTCAGGGA ATATCGGTTT CAAAATGGAC AGGATCCTCG
GAAAATACTT TTGTTTCAGC TCTGACAAGT GTGGCACGTG CACTTAATCC ATTAGAAACA
TTTGGCGAAG GTGATGTGAA TGTGCTGCTA GCTATTCCAT GGCATGTGGA CGAAGACGAT
CCAGACCAAG GTACTCATGT ATCCAAACCT ATTGGAACAA AAATTATTCT GGAGATTTCA
ATTTTTAACA ACCTTTCCAA TGCTACATCC ACTGTAACCA GCGATACAGA CGATTCATCC
GTCAAAGGAA TACTGTGGAA TATTACCGAG CGCATAAAGA CGCGGCTACC CGATACGATC
TGCAACTCGG ATGATATGTA CACGCTTGCA AAGAAAACTC TCTCCATAAA AAAGCGTGTC
TTAGAAAGTC AGCTTTTCAT TGGTTCGCTT ATTCACGAAA TGGAGAGAAT AGAATTATCA
GACCCAATCT CAGCGGCAAT TTCGATGTTT TCGCCACTTT TCCATACAGT TTCCCTGGAG
AGCGAGAATG AAAGCCGAGT TGTATCTTCT TGGACGATCC AGACAACGAT CGATGATCAG
ATCAACTACT TCGTAAGTGA AGCAATAGAA AAAGTCTTTG CTGAGTGCAA TGACATCAAC
CTCACTGCAG TCAACTTGAC ATTTTCCCAA ATAGGGACCG CCCAGGCCAA TTTGGCATAC
AATGTTAAGC TTCATTCATG TCGTGTTGTT GACGATGATC TTTTTTTTAA TCCTGACATC
AGCTTGGACA CTGCTCGTGT CTATGTATGA ATATCTACTG GAGCGCGGTT GGTTTCTACG
AGTAAGGAAA GGCCGACATC GATATTCTCC AGCTAAGGCC TGTGACGACT CACAGACAGC
TGATCAGGAG CTCGAACTGG CGGATGGCGG TAATCTGGAA ATGACGGTAC AAAACACTGA
TTTTCAAAGC CGAGGCGGAA CTAAAGCCAT CAAGCGAAAG ACATCTCCAG CCGATCGTGA
AGGCACGATT GTGAAGAATA TTTCGAGAAC GACCTAG
 
Protein sequence
MTLSNYPRHP SPLGGLMVPK FNTMVLYCTI LVIFIALNIV LAGKGPVSQR RVASSAPLFQ 
FDLQPTGFLK EARQRRRLQS NSSFVIPDEW LRPERSAVYA GIPDARDPRY RHNRQLQEFH
DLRHLSRYEQ AYRTQNNIDL REKWDEEYSF EDEQKEIPKS AELRNQTAEN RLRSHSDKRR
RTQEAAPVAG GQYNNYQAVP LAQGYGTHYV NVWVGSPFPQ RKTVIVDTGS HYTAFPCNGC
QNCGSTHHTD PYFEPKKSAS FHQLQCDECR DGITCQDGEC RFSQSYTEGS SWDAVQVLDR
FYCSGSDIID SVSLEDQRNS IDFMFGCQKS MTGLFITQLA DGIMGMSAHQ ATLPKQLYDR
HMIEHNIFSM CYRRELGTSK RGVMAGSMTI GGISTNLDTS PMVYAKNMAK IGWYTVYVKN
IYIRQGGGQS AKSVDPDHRT IKVKMNPAVL NSGKGVIVDS GTTDTYLNKD VAPEFNMAWR
QATGQSYSHL PMRLSPEQIL ELPTVLVQCH AYRENLDPSI EGYEDIPGYA GRLDPSSPND
LLIAIPATSY MDFSPITSMY TSRIYFSETS GGVLGSNTMQ GHNVVFDWEN GRVGFAESSC
TYDKKSVPEV AQDNGYSKDC TVHAPILSTP CIDTVHREIC EHASSNIALL GNETWTGIVE
SAGSKEGVQC TEVARESSSK SVFQNSDVDC NGKGTCEEKR SCQLTCAEAI VAANVSKAPI
SESIRYDCGD SLWSTCDHGC EQTRIVSAAH TDGICHEEHR FSRPCHIEAC ARSDPCLVPF
LIHTVVGLQG ISVSKWTGSS ENTFVSALTS VARALNPLET FGEGDVNVLL AIPWHVDEDD
PDQGTHVSKP IGTKIILEIS IFNNLSNATS TVTSDTDDSS VKGILWNITE RIKTRLPDTI
CNSDDMYTLA KKTLSIKKRV LESQLFIGSL IHEMERIELS DPISAAISMF SPLFHTVSLE
SENESRVVSS WTIQTTIDDQ INYFGPPRPI WHTMLSFIHV VLLTMIFFLI LTSAWTLLVS
MYEYLLERGW FLRVRKGRHR YSPAKACDDS QTADQELELA DGGNLEMTVQ NTDFQSRGGT
KAIKRKTSPA DREGTIVKNI SRTT
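
Both sequences above are printed in space-separated blocks of ten characters, sixty per line. A minimal sketch of how such a dump might be turned back into a plain string, and how a GC percentage like the one in the Gene Information fields could be recomputed, is given below. The helper names are hypothetical, and the sketch assumes the GC figure is simply the share of G and C bases in the gene sequence; the record does not state how its value was derived.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence dump."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGACATTGT CAAACTACCC CCGCCATCCG TCGCCGCTGG GTGGATTGAT GGTGCCTAAA"
seq = clean_sequence(first_line)
print(len(seq))  # 60
```

Passing the full gene sequence text through `clean_sequence` and then `gc_percent` would give a value to compare against the listed GC content.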