Gene PHATRDRAFT_41559 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_41559 
Symbol 
ID7199398 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011699 
Strand
Start bp162840 
End bp167263 
Gene Length4424 bp 
Protein Length1148 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185494 
Protein GI219130695 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GGTCGGTCGC TTGGCGTTGA CTTTGCGCCG CAGATTGGCC GGTATTTCTG CAGCGGAACC 
GCTGTCGTCA GTCACAACTG TCGTTTTCGG TTTGGCGAAA ACAGCCCAGG GCGATGCGCG
ATCCTTCCTT TTCTGGGCAG GAAAGGCCGC GGAAATTGTC AAAAAGGCTC CATGTGACAC
CGAGAACGAA GAAACAACTG TCGGTGCCAA AACAGAAAGA ATCGCGCAAC ACCGTACCGA
TCGACGACAA CGGCATGTCC TCATGGTAAC GATGGCAATT CCTACGAATG TTTGCTTGCT
TCTCTCAGAA ATATTCCGTT TCTTGTGAGT AGTGGTTGTC CGAAAGCGAT TGCCAAGTGT
ATTTACAGAA GGTTGTTGTG CTGTGCAGTC ACCCGTACGC AAAATCGATG GTGTCACCGA
CTCGCGGCCA GACGTGTCCG GGATGCCTCC GTCGGGCGAC TCCCACAATC CATCACCGAC
GTGACCTCGC CGAGACACAG TGCGGGCGCC TCCGCAGTAC GAACCAAGCG ACAGGCATAC
GGACACTGGC CAAACCATTG CCTAGCACCC GTTTGAGCTG ATTCTTTTTT TCTGACCCCT
GGCAAGCTTT AAGAATCCAG AATGATGGAG ATGGAGGAGC ACGATTTGGC GCAACAAAAA
GACCGTAAGA TGCCAGCCCA ACCTCTCCAT CCGAACCATG TGCGTATCTA CAGGAATTTG
GAATTTCATT TAGTTCAACT TGCGTTTGTT CATCCTTCAA GCGGTATTCG TCTTCTATGT
GGAATAGAGA GTCACGAAAT TTTGGTTGTA TTGAATACCG GGTCTATAAG GAACTGTCTC
TGTTTCTCTG TGGAACAGTG ACGTGGCCTT CGGAGAGACA TCCAGTCTAC AGCTATCTTT
CGTTACTGTG GTCGCGGATT TGGGATCGAC TGACCTCACC GAGGTTGTTC CCAGAGATGT
TTCTGATTGG TTGATGTGTG CCAATTTAAT TTCGCATGCT CGGCTCTGAT TGGTGTTCGT
AACCAAACGT TCCTTCCCCA GCGTGCATTG ACTTGACGAC ATCGTATACG ATAGCCGGTA
AAGTCTCGAG CTGTTCTGAC AGTAAGGGAG AAGGACACGC GCTCGAAGTG CAACTCGTTT
ACGTTTCTTC CGCGGTGGGG TTCCGTGTTT CTTGGCGTAG TGCGTCGTCG TTGTTGTCCT
CACGGGGGCG GGGGTATCCA ATCTTGTCGG ATGGTTGTGG TGCCCGTGTC TCTCACAAAC
CCCGTTTTCT ACTCGTTGCA CCCTTATTAT ACCCTTATCT GTTGTGTCTA TACCCTTTGC
TGTGTCTTGG TTGGTCTTCA TTACCTGCCA TTTGCCACCT GTAATCTGCT ACCTTTGCAC
TCTAATTCAC AGTCGCAGTC CACAAACGAG GCAGTTTCCA GGACTACCGA CAGTTCGAAC
CACACTTTGC TACAGAATTC GAATCTCTCG CAGCAGCCTC CCCTCCCTCT TCTGCTTCCT
GCCACTGATT CTTTGCGGCA TCCCGCAAAC CCTCTCTACA GCAGGAATCG ATCACACGAC
ACGAATAGCG CTATAGGCGT TTCCGACCCC GCTACGCATA CGAGCCGTGC CATGAGCAGT
CACACGTCGC TGCAATACTC GAGTTCCGGA GGCATCGCGA ACATCTCTAC CACAACCGAT
CCTCCACACA AACGACTCAA GTTGGACCAT GCCATGAGCC ACACATCGCT CGGCAACCCA
TCCTTGAGCT ATCACGATTT TGCCGCACAT TACGACAGTC GCAGTACCTT ACACACTAGT
AGCACCATGG ATCTAGGCGT TTTGCGGAAA GAAGATTCCT TGGGCATGAT GCGCAAGGAC
GGCGACGACG AGGACGACGA AAATGATCAG AACGACCCGA TATCCTCCAC AGCTGTACGA
CAAGCGACGG TCCAACCTAC TGCTCTTCCG AATGAAAGTG CGAAACCCAC ACACCCCACT
ACAGCGAACG TAGCCACCAC AAATTCCGTT TCGTCCTCCG ACAGTCTGCG CGATCTATCC
GCACACCGTC CACAACATCC ACAGAATACT ACTCGTCTTC CCGTTTCTTC GTCAACGACT
ACGGTAACAT CGGGTTCGAA TTCTCCGCTC TCTGCGGGGC CGGTATCAGC CCAAGCTCCT
CCCTCGCCTC TGTTACCTCT CAAGGCTACC AAAATGTCAC ACCTCCGCCA AAAATACATG
CAAGAACTAG AGTACATGCT GTGTGAGTTC CAAAAGCTGG AACGTCAGCT ACTAGGTGCC
AAGGCGACGA CAGCCGAATC CGCTGGCAGC CGCGAACGTC GAGAAAAACT GCATTCGTTC
ATCACGCACC TGAGCGATAC GATCCAGAAC ATACAGACCG GATGTCAGCT AGAGTCGGAG
GGAAAATCAA CCGTCGGAGA AGCTTCCAAG CAAGATATAG CCCAGGAGGC CGCGCTGGCA
GATTTGACGT GCGAAAAGGG GGAAGAGGAA AACGTGCAAA AGCTGGAAGA GCACATTCTA
GCCAATCTGT TGCCCGTCAA AGTCCGGCTC AAGAAACAAC TGGCGGCCCA GCAAGGTGCC
AAGCATAACC CGGCGGGGAT GCCGGTTGCG CAAAGGGGAC TAGTGGCACC GAGCGAAGGT
GGTAAAGGCA CGTTTGCGGC AGCGGCCGAA GAGCGCAGAA AGCAATTGGC GGACGCGGCC
GCCGCGGCAC AAGGCTTCGA TCATACACAC GTACCGGCGG AACCGGTTCA TCCAGACCAG
ACACAATTTG GTAAACCACT ACAAGGAAAC GGCTCCTCGT TGACGCGAAA TTTGCATGGA
TCCACTTTGG GATCCGCGAT TAAAGTGGGA ACGGATAAGT CCAAAATTTT GTTCGCTGGT
TTGGCGATCG GATCGTCGCA AGTAAAGTCG TCGGTCAACG CAGCTTCGTC GGTACATCAG
CTCGTAATTA AGGATCCCGC TTTGTTGGAG TTGGCTCGCC AACAGAGCGC GTCAAAACAA
CAAGAGGACC TTCCACCGCA AACACAACAA GAAGACTCTC CAACGCAAAG CAAACCCAAT
TCGCTGCTGC CTCCTTCCTC GTCCGAGCCG AATGACTCTC CAGAGGATAC AAACCGTAAG
GCTATATCAC TAAAAGTTTC GCCTGCTGTT GCTTCTGCAG CAGCTTTGGC CGCGTCTGAG
CAACCAGACG CAGTCTTGTC AAAGGCTCCA CCAAGCAGAT TAGATGATGT TGATGCCACC
TACCCCGACA TGCCATCGGC AGCTTTAACC GATGAAGAAC GGCGAACCCT CCGTCGTCTC
AAACGCCGAA AAAAGAGACG AAAACGCAAG GCCGAAGCAA CTCCAGTCAC GGCAGCGGCC
ACGGCAGCAC CAGTGATCAA TCGCCATCAC AAGCCGACGA CAAAAAAACG GGGACCTCGG
ACGGTGGAAT ACATGTGTGC TTTGTGTAAC GAAGTCTACA ATTCTACCTG TGATTATAAT
CCTTGGTGGG CTCTGGCTCA ACATGATTGT CCAAAATGTC GAAAAAATCA GGTTCGTCGA
CTTCGTGTAC CGAATTGCGC CCATTCATAT TCGTACCTTC TTGCGGAAGT TCTCACACCC
TTTCTCTCTA CTGACAGATA CCGCGGGTAG ATATTAGCGC ACCTGCCAAT ACGATCGAAT
ATCATCCGGC GTTGCTAGCT CACGCAGACG AAAATGGCGG TAGTACTCCG ACACCGCCTG
CAGCAATAGT GAAGCCAGTC ACAACTGTGT CGGCTCCTGT CACTAGTGTG CCAAAATGTG
GTAATGATTC CGATTCGTTC GGATCTGACT TGTCAGACGA TGATCTTGAC GGCCTGTTGT
CAGACACTGA CTCGGAGGGC TCGGGAGAAA TAGGTATGGA AAGAATAGAT GCGCTATCGC
CTGCGGAACA AGCAGAGAAT GAATATTTTG GGGTGGAATA CAAGGGGCCA AAATTGAAAG
ACAGTGAAGC TGCTCGGCTA CTGATTCTCA TGGGGCATGC GTCGACCTGT CCTTGCAAGC
ATCAATCGAT CAAACATCGT GAAACCTGCA GAAATACGAA ATGGATGATG TTGCATGTTC
GGGATTGTCC AGGAACTACA TCTTCGTTTG ATGTCTGCCC ATTTCCATGG TGCCGCAAAG
TCAAGCATTT GTTGTATCAT CTTGTCTCGT GTCGCGATGC CAAGCACTGT GAGATCTGCT
CACCGACCAA GCTCAACCAA AATATGATCC TGTTAAAGGG GTTGAATCAG CACCGCTTCA
TGCAATATAG GGAGCGGCTG ATCGGCCGTG GAAAGGCGTT GACAAAGGTG TCAAATAGTG
CGCCGAAAAA TACTCCAGCT CAGGCGCAGC ACAAAAGTGT GTCATAAAGC AAACCGGGTG
TAATTGGTAT TGTAGCTTTC ATCGACGTTT CGCAAATGCT GTAA
 
Protein sequence
VGRLALTLRR RLAGISAAEP LSSVTTVVFG LAKTAQGDAR SFLFWAGKAA EIVKKAPCDT 
ENEETTVGAK TERIAQHRTD RRQRHVLMVT MAIPTNVCLL LSEIFRFLMM EMEEHDLAQQ
KDPGKVSSCS DSKGEGHALE VQLVYVSSAV GFRVSWRSAS SLLSSRGRGY PILSDGCGAR
SQSTNEAVSR TTDSSNHTLL QNSNLSQQPP LPLLLPATDS LRHPANPLYS RNRSHDTNSA
IGVSDPATHT SRAMSSHTSL QYSSSGGIAN ISTTTDPPHK RLKLDHAMSH TSLGNPSLSY
HDFAAHYDSR STLHTSSTMD LGVLRKEDSL GMMRKDGDDE DDENDQNDPI SSTAVRQATV
QPTALPNESA KPTHPTTANV ATTNSVSSSD SLRDLSAHRP QHPQNTTRLP VSSSTTTVTS
GSNSPLSAGP VSAQAPPSPL LPLKATKMSH LRQKYMQELE YMLCEFQKLE RQLLGAKATT
AESAGSRERR EKLHSFITHL SDTIQNIQTG CQLESEGKST VGEASKQDIA QEAALADLTC
EKGEEENVQK LEEHILANLL PVKVRLKKQL AAQQGAKHNP AGMPVAQRGL VAPSEGGKGT
FAAAAEERRK QLADAAAAAQ GFDHTHVPAE PVHPDQTQFG KPLQGNGSSL TRNLHGSTLG
SAIKVGTDKS KILFAGLAIG SSQVKSSVNA ASSVHQLVIK DPALLELARQ QSASKQQEDL
PPQTQQEDSP TQSKPNSLLP PSSSEPNDSP EDTNRKAISL KVSPAVASAA ALAASEQPDA
VLSKAPPSRL DDVDATYPDM PSAALTDEER RTLRRLKRRK KRRKRKAEAT PVTAAATAAP
VINRHHKPTT KKRGPRTVEY MCALCNEVYN STCDYNPWWA LAQHDCPKCR KNQIPRVDIS
APANTIEYHP ALLAHADENG GSTPTPPAAI VKPVTTVSAP VTSVPKCGND SDSFGSDLSD
DDLDGLLSDT DSEGSGEIGM ERIDALSPAE QAENEYFGVE YKGPKLKDSE AARLLILMGH
ASTCPCKHQS IKHRETCRNT KWMMLHVRDC PGTTSSFDVC PFPWCRKVKH LLYHLVSCRD
AKHCEICSPT KLNQNMILLK GLNQHRFMQY RERLIGRGKA LTKVSNSAPK NTPAQAQHKT
FIDVSQML