Gene PHATRDRAFT_19762 details

Gene Information

Locus tag: PHATRDRAFT_19762
Symbol:
ID: 7200151
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011674
Strand:
Start bp: 750533
End bp: 752998
Gene Length: 2466 bp
Protein Length: 807 aa
Translation table:
GC content: 49%
IMG OID:
Product: predicted protein
Protein accession: XP_002179497
Protein GI: 219117405
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CTTTATTCAC CTGTCACTCC AGCGCAACGT CAACGAGCAA GCATGCTATC TCGCTTTATC 
GGCTTGCGAA GACTCTTTTC ATCGCACAGC AAAAACGTGC GAGTGGCGGC AGACAAACGC
ACCAACATAG TCTTTGGAGC TAACACAGAT GTTGGTAAAA CTCTCGTTTC CGCTGGTCTC
GTCCGGGCTG CGCTCCGACG CAACACTGTT CATTACGTCA AGCCTCTCCA ATGTGGCGGC
AATGATCAGG AATTTGTAAG GAGGCACGCC TTATTGCAGC CTCGGCCTTC CAATGCGCTA
TCCGCAGAGA TTTTGTTCCA GTGGGATACT CCGACCTCGC CTCATATCGC TTCACGAATG
GAAAACAAGC CCCAGAGTGA CAAGCATGTA ATGGAGGCCG TCAACAAAGT TTTGGTAGAA
ACAACGAACC GTTCAGGTCC TGTCACGACA ATTATTGAAA CGGCTGGTGG TGTACTCAGT
CCTTCTGCCT CGTCGCCCAA CAACACGTCA GCTCGACACG CGACTTCGGA GACTGGCTGG
GGATGGATAC CCCAAGCAGA CCTATATCAA CCATTACTGG GACAAACCCC GGTAGTCCTC
GTCGGTGACG GCAGACTGGG TGGTATCAGT GCGACACTGT CCAGTTTGGA ATCGTTGCTG
ATTCGGGGTT ACGATGTGAC GGCCCTTGTT TTGTTGGAAA CGGACTACGA CAACGTTTCG
GCCATGCGCG AATATGTTTC ACGGGGTCAT AAGCTTCGAG CTGGCAATGG AGAGACGCTC
TTCTCCAATC CGAACGATTC TGTGATGTCG CTACCTGCTA TTCCGTCGGA TCCACAAGTG
CCTTTGGATG AATGGTATGG CTCCGATGCC GTCAAGGAAC GGTTCGAACG TCTGGACGAT
TTTTTGCAAC AAAGTTGGGA AGGTCAGCTC TTGGACTTGC AAAGTTTGCG GTGTACCGGC
CGACAAGTTT TGTGGTGGCC ATTTACACAG CATAGTCAGG TTCAAAGAGA CAATCAAGTT
GCGCTTGTGG ACAGTGCTTC TCAGAACGAC TTGAATGTGC TCGTTGACGG CACAGACGGA
ATATGTCTAG AGCGTGTATC AATGAAAGAT ATGTCAGGAA GTGAAGGGAC TCTCGGTATT
GGACACGGGG ATTCTTCGCT CGCTTTGGCT AGCGCTGGTG CAGCTGGACG ATACGGTTAT
GTTGCCTTCC CTCAGACAGT GCATGCACCC GCCGTCGCTT TGGGCCAAAC ACTTGTAGGT
CCCCGCGGTC CAGGACAAGG GTGGGCGAAG CGTGTTTTCT TTACGGAGGA CGCAGCATCT
GGAATGGAAG CGGCCCTCAA GATGGGCATG AAAACCTATA TCAAGCGAAT GCGTGAAGAA
GAGAATGAAT CAATCGAATG GGTTATTTGC GGACAAGAGA ATAGTTACCA TGGTGATACA
CTGGGAGTTA TGAATGCTAC TGAACCCAGC TTTTTCAATG ACGGACAGCA TCCCTGGTAT
GAATGCAAAG GATTATTCCT GTCTGTTCCA ACGTTAGGTT TTCGAAACGG AGTGCTTTCC
ATTTCGTTCC CGGAGGGATC TGAACTTGAG GGCACACAAA CAACGTTTGA ATCCATCGAT
CACGTTTTGG ATATCGATGC ACGGATGATT TCCAGAAAAC TTCATTCGCA GTACAGGGAA
CTGATTGAGA TGCAATGGCT TGTTTACGAG CATAGTAGTG TCAATAAGAA AATATCATCT
GTTGTGATTG AGCCGGTGCT TTCGGCTGTC GCGGGAATGA GTTTTGTCGA TCCACTTTGG
CAGAAAGCTC TAATTGCCGT TGCTGAGTCA CGCAATGTTC CTGTTATCTT TGATGAGACT
ACATCGGGAC TTCAACGACT TGGAGTGATG AGCGGTCGCG ACATTTTACG GGAAGATCCC
GACATCGGAG TGTACTCGAA ACTTCTCACC GGTGGAATCC TTCCACTATG TGCAACATTA
ACCACCGAAG AAATATTCGA GACATTTTTG GGAGAAGACG AAACGATGGC GATGCTCCAC
GGTAGTTCTC ACTGTGCTCA TGCAAGTGGC TGTACAAGCG CACTGCATGC TCTACACGCG
TATGACCTAC TATCAGAGCA CTCCAGAAAA GCAAAAGTCA GCCCCAGGAT GCTTTTCGAC
GCTGATTTGG TAAGAGCTAT GTCGGAACTA CCTATCGTTG AGCAGACCTT TTCGCTAGGT
ACCGTTCTTT CTATTACGCT CGATGTCGAC GACGACGAGA GCGGAGACAA TGAGTGCTCG
CTCATCAATG TGGCGATACG TCTTCTTCGT AAAGAATGCG TTTTTGCACG TTCGCTCGGC
AATGTTATGT ATATTTTGGT CTCGCCACCG GATAATCCGG AGGATTGCTT GCATCTTTCG
AAGAAAGTTT ACGACACTCT CTCAAAACTA TCTGGAGCTC GATCGATGTC ATCCATGAAG
CAATAA
 
Protein sequence
MLSRFIGLRR LFSSHSKNVR VAADKRTNIV FGANTDVGKT LVSAGLVRAA LRRNTVHYVK 
PLQCGGNDQE FVRRHALLQP RPSNALSAEI LFQWDTPTSP HIASRMENKP QSDKHVMEAV
NKVLVETTNR SGPVTTIIET AGGVLSPSAS SPNNTSARHA TSETGWGWIP QADLYQPLLG
QTPVVLVGDG RLGGISATLS SLESLLIRGY DVTALVLLET DYDNVSAMRE YVSRGHKLRA
GNGETLFSNP NDSVMSLPAI PSDPQVPLDE WYGSDAVKER FERLDDFLQQ SWEGQLLDLQ
SLRCTGRQVL WWPFTQHSQV QRDNQVALVD SASQNDLNVL VDGTDGICLE RVSMKDMSGS
EGTLGIGHGD SSLALASAGA AGRYGYVAFP QTVHAPAVAL GQTLVGPRGP GQGWAKRVFF
TEDAASGMEA ALKMGMKTYI KRMREEENES IEWVICGQEN SYHGDTLGVM NATEPSFFND
GQHPWYECKG LFLSVPTLGF RNGVLSISFP EGSELEGTQT TFESIDHVLD IDARMISRKL
HSQYRELIEM QWLVYEHSSV NKKISSVVIE PVLSAVAGMS FVDPLWQKAL IAVAESRNVP
VIFDETTSGL QRLGVMSGRD ILREDPDIGV YSKLLTGGIL PLCATLTTEE IFETFLGEDE
TMAMLHGSSH CAHASGCTSA LHALHAYDLL SEHSRKAKVS PRMLFDADLV RAMSELPIVE
QTFSLGTVLS ITLDVDDDES GDNECSLINV AIRLLRKECV FARSLGNVMY ILVSPPDNPE
DCLHLSKKVY DTLSKLSGAR SMSSMKQ