Gene PHATRDRAFT_49981 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49981 
Symbol 
ID7198768 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011694 
Strand
Start bp19020 
End bp21820 
Gene Length2801 bp 
Protein Length544 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184813 
Protein GI219129265 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000710505 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TGCAATCGAT GGTGACTGGA AGCTGAATCA GTTCTTTCAT TTCTATCTAC GAGTCCCAGG 
GCCGTTTCGT TACTTTGAAG TACCGGGTGA AAGAAAATAT ATTTGGAGGG TATCTGAGGA
GCTGCAGAGC TAAAGGGGCT AGCAATTAGT ACTCGAATAG CGGGCCACCG AATTTCGAAG
TTGCTCTTTC GCTTTATCTC GACAACGAAC ACGATCGACT CTAAAATGCT ACAGAAAAAC
CATCGTATCT TATTCACAAT CAACTTTCGT CTCCGATATG CTATTCCTTG GTTTTTGGTT
TTGGCGCTGT CGCCGTCCGG GGACATGGCT TACATTGGAA AGTTGACTAC ACGTCCTTTG
AAAAAAAATT CCTCTCCGAC GTTTGTTACG GTAGCAAACC TCGTTGGGGA CTTCGGGCGA
CAATCTGGAG GGAGTGGTGA AATCTTCATT CGAGAGTCGG CCAAGAGGAG TTTTGCTTTT
GGACAGTTTG GGGTCAAAAT TGAAACTCGT GACTGTGAGT CGCGTGGTAC ACCTTGATAA
TTCACAGGCA TTTTCTAGTC AAAATTTTGG AAACATATTT CGTGGGACTG AATTGACTGT
AAACCGAAGA TGGCAGATCT GCACGTACGC ACGCAGTTTC TGTCAAGTCT CTACAGGCTT
TGTTTCTAAC TGAAGACCTT TTTTGCTGAA CGTCATGAAG TTTACTGAGA GTTCGCTGTT
CAAAGTGTTA CTACATCTAG AGTGGTAGTG TCGAATACTG TGCTTTCTTG AAAGCATTGG
TAGTGAGAAC AGTAAGCTTT GATTACCGTC ACAGTCGGTC ATCGCTACAA ATGTCTGACG
TTGTCCTCAT TCCATCCACG GAAAGGTAAA GAATGAGACA AAGCACCGCC GCAGCCTCCA
AATTGAAAAG TAACGGTAAG TGATACGTGA CAGGTCTCAA CAAGATCAAC AACTAAGTTG
CGTCACTAAC CGGATCGTTG CCTGGTATCA AGGGTCACCG AAGAAGACTG CGAACAGTAA
GATTGTCCGG AGCAACGGCA TCGACCGATC CAAGGATGAG TCCGAATTGC CCGCGTCCGT
AAGTGAGGCC GACGAGACTA TAATTTTCTT TCAGAGCCGT TGGCGCAAAA TATCGACCGG
GAAATGGGGC CCTTGGCAAC GTACGGAGCT CACCAATCCT CGCACTCGCT CTGCCCTTAC
CCAAAAAAGC CGACAAATCC ATCTGCCTCG CGGTGGTACC GTCACAATCT ATCCCGGACT
ACTCTCCAAA ACCCAACGAG CGACGCTGAG CCAAGAGCTC CTCTCTAGCA ATTTATTTCG
TCAGTATGAG ATTCAAGGTA TGCCGGAACC ACGGGTGCAC TTTTTGCTAC ACGAGCAGGC
TACTGATGAG CCCGAAGCCC CGCAACCCGG ATACAGGTAT GGGAGTGTGC GCATGAAAGC
GTGCCCTTTA CGAGGGCTCC CCAAGCTCTG TCGCTTATCG AAAATTGTGG GCAAAAGGGC
TGGCGTAGAG AGCTGGAATA TTGGGGTCAA TCCAGTATTC TACCGAGACG GCAAAGATCG
AATTGGAGCA CACGCCGACG ACGATCAGGG CGAATCCTGC ATCTTGAGTG TTATCGTGGC
TTCCCCTATT CCGCTTCGAC GACTCTTGAT TCAACCTAAA CCAACCAAAA TTGTGGATAG
GACTGTACTT GGGACAAAGC AGAAAGGAAA CAAACGGTTG AAACTAAAAA CGGGAAACGG
CGATATTGTT GATGATGGTG TGGATGAACA GCATGAGCTT CGACTGGGTC CGGGGGATGC
ATACTGCATG GATGGTACGT GAAATAGAAA CGACTTGTAC GTTTTGATGC GAAATAAAGC
ATTCTTAAAT TTTTTGGAAC CCTCTGATTC ACACGCATTT ATTGGTCAGG AGAGATGCAG
GAGCACTATG TTCATTCCCT CCCCCCCGAT GAACACAGTA AAGGTTCAGT AGAAGGGCGC
CGTATTATTG TCGTCTTTCG TTCAGGCACA CAAAAAGTGT TCAAAAAGGA TTCCGGTAAA
CCATGCAACT TGGCGGCTTT GGAGCCTCGA CCTGATCTGT GTTTTACGTT TGGAAACGAA
ATAAAAGATT TGCGGGAGGG CGACATCTAC GGCGGACGTC AGCTGCGGGC AATGAATGCC
CATCGGTACG TCGAGGTAGT CCAACCGGCT TACTGTTGTG CGAGTTCAAA AAGGTAACCC
CTCACTCATA TTAATCTTGT ATTTTACAGA TCCACACAGC GCAGTGTCAG CGGGAATAAA
GTCATGGGAT GTGATGCAAT TACAGTTTCT CGGGATCGCG AGGATGACAC TTTTGTGAGA
TTTTCCTTTG CAGCAGAAAC ACGTGTTGGT GGGGGGAGCA TGCTCACAAG TTTGCAGAAG
GGATATCCTG TTCGGGTCTT TCGCACGTCA GCTTTGCACA ATAAGTACAA AGCAGTCGCA
AAGAAAAGTG GCCCAAAGTC AAAGTCAAAT GTGTACCGGT ACGATGGCCT TTACCATATA
GAATCTGCGG TGGAAGAACT GGGAGACAAA GTAAACGATG TGTCCCTTGG CTTGAGCAAC
ATGATCAACC GCAAGTCAGA AATTATTTTT CGGCTTTGTC GATCCTCAGA GAACAGTGTA
TCGTCAATGC GCCTTTTGAG ACGCATTGTC ATTGAGCGCA TGACCCACAG CAAAGTGACC
AGCAACAACG ATCTGCGCAA AAACTGTCCG AAGAAGATCA TTGTCAATTA ATGAATCTGT
ATGTTATGCA CAAAATATTC TTATATCAAG CAGAACCAAA T
 
Protein sequence
MRQSTAAASK LKSNDQQLSC VTNRIVAWYQ GSPKKTANSK IVRSNGIDRS KDESELPASS 
RWRKISTGKW GPWQRTELTN PRTRSALTQK SRQIHLPRGG TVTIYPGLLS KTQRATLSQE
LLSSNLFRQY EIQGMPEPRV HFLLHEQATD EPEAPQPGYR YGSVRMKACP LRGLPKLCRL
SKIVGKRAGV ESWNIGVNPV FYRDGKDRIG AHADDDQGES CILSVIVASP IPLRRLLIQP
KPTKIVDRTV LGTKQKGNKR LKLKTGNGDI VDDGVDEQHE LRLGPGDAYC MDGEMQEHYV
HSLPPDEHSK GSVEGRRIIV VFRSGTQKVF KKDSGKPCNL AALEPRPDLC FTFGNEIKDL
REGDIYGGRQ LRAMNAHRST QRSVSGNKVM GCDAITVSRD REDDTFVRFS FAAETRVGGG
SMLTSLQKGY PVRVFRTSAL HNKYKAVAKK SGPKSKSNVY RYDGLYHIES AVEELGDKVN
DVSLGLSNMI NRKSEIIFRL CRSSENSVSS MRLLRRIVIE RMTHSKVTSN NDLRKNCPKK
IIVN