Gene PHATRDRAFT_51017 details

Gene Information

Locus tag: PHATRDRAFT_51017
Symbol:
ID: 7202209
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011680
Strand:
Start bp: 5587
End bp: 7236
Gene Length: 1650 bp
Protein Length: 317 aa
Translation table:
GC content: 47%
IMG OID:
Product: predicted protein
Protein accession: XP_002181115
Protein GI: 219121525
COG category:
COG ID:
TIGRFAM ID:
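
The coordinates and lengths in this table can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, although it reproduces the reported 1650 bp). The function and variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
START_BP = 5587
END_BP = 7236
PROTEIN_LENGTH_AA = 317


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_bases(protein_length_aa: int) -> int:
    """Bases needed to encode the protein: three per residue plus a stop codon."""
    return 3 * protein_length_aa + 3


length = gene_length(START_BP, END_BP)
coding = coding_bases(PROTEIN_LENGTH_AA)
print(length)           # 1650
print(coding)           # 954
print(length - coding)  # 696 bases not accounted for by codons
```

The 696 bp difference fits the record marking the gene as spliced: the listed gene sequence carries non-coding bases (such as introns or untranslated regions) in addition to the codons.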


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CGTGCCTTTC TTGGCTACAC AGCAACATCT GAGGCATTGA TACCAGTTGT ATTCAAGGGT 
CCTGTACTAC CCGGAGTTTG AGCCGTAAGA CAACATGTCC CTCGATACTG CAAAAATCGA
CGAACATATC GAGCGGCTGC GGGAAGGAAA CACATTGACA GAGAACGAAG TCAAGGCTCT
GTGCGAAAAG GTACGATTTC CGATTTTGGC CAGACCCATT TCATTGCTTG CGTGTGGGGT
ACGGTAGTGA GCGACGATGA TCGGAAGCAA TCACAAGGCT CTGCATCAGA TGATGGAAGT
TGGTCAACGG GACAGCTAGA GGAAGGCCAA CTGCACGTGA TGTTTTTCTT TCGATCTATG
ACTATGCTGT CCTGTCTATG CCTGCATTTC TCATCCGTTT TCAATATTTC CACGATCTTT
CTCGTAGGCA AAAGAGATCT TGCGAGACGA ATCGAATGTG CAGCCTGTTA CTGCTCCCGT
AACTGTTTGT GGGGACATCC ATGGCCAATT CTATGATTTG GCGGAGCTTT TCCGTATTGG
CGGAGCCTGT CCAGAAACAA ACTATCTTTT CATGGGAGAT TACGTGGACC GCGGCTATTA
TTCCCTCGAG ACAGTTACCT TGCTCATGGC TTTGAAGGTT CGCTACCGTA GCCGTATAAC
AATTCTGCGA GGTAACCATG AGAGTCGTCA AATTACGCAG GTGTACGGAT TTTACGATGA
ATGCCTCCGG AAATATGGAA ACGCTAATGT GTGGAAGTAT TTCACCGATA CGTTCGATTA
TTTACCCATG ACTGCGGTTG TATCCGACCG CATTTTCTGC TTGCATGGCG GACTTTCACC
CTCGATCGAT ACATTGGATC ACGCCCGCGA GCTTGATCGA GTCCAGGAAG TTCCTCACGA
AGGTCCCATG TGTGACCTTG TGTGGTCCGA CCCCGATGAC CGGTACGTAT TGTTATATGA
GCTAGATACA GTGTTTGAGT TCCTTTCTAA CAATGCACCA ACTCATTTAC TGCAGCTGTG
GCTGGGGTAT ATCTCCGCGA GGTGCTGGTT ACACCTTTGG TCAAGACATT ACGGAACAAT
TCACGCACAT AAATGGTCTC CACTTTATTG CTCGCGCGCA TCAACTCGTC ATGGAGGGTT
ACCAATGGCA GCATAATCGA AGTGTCGTCA CTGTCTTTTC GGCACCGAAC TACTGCTATC
GGTGTGGAAA TCAGGCGGCA ATTATGGAAA TTGACGATAC CGTCGACCAG ACCAATAAAG
ACACCGTCCA CGATCACTGC AGATTGTAAG TGCTTGCGAT TTGAATTGCT CTTTGTAACC
ATATTCTGAT TGTGCTAGTA TTCTAACGCA TTACTTCTTC ATTTCGGCAG TTCGCAATTT
GATCCTGCCC CGCGAGACGA AAGTTGGCAC AAGAGTTCAA GAACATTGGA CTATTTTTTG
TGAGCAGCTT GTAATACCCT TGAGTCTATT GAGTTTGTAC AAGCCTAGAA GGGGATAAAA
ACGGTTTATT GAAGTTTCGC ATGGTTGTTC TATTCCAGAT TAGATATGGG TGTACTTTAA
ATGGATCCGA TGAACTGTGG CGTAGTTCCT TACACCTGGG TGAGAATGCG CTTCCACCAC
GTGCCCTACT TAAATTTATT CTGCGACTAC
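
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks have to be removed before its length or GC content can be compared with the values in the Gene Information table. The following is a minimal sketch of that check. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the listing is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Drop whitespace (spaces and line breaks) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc = sum(1 for base in sequence if base in "GC")
    return 100.0 * gc / len(sequence)


# First line of the gene sequence; paste the full listing here to compare
# against the reported gene length and GC content.
first_line = "CGTGCCTTTC TTGGCTACAC AGCAACATCT GAGGCATTGA TACCAGTTGT ATTCAAGGGT"
bases = clean_sequence(first_line)
print(len(bases))  # 60
print(round(gc_percent(bases), 1))
```

With the complete listing as input, `len(bases)` should match the Gene Length field and `gc_percent(bases)` should land close to the reported GC content, allowing for rounding.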
 
Protein sequence
MSLDTAKIDE HIERLREGNT LTENEVKALC EKAKEILRDE SNVQPVTAPV TVCGDIHGQF 
YDLAELFRIG GACPETNYLF MGDYVDRGYY SLETVTLLMA LKVRYRSRIT ILRGNHESRQ
ITQVYGFYDE CLRKYGNANV WKYFTDTFDY LPMTAVVSDR IFCLHGGLSP SIDTLDHARE
LDRVQEVPHE GPMCDLVWSD PDDRCGWGIS PRGAGYTFGQ DITEQFTHIN GLHFIARAHQ
LVMEGYQWQH NRSVVTVFSA PNYCYRCGNQ AAIMEIDDTV DQTNKDTVHD HCRFSQFDPA
PRDESWHKSS RTLDYFL