Gene PHATRDRAFT_30019 details

Gene Information

Locus tag: PHATRDRAFT_30019
Symbol:
ID: 7195250
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011688
Strand:
Start bp: 205931
End bp: 207088
Gene Length: 1158 bp
Protein Length: 299 aa
Translation table:
GC content: 47%
IMG OID:
Product: predicted protein
Protein accession: XP_002183570
Protein GI: 219126661
COG category:
COG ID:
TIGRFAM ID:
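
The gene length listed above is consistent with the start and end coordinates if they are read as 1-based and inclusive (an assumption, not something the record states): 207088 - 205931 + 1 = 1158. A minimal Python sketch of that check follows; the function name `span_length` is illustrative only. It also shows that the 299 aa protein needs fewer bases than the gene spans, which fits a record marked as spliced.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based, inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the Gene Information table.
gene_length = span_length(205931, 207088)

# Bases needed to encode the 299 aa protein (a stop codon is not counted).
coding_bases = 299 * 3
noncoding_bases = gene_length - coding_bases

print(gene_length)      # 1158
print(noncoding_bases)  # 261
```

The 261 remaining bases would be intron and/or untranslated sequence; the record itself does not break them down.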


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0257867
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TGGACTCCTG GACTCCTTGA ATTATTGACT CTGTTTTCTA CAGCGCAGAA GAAAGCAAGC 
AAGGCTCAGT ATTTCCATTA TTAGCGTCTG TCGCGTGTGG TCTGGCTTAC AATGAAGTTA
GATCCTACTG TTTTGCGTAC CATGAGCAAA GAGGACTTTC GAGTTCTCGA AGCTGTAGAA
AAGGGCATGA AAGATCATGC TTTGGTGCCA TTGCCACTTA CAACTTCCAT TGCCAATCTG
CGACACGGAG GGGCCCACAA AATAGTTTCC AGTCTATTAC GTGATAAGCT ATTGAGTCAT
GAGCGAACAA AGAACGGATA CGATGGATAC CGTGTTACGA ATGCTGGATA CGATATTTTG
GCGCTCCAGA ATCTCAAAGC TAGGAAAATT GTCGCTGCTC TCGGTCAGCG GATCGGCACC
GGAAAAGAAA GCGACGTCTA TCTTGCGGTC GATCTTTCGG GTCAACAAAT TGTGTTGAAA
TTTCACCGAT TGGGTCGAAC GTCTTTTAGA AACGTCAAGA AGAAGCGGGA CTATTTTGGA
AACGCTGCAC AACAAGCGCA TTCCTGGCTG TTTCTTAGCA CACTTTCGGC TTTGAAAGAG
TTCGCTTTTA TGAAAGCACT TTACGATGTT CATTACTCTA CACCGGTACC GATTGCACAC
AATCGACATA TTGTCGCCAT GGGCCTTGTC CGTGGCGTCC CTCTATATCA AGTTTTTCCA
AAACAGCTTT CTGCGGAGCA GGCGGCCGAC ATTTATGAGC AGGCGATTGC TTTGGCGGCT
CGGTTAGCTA AACACGGGCT CGTCCATTGT GACTTGAATG AATTTAACCT ACTGGTTGAT
TTATCCGGTA TACAGTCACT CGCTACATCA GGTGATGATC CTTACATAAG ACACTCGGGT
ATGTCTGTCG CTGGAGAAAA GTCGGTAGGT GCCCTTTCCA AGCCTGCGTG GGAACAGTCA
CTGGAGGAGG GCGATAAGAT TGCTGAAGTT TTGCCAGAAC CGATCGCCCG TCTGGATAAT
GGCGATCCGA AACCGGTTGT GACGTTAATT GACTTTCCTC AAATGATTTC TACGAAGCAT
CCGAATGCTC AGGAGTTGTA TGAGCGAGAC TTGGCATGCC TGCGAAGATT TTTTGAACTG
AAAATTCAGT GCACTATA
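
The sequence above is laid out in blocks of 10 bases, 60 per line, so the spaces and line breaks have to be removed before it can be measured. The sketch below is one way to do that in Python and to compute a GC fraction; the helper names `clean_sequence` and `gc_fraction` are illustrative, and the sample is only the first line of the gene sequence, not the full record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 if the sequence is empty)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used only as a short sample.
sample = "TGGACTCCTG GACTCCTTGA ATTATTGACT CTGTTTTCTA CAGCGCAGAA GAAAGCAAGC"
seq = clean_sequence(sample)

print(len(seq))                    # 60
print(round(gc_fraction(seq), 2))  # 0.45
```

Running the same two helpers on the complete gene sequence gives a length and GC fraction that can be compared with the Gene Length and GC content fields in the table.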
 
Protein sequence
MKLDPTVLRT MSKEDFRVLE AVEKGMKDHA LVPLPLTTSI ANLRHGGAHK IVSSLLRDKL 
LSHERTKNGY DGYRVTNAGY DILALQNLKA RKIVAALGQR IGTGKESDVY LAVDLSGQQI
VLKFHRLGRT SFRNVKKKRD YFGNAAQQAH SWLFLSTLSA LKEFAFMKAL YDVHYSTPVP
IAHNRHIVAM GLVRGVPLYQ VFPKQLSAEQ AADIYEQAIA LAARLAKHGL VHCDLNEFNL
LVDLSEPIAR LDNGDPKPVV TLIDFPQMIS TKHPNAQELY ERDLACLRRF FELKIQCTI