Gene PHATRDRAFT_16499 details

Gene Information

Locus tag: PHATRDRAFT_16499
Symbol:
ID: 7198760
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011694
Strand:
Start bp: 349889
End bp: 351025
Gene Length: 1137 bp
Protein Length: 378 aa
Translation table:
GC content: 58%
IMG OID:
Product: predicted protein
Protein accession: XP_002184867
Protein GI: 219129378
COG category:
COG ID:
TIGRFAM ID:
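
The coordinate and length fields above are mutually consistent, and the check can be sketched in a few lines. This is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names `gene_length` and `protein_length` are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Residue count for an unspliced CDS; the stop codon encodes no residue."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length_bp = gene_length(349889, 351025)   # 1137
length_aa = protein_length(length_bp)     # 378
print(length_bp, length_aa)
```

Under those assumptions the result matches the listed gene length of 1137 bp and protein length of 378 aa.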


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.570154
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATGACA GTGGACAAAC CTACACGTAT CTCAATTATC CGTTGGAGAA CGGACAAGTC 
CTCGCCGAAG CCCAACTCCG CTACCAAACC TACGGACAAC TCAACGAAAC ACGGGATAAC
GTCATGGTTG TTTGCCACGC CTTGACCGGC AACGCCTCGC TACACGCCTG GTGGGGCGAC
ATGCTCGGAC CCGGGAAAGT CTTCGACACG GACAAGTATC TCGTGGTCTG TTGCAACATT
CTTGGTAGTT GCTACGGCAG TACGTCACCC GTATCGATCC GCCCCGGAAC GGACCAACCC
TACGGACTCG ACTTTCCCGA CGTCAGTGTC AAGGACACGG TACGGTTGCA GCTCTGCATG
CTCCGCGACG AACTCAAAGT CGCTTCCGTC CACGCCGTTG TGGGCGGTTC CTTTGGCGGC
ATGCAGGCCG TCGAATTCGC CGTCCAGGCC GGATCCACCC GGGCCGCCTT TACCGACGCG
CACGGACAAC CCTTTTGCAA ACACGTGGTG CCCATTGCCT GCGGCGCCCA GCATTCGGCC
TGGCAAATCG CCATTTCTGA AGTCCAACGC CAGGCTATCT ACCAGGACCC GGCCTGGCCG
ACGGATCCTT TCCGCGCCAC GCACGGATTG CGTGTCGCCC GACAGTTGGG TATGATTTCC
TACCGTACGC CGCAAGGGTA CGGCAGCAAG TTTGGCCGGG AACGGCAACG TGGTCGGGGC
GACGATGACA CGGACGGCCC CGCCTACGGT AGTCACGCGC GTTGGCAAGT TAAATCCTAT
TTGGAATATC AAGGAGTCAA GTTCCTCCAA CGCTTCGATC CCGTCACGTA CGTCAAACTC
ACGGAACAGA TGGACTCGCA CGACGTGACA CGGCAACCTG CCGGTAGTTG TCCCGGAACG
GTCAGTAAGG AACAAGTGCT GGGCCATGTG ACGATTCCCG TGCTCGTACT AGGCATTGAC
AGTGACGTGC TGTATCCGTT GGCGGAACAA CAGGAACTGG CCCGACTCTT GCCCAACGCC
ACGTTGGAAG TGATTCATTC GGACGACGGA CACGACGGGT TTTTGTTGGA ACAGGAGCAA
GTGGCGGCCC ACATTCAACA CTTTTTGACC CTCCACGAAC GACCCACTAC TATCTAA
 
Protein sequence
MDDSGQTYTY LNYPLENGQV LAEAQLRYQT YGQLNETRDN VMVVCHALTG NASLHAWWGD 
MLGPGKVFDT DKYLVVCCNI LGSCYGSTSP VSIRPGTDQP YGLDFPDVSV KDTVRLQLCM
LRDELKVASV HAVVGGSFGG MQAVEFAVQA GSTRAAFTDA HGQPFCKHVV PIACGAQHSA
WQIAISEVQR QAIYQDPAWP TDPFRATHGL RVARQLGMIS YRTPQGYGSK FGRERQRGRG
DDDTDGPAYG SHARWQVKSY LEYQGVKFLQ RFDPVTYVKL TEQMDSHDVT RQPAGSCPGT
VSKEQVLGHV TIPVLVLGID SDVLYPLAEQ QELARLLPNA TLEVIHSDDG HDGFLLEQEQ
VAAHIQHFLT LHERPTTI
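
The sequences above are printed in space-separated blocks of ten characters, so they have to be joined before any analysis. The following is a minimal sketch of that clean-up and of a GC-fraction calculation; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the exact method the database used to derive its GC content figure is not stated in the record.

```python
def clean_sequence(block: str) -> str:
    """Join a sequence printed in whitespace-separated groups into one string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumes an unambiguous DNA sequence)."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence from the record above.
first_line = (
    "ATGGATGACA GTGGACAAAC CTACACGTAT "
    "CTCAATTATC CGTTGGAGAA CGGACAAGTC "
)
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full gene sequence through the same two functions gives a value that can be compared against the GC content listed in the record.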