Gene PHATRDRAFT_4339 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_4339 
Symbol 
ID7203155 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011683 
Strand
Start bp350069 
End bp351258 
Gene Length1190 bp 
Protein Length222 aa 
Translation table 
GC content44% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182204 
Protein GI219123797 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.00625498 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GGCATTGTAT GCAATGACCG TGGGAGTATT ATTATACTCG ATCTCAAAGC TGAGAGACTC 
ACTGGTCGAA TACCTGAAGA GATCGGTCTT CTGAAGGAGT TGACTGTTCT TGATTTGTCA
AGGAACTTTT TGGAAGGCCC TATCCCAGGA AATGCAGTAG GGAAGCTGAC CAAATTAGGT
AAATCGATTG GATGCGCCTT TTGTTTTGAT TGAAGAATAG AGCACTAACC CTTTGATGAA
ATGGTGTGAG GAAGAAAAAC TGTCGCTCGA GTACAATGAG CTAGGTTCCT CCTTACCCTC
CGAAATTGGT CACTTGACGG AGCTTACTTC CTTAGCTGTC AACTTTAACT ATTTGACTGG
GACTCTTCCT TCGGAATATA TTCGCCTACC TAAACTTGCA GCGTTTAACT CCAACGGTAA
CATCGGACTC ACTGGCCCCT TCCAAGAACA TTTGATAGCA TGGACAAATT TGCAAGAGTT
TTTCATACCT GGAACTCGCT TTACAGGATC CATTCCTGAT GAAATTGGCG AGCTCAGTGA
CTTGCGGAGA CTGAGTTTCG ATGACACAAT TTTGGGGGGT ACCGTCCCCA GTTCGCTCAG
AAAATTGACA AATCTTGAGA CGCTGTTTCT AGGAAGTCTT TCGATGTCTA TGAACATGAC
TTCCATCATC GGCAATTTGT CATCTTTAAG TAAGTCGACA GTAAATGCTT CTTACAAAGA
CTAGCTGACA TCTGATTGTA TTTTCCTATC TTATCTTATA TTCACAGTGG AACTCTCAAT
TCGTGACTTA ACAATACAAG GAAACGTTCC AACAAGAATT GGTGCTCTCT CCAATCTTGA
GTTTGCGCAG ATCACTGGTG CTGCCCTGAC AGGAACGCTG CCGACCGAAG TGGGCCTACT
CACGAAGCTT CGCGTTATGA TCTTGGTATG CAGCTACCTC TGTTGTGTTG AGGACCCAAT
GCCAAGGTAA ATAGAGGGCA CTCACAATGT TTGTGCGATA CACATAGGAT GACACAACAA
TGACTGGTTC TTTGCCGTCC GAGATTGGAT ATCTGACAAA GCTCGGTAAG TCGTTTGTAG
ATTTTTGGCT TTGTATGCAA CCATGATGAT TTACTCATCA ATGGTGTTTC AATCCAGAAA
CACTCCAGAT AACTCGAAAT AACTTTAGCG GAATCATTCC GCAGTCGTAC
 
Protein sequence
GIVCNDRGSI IILDLKAERL TGRIPEEIGL LKELTVLDLS RNFLEGPIPG NAVGKLTKLE 
KLSLEYNELG SSLPSEIGHL TELTSLAVNF NYLTGTLPSE YIRLPKLAAF NSNGNIGLTG
PFQEHLIAWT NLQEFFIPGT RFTGSIPDEI GELSDLRRLS FDDTILGGTV PSSLRKLTNL
ETLFLGSLSM SMNMTSIIGN LSSLKTLQIT RNNFSGIIPQ SY