Gene PHATRDRAFT_45973 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_45973 
Symbol 
ID7200845 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011676 
Strand
Start bp818866 
End bp820346 
Gene Length1481 bp 
Protein Length475 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180323 
Protein GI219119113 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTGGGGA AAAAGCGCCG CTCAACGAAA ACGCTCGAAT CTTTGTCGAC GACTCCATCC 
GATGGTGTCA ACGGAGAGAA ATCTGAATGT GTCGATGAAT CGTATTCCAG TCTTTTGCAT
CCCCGACCCC TCGCAACAGT TTCATGGAGT ACATCTTTTT GGGTTGCTCT CGTTTTTAGC
TATCTAATCA TTCTGTTCCT CATCTCTAAC TGGGCGAGCT CTCAGTTGCC AAATCCCTCC
GTTTGTTCCG TGGCTGCTCC CGCCGTAGAT CCGTCCCCGA AATCCCACCA TAAGTGGCTG
AAGGCCTGGA AAGCTTCTCC CGTGGTCAAG GGTTGGAAAG GAGCCCGCAG CTTTACTCGC
TTTGTATTAC GACGCAAGTC CAAAACAACT CCCTTGTCTG CTCTTAACAG TAGATACAAT
TCGATCGCCA TTCCACAGGC TTCCACGTCG CCTCCAAAAA TATCGAGCTT GGAACAGCAG
CTTTTGACCG AGCTGGCTAC TCGTGTGCAC CAGAAATGTC CGGATGTGCT TGATCGAGCC
AAAAAAGTGC CCTGGGGCGG GCATGGTGAT GCCTCTTGGT GGTTTCCGGT AGGAAATCAT
ACAAAACCTA AATCATTGAC ACCGCTCGGA CAAAAAGACG GGGGCTTTCT TTTATACGGC
CACTACAGGA TCTTGTCCAA GTCCCTGCGA AATCCTCGCG ATCTTTCCTT CACCCACTTT
CCTTTCCGTT TATGCAAAAC GGGGTGTCCC GCCGAGCAAG GCGTACTGCA CACTTTGCAA
TGGCGTGAAA CGTACCGACC TTGGATGATG TCACCGTCAG GCATTTCGGA GAACCGTATT
GGATGGGTAT ACACGAGAGG ATTCGCCAAG GCCTCGCCCC AGAATTCTCG CTATGGTCGT
CATGCCATGA TTTGGGTCCG TCCCGGGATG CACCAAACTG TTGATGGCAT GGCTTATTTT
CGTGTAATTC TCAACACAGT TGACGCAGCC ATTGCCGCCG CGCTCCGCGA TTCCCACGGG
CGTGTCGGCA AGTTTAATGC GGTCATCGAT GCCACCAATT ACGAGTGGTC AAAAATGCCA
AACATTGCAC ACATTAAACA GCACGTCACT ATGCTCCAGG ATCATTACCC GGACCGGCTT
GGAGTGCTAC TTTTGATCAA TCTCTCGCGA TCGGCCGAGT TTTTCGTCAA TATTGTCAAA
AATTTATTGA CCAAAGAAGT CAGAGAAAAG ATCATGGTGT TGCCGCATAA TAAAGAAAAG
GCTCTTGCTC AGTTGGGCGC GGTAGTTGAA AATGAATACA TACCAGACTG GCTAGGCGGG
CCAGACAGAT TCCGATTTGA TGGTTTACAT TACTATGCAA AACAGCAACG CATGAGTGAA
GTTGATAGCC GCTCCTTCCT TGTCGCTATG CCCTACCACG CAAATTAAGT ATGAAGCACA
AAAACCATGT AAAATATGAA AACAAAGCTC TATTATATTA C
 
Protein sequence
MVGKKRRSTK TLESLSTTPS DGVNGEKSEC VDESYSSLLH PRPLATVSWS TSFWVALVFS 
YLIILFLISN WASSQLPNPS VCSVAAPAVD PSPKSHHKWL KAWKASPVVK GWKGARSFTR
FVLRRKSKTT PLSALNSRYN SIAIPQASTS PPKISSLEQQ LLTELATRVH QKCPDVLDRA
KKVPWGGHGD ASWWFPVGNH TKPKSLTPLG QKDGGFLLYG HYRILSKSLR NPRDLSFTHF
PFRLCKTGCP AEQGVLHTLQ WRETYRPWMM SPSGISENRI GWVYTRGFAK ASPQNSRYGR
HAMIWVRPGM HQTVDGMAYF RVILNTVDAA IAAALRDSHG RVGKFNAVID ATNYEWSKMP
NIAHIKQHVT MLQDHYPDRL GVLLLINLSR SAEFFVNIVK NLLTKEVREK IMVLPHNKEK
ALAQLGAVVE NEYIPDWLGG PDRFRFDGLH YYAKQQRMSE VDSRSFLVAM PYHAN