Gene PHATRDRAFT_45800 details


Gene Information

Locus tag: PHATRDRAFT_45800
Symbol:
ID: 7200930
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011676
Strand:
Start bp: 315394
End bp: 317796
Gene Length: 2403 bp
Protein Length: 800 aa
Translation table:
GC content: 47%
IMG OID:
Product: predicted protein
Protein accession: XP_002180215
Protein GI: 219118897
COG category:
COG ID:
TIGRFAM ID:
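
The listed lengths can be cross-checked against the coordinates. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon; the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(315394, 317796)
print(length)                     # 2403
print(protein_length_aa(length))  # 800
```

Under these assumptions the results agree with the Gene Length and Protein Length fields above.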


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAATGG TCTCTCAACC GCTTTCGCCA ATAACGCCGT CTGCAGCCTC GTGGTGGTCG 
CTCATGGACG ACGAACGCTT TGACGACGTA AAGTCAATCA ACCTTGATCG ACTGCCAGGG
TCACCGTCAA CCCAGCCAGC CACAAAGAGC AGCTTGTGTG CAATGGCCAT CGCTGAAAGC
TCTTCTGACG AAACGACACT TGCTAGCTTT GATGATGACG ATGACAAACG AGCTTGGAAG
TCTCCAAAAG ATGCAGACAA ACGGGTTGGT CAGCAGTCTC CAAAAAAGGT TCAGTGGCAA
CATCCAATTC AAAAAACACT TCAGTTCCAT AATAACCATG ACTGCACTCC TTCAATTCGA
GATGCAACTG ACAAGCTGCT TGATTTGGTC GAAGGTAGCG CCTGCAAGTT CAATACCAAA
AAATCTGAAA GCTCAAAACG GCTAGAAATC CTCGAAAGTA ACGTATTGGA AAACAAAAAA
TCAGCTGTCT GGGATGCTGT CGACGATAAC TCTAGAAGCG AGAGTCGCAA TCACCCAAAG
AGAATTTCCT TTGTCAATGG ATCCTTTGAT TTCAATCCAG ACCATGGGAA GGGCGTAGCA
TTTAGCTCAG ACGATCGTTT CTTTGACCTG GAAGATGAGC TGGGCAACAT TCGGCAGCAA
AGAAGTACCG GTTCTGAATT TTTCTCCAAA TACAAAGACT GGCTGAGGGG ATCAGGATCT
GTGTTTGATC GAGAAACCCC AAATTTGGAA AGGGAATACA GGCATTCGAA AAATTGCAAA
AGTCTAGCTG CAGAAATTGT AATTCGAAGA TATAAGAAGA ACGAATCCCA CCCGGCTCTT
CCGCCAGACA TAAAACGCAA ACCCCACAAG ACAGGGTCGA AGGCGAACAA AGCTGTCCCA
GAGACCGTCA ACATCTCTCC GAAAACAGAA GTAAACCTCC GTTCACTTCA TCAAGTGTTC
AGTGGTCTTA CTCCAAGTCC TGGAGGGAGA AAGTGTTTTC ACGACGAATC ATTCGCAGAT
GAAAAGATCC CGAAAACCTG CATGTTGTCA ATTTCACCTG TCCTCGCTTC CAAGACGCGC
AAGATTTTTA AAAGACTTGC ATTTGCCCAC TCTGCATCAA CATTGTCAAC ATCCATGATG
ACTGGAGAAA GTCGACAACT TCATACTCCG GAAAACAATC TTGAAAATTC TGCAAAAGCT
AGTTTTTACT TTGCAGACAA CAGAAACTCC TACGTGGCTT ACTTCCAAAG AGGCGATGAC
GCTCATAAGT GTGTTGACCT CTACGAGCAG CCATCACCTT CAATTTTCCC AACTCTCGAG
TCTGAAGTTG TCGTCCGCAT TGAAGCGTCA ACAATTTCAA AAGCAGACTG TTTTGTTCGA
AGTGGCTTAT GGTGGGGAGA GGACAGCATG TCGAAACTCA CGCTCCCTAT CGTTCCAGGT
GTTGCCTTCT GCGGAATCGT ACACCAAATT GACAAAAGGC GTCATCGAAG CGGTTTGAAG
AGAGGTGACC GAGTGATTGC TCTTGTACGG GTTGGGGCTA ACGCTCGGCA TTTGTGCGTC
CATACTGATC GAGTAATAAA AGTCGCAACA GATCTGACAG ATGTGCGATC ACTGGCTTGT
CTTCCCGAAG TGTATCTCAC GGCATTTCAG TCGCTAAATA TTATTCGCAA GAATTCTTGT
CGCTATCGTT CGTCTTCATT GACAGGCAAA TCTATTCTGG TACTCGGAGG GGAAACCTTG
CTTGGTCGAG CAGTCCTTGA GCTGGCGTGC GCTTCCAACG CAGCCACTGT GTACGCCACG
GCACCAAAGA GCCAGTTCAA TCTTATTGAA CAATGGGGGG CCGTTCCTTT GGAAGAGAAT
CCCCATCATT GGTTTTCCCT GCTCAAAGGG CGCCTAGACA TGCTTATCAG TGTCAAGGAT
TCCACAAGCG ATGAATCGGA GCTCAAATCT GAGCACGCAC AAGCACTTAA CCGAAAGGGT
ATCATTGTAG AAGTCGGAAA GCCTGAGCGA AAGGAACGAT TGATGGTTTC CCTAGACAAT
CTCGAGACTT GTGGTACTGA AATGGATCGG AAGCTTTATC ACTACAATGT GTTTGATGCC
TGGGAGAAAG ACCCGAAACA AGCTAAGCGC GACCTGTCCC ACCTCTTGAA CATGTTGCAA
AAGGGGTTCA TCCGGCCCAA GATCCTCAAA ACTGTTCCTT TAAGTAAAGT TGCCATGGTC
CACAACTTTC TTGAAAGTAA GTGCCGAGAC GGATTCATGC TTTGTGAGCC TTGGGCGAAT
TCAGCGAGGC GTGAAATTAG CACCTCGGGA ATTGGTTTTC ACGGAGAGTT GGCTAGCTTT
CCTGTTGGTG ACGGCCGAAA AAGAAAAAAA AATATAACAG AGTCACAGCG AGTCGCCATC
TGA
 
Protein sequence
MKMVSQPLSP ITPSAASWWS LMDDERFDDV KSINLDRLPG SPSTQPATKS SLCAMAIAES 
SSDETTLASF DDDDDKRAWK SPKDADKRVG QQSPKKVQWQ HPIQKTLQFH NNHDCTPSIR
DATDKLLDLV EGSACKFNTK KSESSKRLEI LESNVLENKK SAVWDAVDDN SRSESRNHPK
RISFVNGSFD FNPDHGKGVA FSSDDRFFDL EDELGNIRQQ RSTGSEFFSK YKDWLRGSGS
VFDRETPNLE REYRHSKNCK SLAAEIVIRR YKKNESHPAL PPDIKRKPHK TGSKANKAVP
ETVNISPKTE VNLRSLHQVF SGLTPSPGGR KCFHDESFAD EKIPKTCMLS ISPVLASKTR
KIFKRLAFAH SASTLSTSMM TGESRQLHTP ENNLENSAKA SFYFADNRNS YVAYFQRGDD
AHKCVDLYEQ PSPSIFPTLE SEVVVRIEAS TISKADCFVR SGLWWGEDSM SKLTLPIVPG
VAFCGIVHQI DKRRHRSGLK RGDRVIALVR VGANARHLCV HTDRVIKVAT DLTDVRSLAC
LPEVYLTAFQ SLNIIRKNSC RYRSSSLTGK SILVLGGETL LGRAVLELAC ASNAATVYAT
APKSQFNLIE QWGAVPLEEN PHHWFSLLKG RLDMLISVKD STSDESELKS EHAQALNRKG
IIVEVGKPER KERLMVSLDN LETCGTEMDR KLYHYNVFDA WEKDPKQAKR DLSHLLNMLQ
KGFIRPKILK TVPLSKVAMV HNFLESKCRD GFMLCEPWAN SARREISTSG IGFHGELASF
PVGDGRKRKK NITESQRVAI
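
The relationship between the two sequences can be sketched in a few lines of Python. This is a minimal illustration, not the method used to produce the record. The Translation table field is empty in the record, so the standard genetic code (NCBI table 1) is assumed here; the helper names `clean`, `translate` and `gc_content` are hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered over T, C, A, G.
# Assumption: the record does not state which translation table applies.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence from its first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence above.
print(translate("ATGAAAATGG TCTCTCAACC GCTTTCGCCA"))  # MKMVSQPLSP
```

The first ten residues produced this way match the start of the protein sequence above. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field.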