Gene PHATRDRAFT_48695 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_48695 
Symbol 
ID7194680 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011686 
Strand
Start bp633181 
End bp636099 
Gene Length2919 bp 
Protein Length972 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183260 
Protein GI219126009 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATTTGT TTTTGAAAGC AAAAAAGAGA AGTAGTAGCG ATGGCCCGCG CTCAGGCGCC 
CTAAAGTGCT CTGCCAAGGT GGACCTTTTT AAGCCCACGA AAAAGCTAAC ACGAGCTCCT
GAGGATAGCG ACTACAACGT TGAAAGTGCT AAATCCATGG TCGCCTCGGC CGGAATAGCG
TCTGGACAAG AACAGTGCTC TATAGACAGA ACCGTCGGGA TCCCAACCTC CACCCTTGCT
CTCTCAGCGG AAAAAGGCAA CCACCAATGG ATGTTTTTCA ATGTAACGCA GGGTTCTGGT
GTCTACGAGA GCCTCCAGCA GTTTTCTCAC GAATTGGACA AGATTCCAGA TTTTGAGAAA
AGCGCGTACA TAGAGGCTTG CCAAACCATC CCAATGATCT TGTATCGAGA GTCGAACCCA
GAGATTTTCT TGCGCTTCCG TTGTGAAAAC GTCGCCGCCG CCGCACTCCT CTTTGCACAG
CACTGGCAGG TCCGGAAAAA TGTCTTCGGG GAGCGAGCCT TTTTACCCAT GAACTCGACA
GGAGAGGGAA CCTTATCACC GACCGATTTG TTGCTGTACC GCACGCACTA TTTGGTCGCT
TTGCCCAACA ATCAAGATGG TTGTTCCGTG CAGTTCTTCG AACCTGGAGT CTTAGCCTCC
TTTACACCAG CGCGTCTCCG CTGCGTTTTT TATACCCAAT TTCTGGCTAT GCACAACACT
AACAATGCCA AAGATGGCTA CACAATTCTG ACGTACATGG ATGAAAGGGG CTTTGATGGC
ATCTTTGGTC GTGAGACACC AGCCAGCCTC TTAAAGGCGA TGCCAGTTCG TTTGGGGGCC
CTGCTCTTGT TCCACACCTT AACTGGTGAC GAGATGAATG TTTTTGAGCG CTGCGTTCCA
CAACTCACTA GATTGTACAG CTGTCCTGTC AAAACATTTT CCTTGGCAAC TTGTACAAAA
GAGGAGATTG GGGACTTCTT CCAGATCCGG GCTCTCAAGC CCGAAGCTCT TCCTGTAAAA
GTTGGTGGAG CCCTTTCTCA GGCAGATGTT ATCAATCTGG ATAAAGCGAG AACTCAAATG
GAGTGGCAAG CCCACGCTGA ACTTGCGCAC TCTGACTTGC TCAACCTTGC CGATACTGAC
AAGGTTTTCC AATCCAAAAC AAAGGGCCAT GACTTATTGA ATGTGCCCCA AAGACCCAAA
AGTTGGATCT TTCAGACGCA ATTAGACCAC TGTAATTCCA CAGCGCTCGA TGCCTTTGCG
AAGGCGCTTC GCGAGATTCC TCAAAGCGAA AAATCTTCTT TTTTGGAAGC ACTTGAGTCT
GCTCCAGACC TGCTCCACCG AGAAACGCAC CCAAGGATAT TTCTTCAGTT TGAAGCAAAC
GATCCTTTAG CAGCTGCAAG ACGATTTGTT TCTCACTGGC AAACACGCCA AAGACTGTTT
GGAGATCGAG CGTTTTTGCC CATGAATACA ACGGGGGACG GAGCTTTATC TAGTGATGAT
ATCGAGCTTC TATCCTCCAC CTATATCACA CTTCTGGAGA AAGACAACGT CGGCAGAGAT
GTGATTTTTT ATACTCCGGA ATTGGCTGGT GCTGAGTCAA CAGAGCGCCT GAAATGCGGA
TTTTATACCC TGACTCGAAT TATGCAAAAT GAAAAAACGG TTGAGAACGG CGCCGTTGTG
ATTGCTTTGC TAGGCCAAAC AGAAATGGGA CAGGCGGCAG GGAGGTCTGG AGTTGCTGAA
ATGTTACGAC GGTCAATACC AATACGTTTC AAGAGAGTCC ATATAATTCA TTGCTTGAGT
GGTTTTGAGG AGCAAGTGTT TGAAAAGAGA GTCCTACCAT CTGTCCTTGG TTTGTTTGGC
TGCCAAGTGC TTGCGCACGA TGCAAATGCG CCATCGGAAG TTTTGGCTAC GCTTGTTGCT
TGTGGCCTAA AGGAAGAACA CCTGCCTGAT TTGATCGGCG GAAAGATGAG TTTTGCAGAT
ATTGTGGCTT TGAACGAAAA TAGAAGGGGC CAGGAAAGAG CAGATATTAC AGGCCTTTTA
TCCGGTGTTG GAGAGCCTGG TGAAGTACAT GTTGCAGACG ACCTATCGCT AAACGGTGGG
TTTGACCATG AGAACGGCGA TCCGCACTTT CGCGATGAAA TGATTGTCAA AAACCCAAAG
GAAACGCAAG AAGTGAGGGA TTGGGAGCAA AGAAACGGTG CAATTGTTTC ATCGTCTGGA
GAAGTCGAAG TCAACGAAGA AAATCTCCAC TTTCTTAAAG GAGAAAACCG TAAACGAATT
CTCAATGTAA TCGCATCACG GCGAAAGCGG CTACGACGAA AAGAGCGCTA TGAATCTCTA
GAAAAATTTT GCAATGACCT TTGCGCACGT AAGGCAGCTA TGCAAGAAGC AAATGCTCGA
CTCGAGGAAC TTCTGCGAAA GGCCAGCGGC GTGGTGGCAG CATACAAAAT GTCAATGCGC
ACGAGTATGA GCAGCAGCTT TCCCCTCGAC ACTTATACCT TGTCAACAAT GGGGTATGGT
GCACATGCTT TACATGGATC TGTAATGTTT TCTCCACATG GTATGCCTAT TCCTGTGACT
AACGGTCTGA TCGAAGAACA GCTGTGGAGG CATTTCATGG ATGGACAGCT TGTAACACAG
GCAGCGGCGA ACGAAGCAAA TCACAATGCT TCCGTTGCAC AACGTCATAT GCTGCCGATG
TTCCTGAGAC CGTCCGACAG GGGGGCCCGA CCGTACGGAT CATTGCCGGT CGGTCCTAAT
CTATGTGGCG ATCTGCGAGA TATCGCGGGC AGGGCTCTGA CTGCTAATTC TTTGCGAGAT
GGGCTAACCG CTTGGGAAAT TGGCGGCGCG CTTGATGCCA CACACAAAGA CCGTTTGTAT
GCGGAGCTCG ATCGTAAAGT GCTACGAAAA CCCGATTGA
 
Protein sequence
MDLFLKAKKR SSSDGPRSGA LKCSAKVDLF KPTKKLTRAP EDSDYNVESA KSMVASAGIA 
SGQEQCSIDR TVGIPTSTLA LSAEKGNHQW MFFNVTQGSG VYESLQQFSH ELDKIPDFEK
SAYIEACQTI PMILYRESNP EIFLRFRCEN VAAAALLFAQ HWQVRKNVFG ERAFLPMNST
GEGTLSPTDL LLYRTHYLVA LPNNQDGCSV QFFEPGVLAS FTPARLRCVF YTQFLAMHNT
NNAKDGYTIL TYMDERGFDG IFGRETPASL LKAMPVRLGA LLLFHTLTGD EMNVFERCVP
QLTRLYSCPV KTFSLATCTK EEIGDFFQIR ALKPEALPVK VGGALSQADV INLDKARTQM
EWQAHAELAH SDLLNLADTD KVFQSKTKGH DLLNVPQRPK SWIFQTQLDH CNSTALDAFA
KALREIPQSE KSSFLEALES APDLLHRETH PRIFLQFEAN DPLAAARRFV SHWQTRQRLF
GDRAFLPMNT TGDGALSSDD IELLSSTYIT LLEKDNVGRD VIFYTPELAG AESTERLKCG
FYTLTRIMQN EKTVENGAVV IALLGQTEMG QAAGRSGVAE MLRRSIPIRF KRVHIIHCLS
GFEEQVFEKR VLPSVLGLFG CQVLAHDANA PSEVLATLVA CGLKEEHLPD LIGGKMSFAD
IVALNENRRG QERADITGLL SGVGEPGEVH VADDLSLNGG FDHENGDPHF RDEMIVKNPK
ETQEVRDWEQ RNGAIVSSSG EVEVNEENLH FLKGENRKRI LNVIASRRKR LRRKERYESL
EKFCNDLCAR KAAMQEANAR LEELLRKASG VVAAYKMSMR TSMSSSFPLD TYTLSTMGYG
AHALHGSVMF SPHGMPIPVT NGLIEEQLWR HFMDGQLVTQ AAANEANHNA SVAQRHMLPM
FLRPSDRGAR PYGSLPVGPN LCGDLRDIAG RALTANSLRD GLTAWEIGGA LDATHKDRLY
AELDRKVLRK PD