Gene PHATRDRAFT_49168 details

Gene Information

Locus tag: PHATRDRAFT_49168
Symbol:
ID: 7195623
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011689
Strand:
Start bp: 116811
End bp: 118604
Gene Length: 1794 bp
Protein Length: 597 aa
Translation table:
GC content: 50%
IMG OID:
Product: predicted protein
Protein accession: XP_002183815
Protein GI: 219127173
COG category:
COG ID:
TIGRFAM ID:
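
The coordinates and lengths listed above are consistent with each other, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the final codon of this unspliced CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 116811
end_bp = 118604
protein_length_aa = 597

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length_bp = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codon_count, remainder = divmod(gene_length_bp, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
coding_codons = codon_count - 1

print(gene_length_bp)  # 1794
print(remainder)       # 0
print(coding_codons)   # 597
```

Under these assumptions the computed gene length matches the listed 1794 bp, and the number of coding codons matches the listed protein length of 597 aa.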


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTGACA AAAACTGCCA GGCCTTAACC TTGGACTTTA TTGCCGTAGA GGACGACGCG 
GAAATGATCG CCGTGGAGGA TGAGATTCGC CGGGACTTGA AAGAAATTGG CGTTACAGTC
AATACCCGTT TTCTTAGCCG AGAGGATTAC ATTGAAGCTG AGTTGAATGG TGACTATAAC
ATGCTCTTCA CTCGTACCTG GGGTGCCCCG TACGATCCTC ACAGCTATTT CAATTCGTGG
GCCGTTCCAA GCCACGTCGA GTACACGGCC ATTGACACGT TAGAAGCACC TCTTAGTCGC
GAGCTCCTTT TGAAAAAGAT TGAAAATGTG CAGAAGGAAC TAGATGAGAT GCAGATTCAG
GCACAGTGGC GAGAGATCTT GAACGATGTC CATCAGCAGG CCATCTTTTT GCCGCTCTGG
GGCACCCGAA TTCCATACGT GATCAACCGT CGTCTTTCAG GGTTTACGCC AAGTGATCAG
GCCTTTACGT ATCCATTAAG TAGCATTCGT ATCTCAAGCG GATCTGCCAA CATCACCATC
GCGCCGGGTT CCGGCGGCTC GCTCTTCACG TCGGTCGGAC CCTTGAATCC TCACCAGTAC
TTTCCCAATC AGATTTTCGC CAGCGATTGG ATTTACGAAG GCCTCGTGAA TTACGGACAA
GATGGTGAGA TTGTTCCATC GCTAGCATCG GAGTGGACTA CGGAACGCAC TGCCGAAGGA
CAGCGCGTTA TCTTCCAGCT TCGTGAAGGC GTCAAATTTC ACGATGGCAG TGATTGGAAC
TGCACTGTCG CCAAGCTCAA CTTTGACCAC ATTTTTTCCG ACACGGTCCG CGAACGTCAT
TCCTCATTTG GAGCTACAGC GAATCTCAAG AGCTGGACGT GCAATCAGAA TGGGGAGTTT
GTTTTGGAAA CGTCCGCACC GTTTTACCCT CTGCTCCAAG AGCTTACGTA TAGTCGCCCG
TTTGTTTTTG CGTCTGCTAG TTCCTTTGCT GCAGGCATTG ACTCTGATCC AGAGACTCAA
AACTCATGTG AATCCGGAGA TTTTGGGTCC AAATGGGACT ATCTTGAGGA GTTTGTTACC
TGCCTCGGTC TCTCGGCTCC CATTGGTACG GGACCGTTCA AATTTGCGGA TCGTGAATAC
CTCCCGGGAA CGAACGAGAC AATGGATGCC AAGGTTACGT TTGCGCGCCA CGAAGACTAT
TGGGGTGGCT TGCCCGCAAT CGAATTCCTT GAAATAATCC ACTTTGAGGA TACGGATGCG
GTCGAAGCCG CGTTATTTGA CGGTCAGCTG GATATGGTTT TGGGCTCCGG TCCCCTTTCT
GCCAAACAAG TTCAGAATAT CAAGTTTGTA CATAGCGACA AGTTTGATGT CCGCCACAGT
GCAGTTTTAC AGAATGCACT GGTTGTCTTA AACTCTGGTA AGGCACCAAC GGATGACATC
CAAACACGCC AAGCCATTAT TCACGCCGTC AACAAAGCAA TCTTTATTGA AGATGAGTTT
GCGGGCTTGG AACAAGCCGT TTCGCAGCTT TTGCCGCTCA CCGCACCGTA CAGTAACGTT
GATCTCAATC CAAAGTGGAA TTACGATTTG GAAAAAGCCA GATTTCTCAA CTGCCCTGCA
GATATGAATG GCAGCTCGGA GGACAGCTTG TCGGGTGGTG CAATTGGGGG TATTGTTGCG
GCAATTTTGG TGGTACTGGC AATGGCTGTC TTTTTGGGAC GTTTGATTCT ACGCGAAAAA
CAGGGGAAGC CAATGTTTGC CCCAGAAAAG ATACGCAAGG GCGAACAAGC TTGA
 
Protein sequence
MADKNCQALT LDFIAVEDDA EMIAVEDEIR RDLKEIGVTV NTRFLSREDY IEAELNGDYN 
MLFTRTWGAP YDPHSYFNSW AVPSHVEYTA IDTLEAPLSR ELLLKKIENV QKELDEMQIQ
AQWREILNDV HQQAIFLPLW GTRIPYVINR RLSGFTPSDQ AFTYPLSSIR ISSGSANITI
APGSGGSLFT SVGPLNPHQY FPNQIFASDW IYEGLVNYGQ DGEIVPSLAS EWTTERTAEG
QRVIFQLREG VKFHDGSDWN CTVAKLNFDH IFSDTVRERH SSFGATANLK SWTCNQNGEF
VLETSAPFYP LLQELTYSRP FVFASASSFA AGIDSDPETQ NSCESGDFGS KWDYLEEFVT
CLGLSAPIGT GPFKFADREY LPGTNETMDA KVTFARHEDY WGGLPAIEFL EIIHFEDTDA
VEAALFDGQL DMVLGSGPLS AKQVQNIKFV HSDKFDVRHS AVLQNALVVL NSGKAPTDDI
QTRQAIIHAV NKAIFIEDEF AGLEQAVSQL LPLTAPYSNV DLNPKWNYDL EKARFLNCPA
DMNGSSEDSL SGGAIGGIVA AILVVLAMAV FLGRLILREK QGKPMFAPEK IRKGEQA
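
Both sequences are displayed in space-separated blocks of 10 characters, so the spaces and line breaks have to be removed before any analysis. The following is a minimal sketch of how that could be done, together with a GC-content calculation and a simple translation. It assumes the standard genetic code (the Translation table field of this record is empty, so this is an assumption) and unambiguous A/C/G/T bases only. The function names are hypothetical, and only the first displayed line of the gene sequence is used as sample input.

```python
# Standard genetic code, written in the usual T, C, A, G codon order.
# Assumption: the record does not state a translation table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(blocked):
    """Remove the spaces and line breaks used for display."""
    return "".join(blocked.split()).upper()


def gc_content(seq):
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First displayed line of the gene sequence (60 nt).
first_line = "ATGGCTGACA AAAACTGCCA GGCCTTAACC TTGGACTTTA TTGCCGTAGA GGACGACGCG"
print(translate(first_line))  # MADKNCQALTLDFIAVEDDA
```

The translated sample agrees with the first two blocks of the protein sequence above (MADKNCQALT LDFIAVEDDA). Passing the full gene sequence to `gc_content` would give a value to compare with the GC content field of the record.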