Gene PHATRDRAFT_47759 details

Gene Information

Locus tag: PHATRDRAFT_47759
Symbol:
ID: 7202923
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011682
Strand:
Start bp: 750158
End bp: 752634
Gene Length: 2477 bp
Protein Length: 790 aa
Translation table:
GC content: 50%
IMG OID:
Product: predicted protein
Protein accession: XP_002181970
Protein GI: 219123310
COG category:
COG ID:
TIGRFAM ID:
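
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, though it reproduces the listed gene length) and that the protein is followed by a single stop codon. The function names are hypothetical and used only for illustration. Any bases left over after the coding region would be flanking sequence that is not translated.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 750158
END_BP = 752634
PROTEIN_LENGTH_AA = 790


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_length_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode the protein, optionally with one stop codon."""
    codons = protein_length_aa + (1 if include_stop else 0)
    return 3 * codons


length = gene_length(START_BP, END_BP)
cds = coding_length(PROTEIN_LENGTH_AA)

print(f"gene length: {length} bp")
print(f"coding length incl. stop codon: {cds} bp")
print(f"remaining flanking bases: {length - cds} bp")
```

Under these assumptions the computed gene length agrees with the Gene Length field, and the coding region is shorter than the listed gene sequence, which is consistent with the sequence starting before the first ATG codon.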


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.308372
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
TGAGGATCCG TGTTTACAGT CTGCAGCATA GAGTCATTCA CCGTCAATCG TATAAGCAAA 
GTGTCTGCTT AATTGAAACA GTATGAATCT TCCAGGAAAT GCAGACTTCT TGGAAAAGGC
TTCGCTCATG GAGCCGTTGC TTTCCATCAC TGAAAAGGCA ATAAATGTAT CTTCGATTCC
CGGAAACGAC TCGTTTAACG CACCGACGGC CGCTCTTCGA GGTGAGCAGT TCGAGATCGC
GTTGCGACGG CACGTTGTTA AAGCAGCATC AAAAGTTACA GAAAAAGGCG TCAGTGAGCT
TGACGAATTG GGAGTGTTTT GGGATCTTTG CTTAAAAGTG TGTCTACACA TTATGATAGT
CAATGACAAA GCGCTTGTCG ATTCCTCGGC TGCCGATCCC CGGTACAAGG ACATGTCGGT
CCGTAAAGTA CCCTTTGTAC TGCTAGAAGA CATTTTGGAT GTACTGTCTT CACCACTGGC
TTTGCAGTTT TGGAGTAGTC GCGTACGGCC GTCGTACGAT TTCCTCTTTG CTCCAACCCT
GTGGAGTCCA ATACGTGGTG ATAGTGCTGC GTCTCCATCG CATCCCTGTT GGTTGCCCTT
TCTCAAGATC AGCCAGAAAT TTTTGCGTCG ATTGGTTCCA GAAGCTGCCG CACCTATTCT
AGTGCAGCTT TCCACTGTCT ATCCGCTTTC GGAAAAGTCG GCTACTAAGG TTTGGGGAAG
TCACGGAGAA AGCACCACCG AATACGACTC GTTGGAAGAC TTTCACAAAG AAGAGCAGTC
TATAACTCTC GACACTACAA CCCCAAATGG TTCATCTTCA TCAGTATACG ACTACTCCTT
TTACGAATCT TTTTGGAGAC TACAAGAAGA CTTGTCCAAT CCCAACTCCA TCAAGGTGGC
CGGATTTTTA TCTCGTGTTC GCTCCATGAT GACTGCCTTT GAAAACCAAA CCTGCGATAC
AAATACCTCT CAAGACTCAC CGATTAACCT GTACGGTCAC TATTTAACGA GTTCCCGAGT
TTTAGCGATT CAACTATTGG ATGCGTCGTT TCAAATTCAT GTATTGACGC AGTTTCTGAT
TGTGGCAAAA CACTTGATGG CTCAAGTTCC AGTTTTGGAA TCTCAATTGG CAGATCACGT
TACTCGCGCC AAGAATCGAC TACAGTTGGA ATTGGGAGAC GCTGGGCGCC ATCAATTGGA
ATTATTGCAC CACCTGTGGC AAGGGTCAGA GTCGTTGTGG CGAGATTGGA AAAGGAAGAA
ATGTCCGGCC GATATCGACG CACCCAAGCT TGCACTTTCT GCTGCTGGAG GGTCTCCGCC
ACGTAAGCGA CTCCTTGGCG CGCTCGGTAG TGGCAATGGT GAAAGCAATG ACGCAGACGA
GAGGAATACG GATTACTCGC TGGCACAAGT ACACGATGAG CTACCCGCTC TATCGAAACG
TATGAAACGG TTGGCGCCGG ATCTATACAC ACATTTGCAA GACTACGTGG AGGCCTTAGA
TCCCGATGCA GGAATCGAAG CCGAGTATCA TCCTAAAAAC AACGCGTTGT TCGGCTGGCG
GGCGCTGCGG TTACTTTCCG TCGACCATCT AGGAGAATTC AATCTACTGG ATCGCAACGG
TGATTGTGAA GGTCTTGTGC GAACAATTTA CCAACGCAAA GGAATTGTTA TACCTGGCAA
AATACCTGAA AGTGCGCTAG ATGAGCTCGA AGAGTTCGAG GCTGGAGATC CCGGTGTCGG
AAACGATGAC AGTGAAGGGA ATACAGATAC GTTAAAGGAA GTCAGGGGCG TAATGGAAGA
AGATGCTGAA AACCATCTCG ATAGCGGAGA TGACCATGTC GAAGAAAAAG TTGGAGTGCC
TATAAAGGTT GAGCAACCCT CCATTGATAC CGAGGACCGT GTTGACACCA GTCACAAGGT
CGAAACGAAT TTGAAACAAT CTTCATCTTC GCCTCATAAA CAAACTGAAA AGCTTGTTCT
GGAAAGCACT TTGCCTCGCG AGACTGGTGG GCAAATTGCA AAATCGGGAG ATGGCAGGAT
GACCCAAAGT TTCGTGCAAG AAGGTTCCAA AGAAGGATCC CGCAAACGGA GTCGATCGCC
GGTCCGTTCC GAAGACGATC GGAGTCGTTT GAATCGGGAG GAGTCCCGTT CCAGAGTCCG
AGATCACGGC CAGCGCGGTA TCGTAGATCT CGAGGGCCGA GGTGGAGGGA AAGGAAACGG
CGGTTCTAGT AGAGGCCGTA ATGGACCGGG ACATCATGGT ATGTATAGAC AAGATCAAGG
AGGTCGTGGT GCTGGAAACA ATCGACCGCT GCTGCCCCGG GAATCCGCTC CCAGGGACGG
GCCTCCCCCA CTGCGTGATG GTAGGGCTCG CCGTGGTGGT GACAATAGGC ATGATGATTG
GCGTGGCGAC GAACGCCAGG GAGGGGGAAG AGGCAACCAT AGAGGCCGGC GGTGACAAAA
CGTGGCGTTG ACACGAT
 
Protein sequence
MNLPGNADFL EKASLMEPLL SITEKAINVS SIPGNDSFNA PTAALRGEQF EIALRRHVVK 
AASKVTEKGV SELDELGVFW DLCLKVCLHI MIVNDKALVD SSAADPRYKD MSVRKVPFVL
LEDILDVLSS PLALQFWSSR VRPSYDFLFA PTLWSPIRGD SAASPSHPCW LPFLKISQKF
LRRLVPEAAA PILVQLSTVY PLSEKSATKV WGSHGESTTE YDSLEDFHKE EQSITLDTTT
PNGSSSSVYD YSFYESFWRL QEDLSNPNSI KVAGFLSRVR SMMTAFENQT CDTNTSQDSP
INLYGHYLTS SRVLAIQLLD ASFQIHVLTQ FLIVAKHLMA QVPVLESQLA DHVTRAKNRL
QLELGDAGRH QLELLHHLWQ GSESLWRDWK RKKCPADIDA PKLALSAAGG SPPRKRLLGA
LGSGNGESND ADERNTDYSL AQVHDELPAL SKRMKRLAPD LYTHLQDYVE ALDPDAGIEA
EYHPKNNALF GWRALRLLSV DHLGEFNLLD RNGDCEGLVR TIYQRKGIVI PGKIPESALD
ELEEFEAGDP GVGNDDSEGN TDTLKEVRGV MEEDAENHLD SGDDHVEEKV GVPIKVEQPS
IDTEDRVDTS HKVETNLKQS SSSPHKQTEK LVLESTLPRE TGGQIAKSGD GRMTQSFVQE
GSKEGSRKRS RSPVRSEDDR SRLNREESRS RVRDHGQRGI VDLEGRGGGK GNGGSSRGRN
GPGHHGMYRQ DQGGRGAGNN RPLLPRESAP RDGPPPLRDG RARRGGDNRH DDWRGDERQG
GGRGNHRGRR