Gene PHATRDRAFT_29758 details

Gene Information

Locus tag: PHATRDRAFT_29758
Symbol:
ID: 7194860
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011686
Strand:
Start bp: 545623
End bp: 548698
Gene Length: 3076 bp
Protein Length: 895 aa
Translation table:
GC content: 54%
IMG OID:
Product: predicted protein
Protein accession: XP_002183121
Protein GI: 219125718
COG category:
COG ID:
TIGRFAM ID:
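
The reported gene length follows from the start and end coordinates if both are read as 1-based and inclusive. That convention is an assumption here, but it is consistent with the 3076 bp given above. A minimal Python sketch of the arithmetic, where the helper name feature_length is illustrative and not part of any database API:

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based, inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    # Both endpoints are counted, hence the +1.
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
gene_length = feature_length(545623, 548698)
print(gene_length)  # 3076
```

Because the gene is spliced, this span is longer than three times the protein length; it covers the whole locus, not only the coding bases.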


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GCGCTCGCTA GTAGTATTGT GAGAGTCAAC TGGCCGTACG TATCGTGTCT GTGCATTATA 
GTTATCACCA CTACCAGAAC TACCACTGTT ACCGTTAACA TGGGGTCCGG CAACCACGAC
GACAAGACGG CGGGGCGGGT CCTCTTGCCC GCACACGTTG TACCCACTCG GTAAGTGCCT
GGCAGCGTCT CGGGGGGGGG GAGGCGTTGG GGTCGCGACG CTATTCGGTC CAACCTCGTG
GCGGCGCGTC TACACACACA CACACACGTA TACGTATATA CATACGCTCT CCCTCACACA
CACACACACT CGCTGTAATC AGTTACGATT TGGCGCTGAC CCCAAATATA GAAGCCTTTA
CCTTTACGGG TACGGTGGAC ATTACCTTCC GGATTGACGG TAGTTTGTTG AACGAGACCA
ATAACAAGTC GATTACCTTG CACGCCAAGG AACTCTTGTT CTCAACCGCG TCTTACCACT
TGTTGGATGG CCCCGACGCG ACGCCCGTGA CGGCGGAACA AATGAACGTC AATCTCAAGG
CTACCACGGT GGAGTTTCTC TTTCCCGAGC CCATTCCGCC CGACGCCTCC ACACTCAAAC
TCACCGTTGC CTACACGGGA TTCCTCAACG ACCAAATGGC CGGCTTTTAC CGGTCAACCT
ACACCGACAT ACAGGGACAA TCCAAAATTA TGGTGTCCAC GCAGTTCGAA GCCCTCGATG
CCCGTCGCTG CTTTCCCTGT GTCGACGAAC CCTCGCGCAA GGCCGTGTTC GGGGTTACCC
TTACCGTACC CGCGCATTTG ACCTGTCTCT CCAACATGCC CGAAGCCAAG GTTACCGCCA
TCAACGCACA GCAGAAGTGT GTCACCTTTA TGGACTCGGT CGTCATGTCC ACCTACCTCC
TCGCCTTTGT CGTGGGCGAA TTCGATTTCC TCCAGACCCG CTCCGCGCAC GGTGTTCTCA
TCAAAGTCTA CACGCCGCCG GGGAAGGCCG CGGCGGGACA ATTCGCCCTC GACGCCGCCG
CCCGCGCCTT GGACGCCTAC AACGACTTTT TCAATCTACC CTACCCTCTG CCCAAACTAG
ACATGGTCGC CATTCCCGAA TTCGCCGCCG GTGCCATGGA AAACTGGGGA CTCGTCACCT
ACCGCGAAGT CGATTTGCTC ATTGACCCCG TCAAGGCCAG TACCATGCAG AAACAACGCG
TCGCCGTGGT TGTCACGCAC GAACTCGCCC ACCAGTGGTT CGGAAACCTC GTCACCATGG
CCTGGTGGGA CGATTTGTGG CTCAACGAAG GATTCGCATC GTGGGCCGAA AACTGGGCCA
CCAACGTACT GTATCCGGAA TATCGAATGT GGGATCAGTT CACCACGGGG CATTTGAGTA
CGGCATTGCG GTTGGATGCT CTGCAAAGTT CACACCCCAT TCAGGTACCC ATTGCACACG
CCGAAGAAGT GGAACAAGTC TTTGACGCGA TTTCCTACTG CAAGGGGGGC AGTGTGGTGC
GCATGATCAA GGCCGTAATT GGCTTGTCTG CCTTCCAGGA CGGACTGGGT GCCTACATGA
AAAAACACGC CTACGGAAAC ACGGAAACGT ACGATTTGTG GAATGCCTGG GAGGCCTCCT
CGGGCATGCC CATTGGTGAA ATGATGAAGT CCTGGACGGA GCAAATGGGA TTTCCGTTGG
TGCGTGTGCG GAAGGAAGAC TTTGCGGACG ACAAGGTTGT GCTGGAGTTG GACCAGACGT
GGTTTTTGTC GGATGGATCC GATATGCAGT CCGACAAGGT TTGGACTATT CCCATCTTGA
CCTGCACGGG CGCAGGGGCG CAAGCCGATA TGACCTTGAT GCGCGACCGC ACAGCCACGG
TCACGATTCC GTTTGATCCC AAGGACACGG CGCCCCGGTG GATCAAGCTC AATGCCGGTC
AAGAAGTCCC GATGCGTGTT TTGCCGGGCG TGGAAATGCT TCGACGCATG TTAGTTGCCA
TTGCGTCCAA GTCGATGAGC GCAATTGATC GCGCGGGGGT GCTGAATGAT TCAATGGCTG
TTGTCAAGGC TGGTCACATG TCGCCGGAAG CCATGATGAC GCTTTTGAAA AGTTACAAGG
ATGAGGATGA GTACGTTGTT TGGGAAGGGC TGTCGGATGC GTTGGGTGGC TTGGATGCGG
TCCTCTCGGA CGACGAGAAC ATGACGGGCT ACTTTCGAGT GTTTGCCAAG ACTATGGTTG
TGAATCTTAT GAATAAGGTT GGCTGGGAGG CGTCCGATTC GGATGAGCAT CTGACTAAGT
TGTTGCGTGG GATTATGATC AACCTGCTTG GTGCCTTCGC CTACGACGAC GAGAGTGTTC
AACAAGAGGC GAAGAAGCGC TTTGAGGCTT TCCTGGAAGA CGCCAACGAT ATAGAGTCGC
TCCCCAGTGA CATGCGCACC GCCGTCTTCA AGATTGTTCT AAAAAATGGC AGTGCCAAGG
AATACGAACA AGTGAAAGCT TACTTTGCCA CGGCATCGGA CAACGCCGAG CGCAAGCATG
TTCTTAATTC GCTCGGGTGC ATTCAGGACG ATGCGTTAAA ACTTGCTACC ATGGAATGGT
CGCTTTCGGG TGAAATTAAG TTGCAGGACT TTTTTTACCT CATGGGATCG GTAGGCCGGT
CTTCAAAACA GGGGCGTGAG ATTGCTTGGA AGTTCTTCCA GGAAAACTTT GAGCGCATTC
GCATTCTGCT GCAAAAGGCA CACCCCGCTT TGATGGACGC TTGCATTGTC ATGTGCGCCG
GCGGCTTTTG TTCGGAAGAA AGAGCGGACG AAATCGACAC GTTTTTTCAA GCCCATCCCC
TGCCGTCCAG TACACGCAAG ATTGCGCAAA CGACCGAACA CATGCGGGCG AACGGCAAGT
TCTTGCGAGT CCTGAAAGCC AGTGACTTGG CCAAGGCGGA GTTTTGGGAA AAATTGTAAA
GTCCAGAATT CGTTACACAA ATTACTGCGC GCTCACAGTC AAGTTCGTCG AGCTTGGCAC
CTACAATAGT TTACGGGTCG ACCGGAAACG AAACGACCAC AGACTGTGAA CCTCTAGAAA
TTTCGAAACT AGGCTT
 
Protein sequence
MGSGNHDDKT AGRVLLPAHV VPTRYDLALT PNIEAFTFTG TVDITFRIDG SLLNETNNKS 
ITLHAKELLF STASYHLLDG PDATPVTAEQ MNVNLKATTV EFLFPEPIPP DASTLKLTVA
YTGFLNDQMA GFYRSTYTDI QGQSKIMVST QFEALDARRC FPCVDEPSRK AVFGVTLTVP
AHLTCLSNMP EAKVTAINAQ QKCVTFMDSV VMSTYLLAFV VGEFDFLQTR SAHGVLIKVY
TPPGKAAAGQ FALDAAARAL DAYNDFFNLP YPLPKLDMVA IPEFAAGAME NWGLVTYREV
DLLIDPVKAS TMQKQRVAVV VTHELAHQWF GNLVTMAWWD DLWLNEGFAS WAENWATNVL
YPEYRMWDQF TTGHLSTALR LDALQSSHPI QVPIAHAEEV EQVFDAISYC KGGSVVRMIK
AVIGLSAFQD GLGAYMKKHA YGNTETYDLW NAWEASSGMP IGEMMKSWTE QMGFPLVRVR
KEDFADDKVV LELDQTWFLS DGSDMQSDKV WTIPILTCTG AGAQADMTLM RDRTATVTIP
FDPKDTAPRW IKLNAGQEVP MRVLPGVEML RRMLVAIASK SMSAIDRAGV LNDSMAVVKA
GHMSPEAMMT LLKSYKDEDE YVVWEGLSDA LGGLDAVLSD DENMTGYFRV FAKTMVVNLM
NKVGWEASDS DEHLTKLLRG IMINLLGAFA YDDESVQQEA KKRFEAFLED ANDIESLPSD
MRTAVFKIVL KNGSAKEYEQ VKAYFATASD NAERKHVLNS LGCIQDDALK LATMEWSLSG
EIKLQDFFYL MGSVGRSSKQ GREIAWKFFQ ENFERIRILL QKAHPALMDA CIVMCAGGFC
SEERADEIDT FFQAHPLPSS TRKIAQTTEH MRANGKFLRV LKASDLAKAE FWEKL
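
The sequences above are laid out in blocks of ten residues, six blocks per line, so the spacing has to be removed before a length or GC content can be checked against the reported values. A minimal Python sketch, assuming the text has been copied as plain strings; the function names clean_sequence and gc_percent are illustrative only:

```python
def clean_sequence(block_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block_text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA sequence."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Last line of the protein sequence: five full blocks plus five residues.
last_protein_line = "SEERADEIDT FFQAHPLPSS TRKIAQTTEH MRANGKFLRV LKASDLAKAE FWEKL"
print(len(clean_sequence(last_protein_line)))  # 55

# First block of the gene sequence: 7 of its 10 bases are G or C.
print(gc_percent("GCGCTCGCTA"))  # 70.0
```

Applied to the full gene and protein sequences, the same helpers can be used to compare against the 3076 bp, 895 aa and 54% GC figures listed in the record.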