Gene PHATRDRAFT_18337 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_18337 
Symbol 
ID7197236 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011670 
Strand
Start bp1197770 
End bp1200001 
Gene Length2232 bp 
Protein Length596 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177778 
Protein GI219112053 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CAGAGCTGCT TGTGGCAGTG CGTGATCCAT CGGATTCTGT TCACAGTCAA ACATACATCC 
TGCCGAGAAG TCGTGTCCAC ACGAGCTATA TAGTATATTC CGATAACTTC ATCGTTTGCT
GGTTGACGAT TCAAGCAAGT CTTTTACAAA ATTTCCCGTA ATTACTGAAA AGGCATTTCC
TTGAAGTCTT GCTGCAGCAG CTGTGCCCTC ACATTGGTCA TCTCAATCAT GTGTGGAATC
TTTGCCATTT TCTCCTCTAG CCTTCCGGAA AGCGACCTGC GACGCGAGCT GATTTCCTGT
TCTTCGCGGT TGCGTCATCG TGGTCCGGAC TGGTCGGGTT ACAAGGTAAT CGAGGCCAAT
GTCGAAGCTG GCATTCCGCT TTCGCACGGT ATCGCCCACG AAAGACTCGC TATTATGGAT
CCTGAGTCCG GATCGCAACC GCTAGTTTCG CACGACGGGT CCTTAATCGT CGCCGCTAAC
GGAGAAATCT ACAACTACAA GGAACTCTAC GAGACGCTCG AGACGCCTTA CAAGGCCAAG
ACGGGGTCGG ACTGTGAAGT CATTTTACCT CTCTACGAGC AGTTTGGTGC ATCCATCGAA
ATTCCTCGGT TGCTGCGAGG AATGTTTTCC TTTATCTTAT ACGATCGCCA CAATGATTCC
TTTATGATTG TTCGTGATCA TCTCGGTATT ACACCACTCT ATATTGGATG GGCGAACGAT
GGATCCGTGT ACGTGGCTTC CGAGATGAAA AGTTTGGTAG GGCATTGCAG CAAGTTCCAG
AACTTTCCTC CAGGACACAT TTTCTGTAGC AAGGGAGAAC ATGCGGGCGA ATTCCAACGG
TGGTTCAACC CATCGTGGGC TCCCGAAATG AAGCCGGGCG TCCCCCTGCC GAAGCAACCA
TATCAAGCGG TACGTCCATA GCCTTATTGT TGAATGAAAA CTTAATGGTC TGATTTGTCT
CACCCACATG GATTGGTCTC GTTAGGATGT TCTCCGTCAT GCATTTGAAC GTGCCGTCGT
TCGTCGTATG ATGTCAGATG TCCCCTGGGG AGTTCTTTTG TCGGGAGGCC TTGATAGTTC
ATTGGTGGCT GCCATTTGTG CACGCCACAT TGGTCGCCGC AGTGCATCCT TTCCCAAACT
TCACTCGTTT ACCATAGGTT TGGAAGGTTC ACCCGACATT ATTGCCGCGA AGAAGGTCGC
TGACTACTTG GGAACCATTC ATCACGCTTA CACGTACACC ATTCAGGAAG GTGCCGATGC
TGTGCGAGAC GTCATCAGGG CGCTCGAAAC ATATGACCTC ACCACGGTCC GAGCGTCGAC
GCCAATGTAT TTGATGAGCC GTAAGATCAA AGCCATGGGC ATTAAGATGG TTTTGTCTGG
AGAAGGCGCT GACGAAGTCT TTGGTGGGTA CCTTTATTTT CACAAAGCGC CCAACGCGCA
GGAGTTCATG GACGAGACGA TTGACAAACT CAGCCGTTTG CATATGTATG ATTGCTTGCG
TTGCAACAAA GCAATGAGTG CTTGGGGTGT CGAACCGCGT GTACCTTTTC TAGATGCTGA
CTTTTTGGAA GTGGCCATGA ACCTCGATCC GGAAGAAAAG ATGATCCGTC TCGGTGAAGA
TGTTCCAAAG GAGGACCGTC GTGCCGAAAA GTGGTGTATC CGTAAGGCGT TCGACACCCC
GGACGATCCT TACTTGCCTG ATGACATCTT GTGGCGTCAA AAGGAACAAT TTAGTGACGG
CGTCGGCTAT GGCTGGGTTG ATCATTTGAA GGAAGTTGCG GAGCAGGAGG TGTCTGACCA
GATGTTTGCA GCTGCAAAAA ATCGCTTTCC CCACAACACG CCTACGACCA AGGAAGGATA
CCGCTATCGC ATGATCTTTG AGGAGATTTT CCCGGGCGAA GCGCCGGAAA AGACCGTTCC
AGGAGGCAAA TCGATCGCTT GCTCAACTGA ACGTGCTATG CAGTGGGATG CTTCTTTCGC
GTCTCGGGCT GATCCTAGTG GACGTTCTGC AGGAGTCCAC AGTGCAGCGT ACGACGAAGC
CTTTGAGGCG GATACCAAAG TTAGCGAGCC CGCTATCAAG AAGGCTAAAG CGTAGGCCAA
TACTTCGTTT CTAATCTTGG TTTGCAGTCA GGCAAGCACT GATAATCATT AAGTAATGCA
ACGGAAAGTA TAATTTGAGA AGCATCTTTC ACCTGTCAAT AGCTAATATT TAGTATTTCC
GTCATTTAAA TT
 
Protein sequence
MCGIFAIFSS SLPESDLRRE LISCSSRLRH RGPDWSGYKV IEANVEAGIP LSHGIAHERL 
AIMDPESGSQ PLVSHDGSLI VAANGEIYNY KELYETLETP YKAKTGSDCE VILPLYEQFG
ASIEIPRLLR GMFSFILYDR HNDSFMIVRD HLGITPLYIG WANDGSVYVA SEMKSLVGHC
SKFQNFPPGH IFCSKGEHAG EFQRWFNPSW APEMKPGVPL PKQPYQADVL RHAFERAVVR
RMMSDVPWGV LLSGGLDSSL VAAICARHIG RRSASFPKLH SFTIGLEGSP DIIAAKKVAD
YLGTIHHAYT YTIQEGADAV RDVIRALETY DLTTVRASTP MYLMSRKIKA MGIKMVLSGE
GADEVFGGYL YFHKAPNAQE FMDETIDKLS RLHMYDCLRC NKAMSAWGVE PRVPFLDADF
LEVAMNLDPE EKMIRLGEDV PKEDRRAEKW CIRKAFDTPD DPYLPDDILW RQKEQFSDGV
GYGWVDHLKE VAEQEVSDQM FAAAKNRFPH NTPTTKEGYR YRMIFEEIFP GEAPEKTVPG
GKSIACSTER AMQWDASFAS RADPSGRSAG VHSAAYDEAF EADTKVSEPA IKKAKA