Gene PHATRDRAFT_34199 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_34199 
Symbol 
ID7197906 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011672 
Strand
Start bp1353815 
End bp1354959 
Gene Length1145 bp 
Protein Length321 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002178687 
Protein GI219115783 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTTCTA CAGAAAAGGA GGGTACCCAA GCTCCTCCAC AAAAAGCCAA AAACGCTCTA 
AGAGTTCCAA GAGCGGCAAA GGAAAGCCTA AACGCAGCAA GAGCTCAAAG AAAAGTGGGA
AATCGTTTTA TTCAGGCGGT GGATTCTTTG TACCGTCATC ACCGCAAAGT CCAGTTCTAC
CGGTAAACTC ACCGGTACCG GCACCACGGC ACCACCGTCA GTATCGAGCA ATCCCACCGG
TGAGCCAACA CGTTCCATTA GCACTGAGTT TCCAACGGCG TTGCAAGAAG TGGAAGTGCC
AACCTTCACC CCCACTGAAG CATTCCTTCC GTTTGAATCT TTAGCGGAGC TGGTGGAGGC
CGTTGACGAA TACGTCGACG ACCGAAGCCC CGAGTCAAAC GTTGCACGCA TACGAGGATT
TCCCATCAAT GCCTGGGACG TCAGTCAGTT ATCGGATTTT AGATTTTTGT TTAGTCCTAG
TTCGGAACGT TCCTCTTCGT TGGGCGACTT CAACGAAGAT TTGGATCAAT GGGACATGTC
TAACGCAGTA TCGCTTGATT CAATGTTTCT GAATGCAATG TATGTACAAT ATCGAGGAAA
GAGATACCGT TTGCTCCGGT AGTCCTAGCC CTTTAGGACT TTTCCCGCTT CCACTGACTC
TGTATTTGTT TTCGCCTCAT AGCGCTTTTA ACGGAGATAT TTCCACATGG GATACCCGAA
ACGTGCAAAG CGCTACATTT CTATTTTCCG GGGCTGTATC CTTTCGTGGG GACTTGAGTT
CATGGGACAC GTCTAGTTTT CAGAGCGCGT TTGGAATGTT TCGCGATGCG TCTGCATTTG
ATTCTGACAT TGGTGGATGG GATGTATCGA ATGTACGTGA CATGGATGAC ATGTTTTTGA
ATGCTGCATC TTTCAATCAA GATATTTCAT CCTGGGACGT GTCTGGCGTA GTCGACGCAG
CGGGTTTTAC AGATACATTT GCCGGAGCAG CCTCTTTTGA TCAAAATCTA TGTGCCTGGG
GAGATTTGAT CCAAGGTGAT AGTCGTCAGG TTGAGCGAAT GTTCATCAAC ACAGGCTGCG
CCTCAGCGTC AGATCCTGAC CTGGAGTTTT TTCCCAAAGG CCCATTCTGC TCCTTCTGTG
GTTGA
 
Protein sequence
MSSTEKEGTQ APPQKAKNAL RVPRAAKESL NAARAQRKVG NRFIQAVDSL YRHHRKPTRS 
ISTEFPTALQ EVEVPTFTPT EAFLPFESLA ELVEAVDEYV DDRSPESNVA RIRGFPINAW
DVSQLSDFRF LFSPSSERSS SLGDFNEDLD QWDMSNAVSL DSMFLNAIAF NGDISTWDTR
NVQSATFLFS GAVSFRGDLS SWDTSSFQSA FGMFRDASAF DSDIGGWDVS NVRDMDDMFL
NAASFNQDIS SWDVSGVVDA AGFTDTFAGA ASFDQNLCAW GDLIQGDSRQ VERMFINTGC
ASASDPDLEF FPKGPFCSFC G