Gene PHATRDRAFT_54834 details


Gene Information

Locus tag: PHATRDRAFT_54834
Symbol:
ID: 7203248
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011683
Strand:
Start bp: 768938
End bp: 770298
Gene Length: 1361 bp
Protein Length: 392 aa
Translation table:
GC content: 52%
IMG OID:
Product: dehydrogenase
Protein accession: XP_002182453
Protein GI: 219124316
COG category:
COG ID:
TIGRFAM ID:
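
The Start bp, End bp, and Gene Length values in this record agree with each other if the coordinates are read as 1-based and inclusive. The following is a minimal sketch of that check, assuming that coordinate convention; the function name span_length is illustrative and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be greater than or equal to start_bp")
    return end_bp - start_bp + 1


# Values copied from the record above.
print(span_length(768938, 770298))  # 1361
```

The result matches the reported Gene Length of 1361 bp.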


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.286619
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
GTGGCCTTTC GGTCTACCCA AAACGATCGA TCCCGGACAA TCAGGATGAA TGCGACCAAG 
AATCTGTTAC GTTCTCGTGT CGGAAAATAC TCGCGTGCAT TGGTACGACC AGTAGCAAAG
CTTACGGGAA ACGCGTTCCC GTTGCGTGCC GACTTTCGTG GAACGAGTCA GTCCACAGTC
GTTCCGCATC ATCGATTCTT TGGCACCGAC GCGCACGTTA ATGTCGTTCA CGTTTCGGTC
CAAGAAGCCC GAGAAACAAC CGCCAAGGCG CTGCAAATGA TTGGCTGGGA TCACGAAGAT
GCAGCTCTCC AGGCAGAAAT TATGACGGCC GCCGAATTGT GTGGCAACAA TCAGGGACTC
GTCAAAATGT ACCAACCCGC ACTCATGGCG CCGTCGCCCA ACGCCGGGAA ACCAACTGTC
GAACGCGAGA CTCCCACATC GGCTGTCGTT AACGCAAATC AATCACCCGG GATGCTCGCT
GCCGTCAATG CTGCCGACTT GGCGGTCCAC AAAGCAACAA CCAACGGTCC TATTGCCATC
GTTACCTCCT ACAATACCTC TACTTCGTCG GGACAGCTGG CCTTTTATGT AGAACGCATG
GCACGAAAAG GAATTATTGG GATTGCTATG GCCAATTCAC CAGAGTTCGT GGCGGCGGCT
CAAGGAGGAA AGCCCGTCTT TGGGACCAAT CCCATCGCCG TGGGAATTCC ACAAAAAGAC
GCGGTTCCCT TTACGGTACG GTAATATTGA CTGTGAACAC TGTACCGGAA AGCGACCCTA
CTTTCACGTC TTGACACACC GTTTTCTCAT CTATTCTTTC CTTTCGTTAC CATAGTTTGA
CATGGCGACT TCGGCAATTG CCTTGTTTGG GTTACTGACT TCCAAAGCGC AGAACACGCC
GCTGCCATCC AATGTTGCCT ATGGTAAAGA CGGTGGTTGG ACCACTGATG CCGCAGAAGT
GTTGGACGGC GGTGCAATTG CAACCTTCGG TGGACACAAA GGTGCAGGGC TGGCCTTGTG
TATTGAGCTA TTGGCTGGGG CTCTCTCGGG AAGCGCAGTA CTTGGACTTG TGGAATCCAA
AAAGTCGGCC AAGTCATGGG GACATTTGTT TATTGCTATC GATCCCAATG CTTTGACGGA
CGATTTTGAA AGCAAGACAG CCTCTGTTAT CGCCGCGGTG AAAGCGTCCG GTGACAACAT
TCGTATCCCG GGAGAGCGGT CCGCAAACAT GTCGGAGGAA CGAAAAGCTG TGGGAATCAT
GCCCGTACCA CAGAAGATTT GGGAATCCAT TGTCTTGACT GCTGAGCACG GCATCCAAAA
CTAGACAATA ATTTGAAAGG TAAAGAAAGA CTTTCGCGTC T
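
The record reports a GC content of 52% but does not say how it was computed. As a rough sketch, assuming GC content means the share of G and C among the A/C/G/T bases, the function below can be applied to a sequence written in the space-separated blocks of 10 used above. The name gc_percent is illustrative, and the database may handle ambiguous bases or the measured span differently.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq.

    Whitespace, line breaks, and any other characters are ignored,
    so the blocked layout used in this record can be pasted in directly.
    """
    bases = [c for c in seq.upper() if c in "ACGT"]
    if not bases:
        raise ValueError("no A/C/G/T bases found")
    gc_count = sum(1 for c in bases if c in "GC")
    return 100.0 * gc_count / len(bases)


# First two blocks of the gene sequence above: 12 of 20 bases are G or C.
print(round(gc_percent("GTGGCCTTTC GGTCTACCCA"), 1))  # 60.0
```

Passing the full gene sequence instead of the two-block sample gives a figure to compare against the reported 52%.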
 
Protein sequence
MNATKNLLRS RVGKYSRALV RPVAKLTGNA FPLRADFRGT SQSTVVPHHR FFGTDAHVNV 
VHVSVQEARE TTAKALQMIG WDHEDAALQA EIMTAAELCG NNQGLVKMYQ PALMAPSPNA
GKPTVERETP TSAVVNANQS PGMLAAVNAA DLAVHKATTN GPIAIVTSYN TSTSSGQLAF
YVERMARKGI IGIAMANSPE FVAAAQGGKP VFGTNPIAVG IPQKDAVPFT FDMATSAIAL
FGLLTSKAQN TPLPSNVAYG KDGGWTTDAA EVLDGGAIAT FGGHKGAGLA LCIELLAGAL
SGSAVLGLVE SKKSAKSWGH LFIAIDPNAL TDDFESKTAS VIAAVKASGD NIRIPGERSA
NMSEERKAVG IMPVPQKIWE SIVLTAEHGI QN