Gene PHATRDRAFT_48044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_48044 
Symbol 
ID7203031 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011683 
Strand
Start bp809463 
End bp811515 
Gene Length2053 bp 
Protein Length577 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182460 
Protein GI219124331 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGGGAAA CCCAGCTGAT TAAAAGATCC AACAGACGAT CTTTTTTCAA AGTTGGTGGG 
CTGCATTTTG GCATGCACGG GTTGCTAGGT TTTTCTTCAT TATCTCTGAC CATCCTCGCA
TACTACAGCT ATCCAAGCGA ATACCCAATA TGGATTGGAT TGTCGCAAGT TGCAAATCTA
GTAACGGTTA CCCATGCCCG AAATCTTCTC TCTCAAGTGC CGGCATCCAC ACAAATTTTT
CCTGGTATAG TTGCCCCGCA CAAGGAAGCA TTCCAACGGA CAATCAGCGG AATGCAATAC
CTCGTCACAC GTGTCACATG TCTGGTCTTT CGGGACCATT CCATGGATAT TGGATTCCGT
AGTACCTTGG CACTCTTACT GTGGCGTGCT TGGCCTTTAA TTCCTTCGTA CCAAGCTGAG
TGGCTCAATG GCAATACGTG GATTTTTGTT ATTCCAATGG CCCTCGGTGT CGCAGGAGAT
TTGATCCAGT TCTGGAACGG TGACGTCTTT TCGTCGCGGC AAATTCTGTC AATTCAATTG
CATGGGTTGC TTATGGCCTT TGGCTTCACA CTGGGTTTTC GAAACTATTT ACCCATGCCA
CTTGGTAAGG TCGTTTGTTT TGTATGATCG AAAAGCATCA ACATCCTGTT ATGTTTAAAA
GATAAGACCA ATCGCGTTCT GACTTTTTGT TGCGATTGTT GTTTTATACA GTTTATATGG
GAGCTGCGTT CGGTGTGTGG AAGATCCTGC GTGAGGGAAT AATGACCTTT GAAAACGCGT
CGCGCGAACG GCTCGCATCC CGGATGGAAC TGTATGCACT CCCGGAGTAA ACCGGCCAAC
TTTTTTCGGG GGAGGTGCGA AACTGGTCCA TCCAACCGTA AACATTCGAA AAAACGTTGA
TTTGGGATCG GTAGAAGACA CGACTCCTCT GAACGCCCCC ACACGTTTGG CAACAAACGT
CCGTTTAATG AAGTCGTGGG CCCCGCACAC TGTCAGCAAT TTTGCAACAA GCCTGGTGTC
CCGTGCCCCC CTTCCAGACA CTTTTTCCGA CTCAGTACAA ACGCAAAGCC ATTGCTTCAG
AATATCAATC AGTGGTCACA GCGCCATGAG ATCCCAGATT TTGTTCATAT GCTCCATTGT
GCTTATCAAT GCTGATGCTC TTTGTTTTTG CCAGCGTGAC CATTTTTTCA AACATGGAAA
CGTCCTTTGC AGGAAAGAAA AAGCAGATTT CTTTCTCCCT GGGGAAGCAT TTTCGGTGTG
CACAACAGTT TTGGATGCCA CGGTGGTTGG AATGGACACG TTCACCATCG TGTACAACGG
CGGTAGCTCG ACCGCGACGC CTGTCGCGCA CGGTGGTCTA GTGCACGACA CCGACGTCAC
CCGACTCGAA TACGCCAACG GGCAAGTTAT GATTACCACT ACAATCGTTG ACGAAAATTT
TGCTTCGCAA CACTCTGAAA TTGAAATCGA CGAGATGGTC GCCGTTACCA TATTGCAAAA
CATTGGTGAA ATCCGTCAAT CCTCACGACA CCTTATTCAC TATCACGGTT TACCGCGGAA
GGGAGATGAC ATTCAAGCTC GACCAAATCT GAGACGGTTG CAGCCAGGCT TAATTTCCAA
CGACTCATTG ACACCAGCAG AAAATTCGAC TCGGCTTGTG CCATCGCCTC CATCTCCTAC
AACCAGGACC GTCTCTTACC CCGATTGGAA GATTCTGGAG TGGTTGGATG AGTTACCCAC
AGGGATGGCC TATTTTATCT GGGTTGTCCT GGCTGCCATC GTACTGATCA CTGGCTTTTG
CTGTGTACTA TTTTCGGTCC TTCCCCTTAT TTACCGCTTG GAATCTTTTC GAGCTCACCG
TGCTGAACAG CTGGCGATTG CCCGAGCTTG CGCAAAGATG GATGTGTTTA CGAACGAAAA
TCTTACACAA TGCTTTGGAA GCAACTGGTA CAACTTATAC ATTGACGGTA CGCTGCCACT
GGAAGCGATC AAATTTGATG ACGGCCTACT GCGTGTGGAA CGAAACCGCA AGCGTTACGA
GCGCATGATG GAG
 
Protein sequence
MGETQLIKRS NRRSFFKVGG LHFGMHGLLG FSSLSLTILA YYSYPSEYPI WIGLSQVANL 
VTVTHARNLL SQVPASTQIF PGIVAPHKEA FQRTISGMQY LVTRVTCLVF RDHSMDIGFR
STLALLLWRA WPLIPSYQAE WLNGNTWIFV IPMALGVAGD LIQFWNGDVF SSRQILSIQL
HGLLMAFGFT LGFRNYLPMP LVYMGAAFVN RPTFFGGGAK LVHPTVNIRK NVDLGSVEDT
TPLNAPTRLA TNVRLMKSWA PHTVSNFATS LVSRAPLPDT FSDSRDHFFK HGNVLCRKEK
ADFFLPGEAF SVCTTVLDAT VVGMDTFTIV YNGGSSTATP VAHGGLVHDT DVTRLEYANG
QVMITTTIVD ENFASQHSEI EIDEMVAVTI LQNIGEIRQS SRHLIHYHGL PRKGDDIQAR
PNLRRLQPGL ISNDSLTPAE NSTRLVPSPP SPTTRTVSYP DWKILEWLDE LPTGMAYFIW
VVLAAIVLIT GFCCVLFSVL PLIYRLESFR AHRAEQLAIA RACAKMDVFT NENLTQCFGS
NWYNLYIDGT LPLEAIKFDD GLLRVERNRK RYERMME