Gene PHATRDRAFT_32803 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_32803 
Symbol 
ID7197303 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011670 
Strand
Start bp833009 
End bp835411 
Gene Length2403 bp 
Protein Length800 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002178015 
Protein GI219112527 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.028074 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTAATAG CGCTCAGAAG AATGATAAAG CTGCGGGCAT CTATACTTGC GGTGTTGTTA 
ATATCGTGTT ATGTTTCGGC TTTCCAACCG AATCCGATTC ATTCCAATAG GAGGGCGACG
AGCAGGGGTC CAAGCACGTC CTGTGAAAGG AACATTCCGA CTGGCTTCGG TCGTCCGAGT
CCATCAAGAT TTGACCTTGA CTTTCAGAAA CATCAATCTC AGTATGGAAA GCTGTTTTCT
TTGAATAAAC TCGTTGAAGA TATCAGCAGC AGATCTCCGG GCCAACTACC CTCCACCGTC
TTTGTCGGTG GAAAAGGAGG AGTCGGGAAA ACCACCGTGT CATCGGCGCT GGCCGTTAGT
TTGGCTTCAG CCATCGAAAA GGATTTGAAG GTTCTGATCG TATCTACCGA CCCTGCTCAC
TCTCTAGGTG ATGCCTTGGA TGAGGATTTG CGCAAGAACA ACGGTCGTCC TGTTGCTATG
ACGGACTCTC TGACAGGTGG TAGATTGGAC GCATGCGAAG TCGACGCTTC GGCTGCGCTC
GAGGACTTTC GCGAAAACAT TGCTGCCTTT GATATCGATC GACTAGCTGA TGCTCTCGGT
GTTTCTGTGG ACTTACTGGA AAGCTTCGGT TTGAGAGAAT TCAGTGGTCT TTTGAACAAC
CCTCCGCCAG GTTTGGACGA ACTCGTGGCT TTGTCGAATG TATTGGATTC GGAATCTGTG
GCCAAAGGTT ACGACGTGGT AATTGTGGAC ACCGCACCCA CCGGACATAC TTTGCGACTG
TTAGCTTTGC CGAAATTCTT GGACGGCCTA TTGGGGAAAC TTATTAAAAT TCGCTTGCAA
CTATCGGGGC TGGCGTCCAC TTTACAAACC TTCTTTGGAA ATGACGAAGC ACAGAAACGT
GCAAAAAGCA TCGACGATGC CGTCAACCGA TTGGAGCAGT TTCGTCGAAA GATGAGTAAT
CTTCGCGAGC GGCTTCAAGA TTCCCAGTCG ACGCGTTTTG TTGTCGTGAC AGTCCCTACC
AAGCTCGGAG TTGCCGAATC GAAACGCCTT GCCGCCGAAC TCAATTATCA AGGAGTAAGT
ATCACGGATA TAGTCGTGAA CCAATGTGTC GGTGGGATAG ATGACGATGT GGACTCTGAA
GCTCTACAAC AATATTACGA TCGACGAAAG GATGGACAGA AAAAATGGAT CGCCAAGCTT
GAAGAAGCTG TTCAGGACGT GAGCTGTAGT GAAGAGTACA AAGCAAATGG TAGTTCCGCT
CCTATTGGCA TTACCAGGGT TCCATTTTTC GATGTTGAAT TGGTCGGAGT GCCCGCATTG
GGATACCTTG CTGCACAATG CTTTACAGAA AACCTCAGCT TTGCGCATTT GATGAATGTC
GATAGCTCGA ATGAGCCACG AGTTGTAATT TGTGGGGGGA AAGGAGGAGT CGGAAAGACA
ACGACTTCGT CGGCACTAGC GGTTTCGATG GCTTCGAAAG GCCACAAAGT AGCGCTGATA
AGCACGGATC CGGCTCACAG TATTGGTGAT GCTATCGAAA TAGACCTCTC TGGTGGAAAG
CTTGTGGATG TTCCGCTAAT AGGAATCCCG ACGACGGATG GCTCACTGTC TGTTTTAGAA
ATCGATCCGT CGACAGCAAT CAATCAATTT AAAGGTGTTG TGGATCAACT CATTGGTGGA
GACGATAATC CTTCAGATGC TGGTCTTCGA AATACGCTGC GTGACCTACA AGAGGTGTTT
GATACTCTTC CGGCAGGCAC GGACGAGGTG GTGGCTTTGG CGAAGATTGT CAATCTGGTG
AAGAAGGGCG GATTCGACCG GATTGTATTG GACACGGCCC CAACAGGGCA TACACTTCGA
ATGCTGAGCA CACCAGGCTT TCTTGCCGAG CTTATAGATC GCCTGCTTAT TATAGCCGAA
AAAGTGAATT CGAATACGGC AATAAAAATG TTAATCGGAA GTTCCGCACG GTCAGAGGAC
ATCTCAAATG CTGCAGCAAC AGCAAAGTCC ACTCTTCTGT CCTTCCAGCT CCAAATGTAC
GATCTCGAAA ATTTGTTTGC TGATGCTGCA CAAACGGAAT TTCTCATCGT AACAGTGCCC
ACGGAGCTTG CCGTAAGGGA AAGCATGCGA CTTCTAAATG ATCTGACGTT TGAGTCCCCA
GACATGCCTA TTAAATGCCG AAACATTGTG GCAAACCAAG TTCTTGGGGA CGATGGAAAC
GATGCAAAGA CTTTTCTGGA TCATGTGGGG CAGACTCAAG CAATATCCGT AAAAGACCTT
GAAGATGCTG TTTCGAGTTA CCCTGCACCT CCTCTAATTA CCAAAATTAA GTACCTGGAC
ACGGAACCCC GCGGTGTGTT TGGACTTAAG GTATTGGCCG ACGAACTACT GAGAGAGATA
TAG
 
Protein sequence
MVIALRRMIK LRASILAVLL ISCYVSAFQP NPIHSNRRAT SRGPSTSCER NIPTGFGRPS 
PSRFDLDFQK HQSQYGKLFS LNKLVEDISS RSPGQLPSTV FVGGKGGVGK TTVSSALAVS
LASAIEKDLK VLIVSTDPAH SLGDALDEDL RKNNGRPVAM TDSLTGGRLD ACEVDASAAL
EDFRENIAAF DIDRLADALG VSVDLLESFG LREFSGLLNN PPPGLDELVA LSNVLDSESV
AKGYDVVIVD TAPTGHTLRL LALPKFLDGL LGKLIKIRLQ LSGLASTLQT FFGNDEAQKR
AKSIDDAVNR LEQFRRKMSN LRERLQDSQS TRFVVVTVPT KLGVAESKRL AAELNYQGVS
ITDIVVNQCV GGIDDDVDSE ALQQYYDRRK DGQKKWIAKL EEAVQDVSCS EEYKANGSSA
PIGITRVPFF DVELVGVPAL GYLAAQCFTE NLSFAHLMNV DSSNEPRVVI CGGKGGVGKT
TTSSALAVSM ASKGHKVALI STDPAHSIGD AIEIDLSGGK LVDVPLIGIP TTDGSLSVLE
IDPSTAINQF KGVVDQLIGG DDNPSDAGLR NTLRDLQEVF DTLPAGTDEV VALAKIVNLV
KKGGFDRIVL DTAPTGHTLR MLSTPGFLAE LIDRLLIIAE KVNSNTAIKM LIGSSARSED
ISNAAATAKS TLLSFQLQMY DLENLFADAA QTEFLIVTVP TELAVRESMR LLNDLTFESP
DMPIKCRNIV ANQVLGDDGN DAKTFLDHVG QTQAISVKDL EDAVSSYPAP PLITKIKYLD
TEPRGVFGLK VLADELLREI