Gene PHATRDRAFT_33046 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_33046 
Symbol 
ID7197274 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011670 
Strand
Start bp1397495 
End bp1399589 
Gene Length2095 bp 
Protein Length670 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177821 
Protein GI219112139 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.531262 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATCGA GCAACTTGTC GCAGCGCAGT CTCGACTGCC CTTGTCGGAA TTTCTCGAGC 
TTGGAACGTA TTTCTCAAGT GCTCGCATCG AAAAGCCAAA CACAAACTCT GATCGATTGG
TTGGCAGGGA AAAGCGACGA ATCTTCTTTT GAAGAACGAC GACCTTTATC TTGCAGCGAA
AAATATCGCT GCAAGTTTCG CGCAAATACT TCGCTATTCA TATTGAACGG AGTCCGCGAT
GCTTGTCGGC CGTTTTTGGA GTCCGTTTTG GTCTCAAACG AAGGTCCTGC GAACCCGCGA
CCCTCTCCAA TCGTGCCGTA CGAAGATGCC TTTCCAACTT TACTTTCTAC CCCTCCTCCA
AATCTGCCTT TTTCTTCCAA AAGCCTTCCA ACACACTCTT TCCAATATGC TCGTTCAGCC
ACTGACAAAG CTGAGAAATC AAAACCAAAG CGGCGTGTTC GACCTTTGAC AATTTCAACG
TCCGGACCAT CGGCTTGGGG CGAAGGAAAT TTGGTGACTG CGTCACTGCC AGTCAGCACG
ACTTTGCAAC AGTGTGATGT ATGCCATCCA AAACTACCGT TAGAAATGCC AATCCCTCAA
GGTAAATCAA TACCGAAGTC AGCGCGGTCT GGCGCCACGC GGAAAACACC GACAAAGCAA
GAGAAAAGCG CCCACCAACG GATGGAACAA ACCAGCATGA CTGAGGTGCA AAGGCTAATA
GAAGTATACT GTGAGCTGGT GAAGAGTGCA CTCGTCCCCT CGACGGCGTT GGAGTTGCAT
TTGCTGTGTC GTATGTTGTC TGTTCCGATC AGCGATCTAT CGAATCCTCA GAGCTTGCAG
GATTCCAGTT TTTCCTGTGT TTTCTCCTCA GCGACACAAT GTACATATTT TGCGAGGGAA
TGTTTGCTTC GCTTGTCTGG TATCCTCCGT GGCTACGGTA AATCCTTTCT GATAGAACTT
GTTCGATGCG CTCCATTCCG TGTTCACCTT CCCGATATGG TTTCGGACTT TGAAGCACTC
ATACGGCTCG ACGCGACGCG TGTAGATGCG TCTCTCGACT TTTCACCAAA CAGCCAAATC
CCTTTGTTGA CCCTTCCTTT CAATGAGGAA AGAGACTCGC GACACAACTA CAAGACTCGT
GAGGAACAAG CCCTTTACAA GAACCGAGAG GAAAGCCGCG ACGCATTTCT CTACCAGTTT
AGATCCTTTT TAAATGTGAG AGGGAAATTA GTTGATACAG GAGCCGCGGA AAAAGAAATT
AAGAAAATCG AACGGTCTTC TCGAACTGTC GTAAACGGAG TGATGGGCGA CAACGTTCCT
TGGTTTGCGG ATTTTTTTTG CGATTTATTG TTACAGACTG GTCTAGTACC TCTGCAAGAG
ACAGATACAG ACCTTCTTCG TATCGCCGGT AAGGAAAAGC TCCAAAAACT GCACAAGCGC
TTTGGATCCA TGGTTCCTCT TACTGAAAAA AGTACCAAGA AACTCGTTGC CGAGCGGCAC
TTTAGAGAAT CTATCCCGGC GGTGGCAGCA CAGCAGTTCT TTCCTGGCCA TCAGGAGTTT
TTCTTCCTTT TTCTTATGTC CGCCGACTCC TTCATCTTTG GTTTGCATTT ACGCCGAGTA
TTGGCCCAGA ATATTAAGAA ACTCGCTGCC GCTACAACGG TAAGGGATTT TGAAAGGCAG
ATATTGAAAA TGCAATTGCT AGGTCGTTTT GTTGGTGTAC TCTTCTTCGC TCCGAATTGG
GTTTCATCGA CCACAAAGCA ACAATCTCCG TCTGTACTTT TATCAACTGC TCTGTGCGAA
ATCTCAATTG CTGGTTTACC AATGCTAGAA ATGCTCGGTG AAGCGTGCCA AAATGGTAAC
CTCGTGAGTT TCGTTCCATG GGTGGTTGAA GTTCTTAAGA TGTCCATCTG GGACAGAGGG
GCGCGCAACA GTTTTGAGGT GCGTCAGCTT CTAGCGTATT TGCGGCAGAT ACAATTATTG
TATTGCAGAA GGGATGAGAG AACTGAAACA ACCAATTCAG TACGCGAGCT CATTTTTGGT
AGCATCGAGG CTATGCTTGA TGAAGTTCTT GGTCTTGGAC GGACCACGAG TCTAG
 
Protein sequence
MKSSNLSQRS LDCPCRNFSS LERISQVLAS KSQTQTLIDW LAGKSDESSF EERRPLSCSE 
KYRCKFRANT SLFILNGVRD ACRPFLESVL VSNEGPANPR PSPIVPYEDA FPTLLSTPPP
NLPFSSKSLP THSFQYARSA TDKAEKSKPK RRVRPLTIST SGPSAWGEGN LVTASLPVST
TLQQCDVCHP KLPLEMPIPQ GKSIPKSARS GATRKTPTKQ EKSAHQRMEQ TSMTEVQRLI
EVYCELVKSA LVPSTALELH LLCRMLSVPI SDLSNPQSLQ DSSFSCVFSS ATQCTYFARE
CLLRLSGILR GYGKSFLIEL VRCAPFRVHL PDMVSDFEAL IRLDATRVDA SLDFSPNSQI
PLLTLPFNEE RDSRHNYKTR EEQALYKNRE ESRDAFLYQF RSFLNVRGKL VDTGAAEKEI
KKIERSSRTV VNGVMGDNVP WFADFFCDLL LQTGLVPLQE TDTDLLRIAG KEKLQKLHKR
FGSMVPLTEK STKKLVAERH FRESIPAVAA QQFFPGHQEF FFLFLMSADS FIFGLHLRRV
LAQNIKKLAA ATTVRDFERQ ILKMQLLGRF VGVLFFAPNW VSSTTKQQSP SVLLSTALCE
ISIAGLPMLE MLGEACQNGN LVSFVPWVVE VLKMSIWDRG ARNSFEYASS FLVASRLCLM
KFLVLDGPRV