Gene PHATRDRAFT_48468 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_48468 
Symbol 
ID7203697 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011685 
Strand
Start bp585662 
End bp587130 
Gene Length1469 bp 
Protein Length431 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182865 
Protein GI219125181 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CATCCCTCCC GTCCTCCAGT ATCGGATCCT AACGACTCCA TCGGCTGCGC ACACAAAATT 
CCTCCCACGC AAACCTTTCA TAATCATAAA AAGTCAGATT TACTGTTAGA TCGAAGCAAT
CATGATGCCG TCCGCCTTTT TTGTGCTGGG TACCATCAGT CTTCTTTTAA GCGAGTCTCA
GGCCTGGACG ACTCGCCCAG CGCAGACATC AATTAGACTT TTCTCGCGAC TTCATTTATC
CCGACCCTCC TCCGAGTCAT GTTCGAAACA AACGAATGTC GATAACGAAA GACGCTCGAA
GAGATTTGAT AGTTTCTCTC GACGCGAGTT GTTCGCGGCT ACCGCTGGTG CAGCTGCTAG
CATTGCCTTG GGGCCGTCCA CAGCGGCATT CGCGCCAGCC CCGTCGATTG TTACAACCGC
CGCAACGTGC GACACCACTG TATCAGTATG GCAACGCGGT GACAGAATCG TATACATCCT
GGGAACAGCA CACATTAGTG AAATATCCTC TGATCTCGCT GGCCAACTAG TAAAGGATGT
ACATCCCAGT GCCGTTTTTG TCGAGCTCGA TCTTAAGCGG GTTAGTGGAG TTACGGTGTC
GCCCGGTACA CCCGTGACTA GTCGTCTACC CATTTCAACC GACGTAACAG AGTTACCTGC
CACAGGAGAA GGTGCGTCTA CAAAGCAATC GAAAATTATT GTCTCCGTAC CAGCCCTTCC
CGACTCTGTC GCGTCACGCC CCACCGAATC GACTGGTATT GCCTCGCTAG CGGCAACCAC
CAAACAAGAC GATGAGCTCG CAAGCGCCTC ACCCGTCCTT ACAGAATCCC CACGCCGCGG
GCTTGGTCAG CGTATGCTTG GTTTTGGTGC AGCTGCTGTC GGTAAGGCCA TTCAAGGCAT
GTACAAAAAC TTGAACGACT CCGGATTCAA GCCTGGCGAA GAATTCGTCG TAGCTGTACG
GGAAGGGCAA AGAATTGGGG CCGACATAGT GCTAGGTGAC CAAGATGTTG AAGTTACGCT
TCGTCGTATG ACCCAAGCTC TAGCTCAAAC GGATCTCAAT AAGCTCCTTG ATCCTGATTC
GGAACTAGAA CGCGGCATGC GGGAGCTCAT GGGAGACTCG GATCCGTCTT TGGCGAGTTC
GCCGGACGCC TTTAAGTCAG AACTCTCTAC CTATGTGGAG AATATGAAAA CACGGGATAG
TGTTCGAAAG ATAATGGCTC AGCTCCAGAA AGTTGCACCC GCACTGGTAC AAGTTATGCT
AACAGAACGC GATGCTTACA TGGCGGCGGG CCTCGATACA CTGAACCAGT TTGAAGTCAT
AACTGCCGTC ATGGGTATCG CGCACATGGA TGGCGTCGAA CGCAATTTGC AATCACAAGG
ATGGAAACAA ATGCGCCCCA GTTGCCCCCG CGTGTAAGCT CCTATTTAGG CTAGACTCTG
CGATAACCAT GATAAACTCT CCAAATCTT
 
Protein sequence
MMPSAFFVLG TISLLLSESQ AWTTRPAQTS IRLFSRLHLS RPSSESCSKQ TNVDNERRSK 
RFDSFSRREL FAATAGAAAS IALGPSTAAF APAPSIVTTA ATCDTTVSVW QRGDRIVYIL
GTAHISEISS DLAGQLVKDV HPSAVFVELD LKRVSGVTVS PGTPVTSRLP ISTDVTELPA
TGEGASTKQS KIIVSVPALP DSVASRPTES TGIASLAATT KQDDELASAS PVLTESPRRG
LGQRMLGFGA AAVGKAIQGM YKNLNDSGFK PGEEFVVAVR EGQRIGADIV LGDQDVEVTL
RRMTQALAQT DLNKLLDPDS ELERGMRELM GDSDPSLASS PDAFKSELST YVENMKTRDS
VRKIMAQLQK VAPALVQVML TERDAYMAAG LDTLNQFEVI TAVMGIAHMD GVERNLQSQG
WKQMRPSCPR V