Gene PHATRDRAFT_47468 details

Gene Information

Locus tag: PHATRDRAFT_47468
Symbol:
ID: 7202584
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011681
Strand:
Start bp: 705787
End bp: 707403
Gene Length: 1617 bp
Protein Length: 538 aa
Translation table:
GC content: 52%
IMG OID:
Product: predicted protein
Protein accession: XP_002181789
Protein GI: 219122930
COG category:
COG ID:
TIGRFAM ID:
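
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_length):
    """Number of codons in an unspliced CDS, minus one for the stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(705787, 707403)
print(length, expected_protein_length_aa(length))  # prints "1617 538"
```

Both values agree with the Gene Length and Protein Length fields of the record.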


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.204515
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGGGAGAGC CTACCGATTT GCCCTTGGTA AGCCAAATTC CACCATTACC AAAAGAATGG 
GCTCGATGCC ACGCCTACAT TGATAATAAG CGGCGATATT GCCGACAACA TCCTATCGAA
TTCGAAGACG CGCCGGCAAC GGATCACAAA GCTGATACCG GCAGACCTCG CTACTGCGGC
AACCACCAAC ATCTTCTGGC CAATCGCAAA CGAAAACGTA TACCCTGTCC AGCCGATGCA
TCTCACTCCG TATACGAAGA TTGCGTAGCC AAGCACCTGT CCGTCTGTCC AATGCTGAAG
AAACAGAGAG GGCAGGAGAA GCAATCTTTC TACCGGAAGA ATATAAATAC AGGTGGTTAC
GGCCCCTTGG GCGAGATCGA AGAAATCGCT ACCAGCCAAA TACGGTCAAA GTTGGATCAT
GAAATGAGTC AATTCGCTGA ATCCGTATTA AGGTTACATC AGAAATTGTT TGCCGGCGAC
AATATTCAAG ACCCATCGGT CCTCTCCGCT GAGGATATTC ATCAAGCGAT ACCTACGGAG
GATCTATCCT CTGCAGAATT TGACGCAGGC TTAGCCCAGG CAGTTTCCGA CTATCGGATT
AAGTCGGGTG GGCAAAAGCA TTTGCACCAG CAAGCCAGCC TCGTGGGACA CTTGCGCCGT
ATCGGGGCGC TGCCATCGCT TTCCAAAGGG TCACGCCATA CAGACAAAAT CAAGTCAGTC
GGGAAACACT TGATCTTGGA AGTCGGTGCG GGTCGTGGGA TGACCGGCTT GGTCGCTGCC
GGTGTTTCTG TAGTCCACGG TGACCCAACA GATCTAATCA TGATTGAACG CGCTGGATCC
CGAGGCAAGG CCGATACTGT GCTGCGCAAT GCCCCAATAT GTGTCAAACA GACCACGTCG
GATGTCCCAC CGTATTTGGA TCTCAAAGGG CCTCTATCCT GGTCGCGTGT GCAATGCGAT
CTTTCCCATC TCAGCTTGTC ATCCATCCTT TTGAATGCAG ATTTAGAAGA GAACGACCGT
GTTACGGTGT TGGCGAAGCA TTTATGCGGT GCCGGCACAG ACCTGGCTCT GAAGGCGTTG
GAACCAGTAA AGACGTCGAT TTCCTCCTGT ATTTTGGCAA CATGCTGTCA TGGAGTTTGT
AACTGGCAAG ATTACGTCGG AAGAAAGTTC TTAGTGGACG CTTTCCAGAA AAACTGCCCG
TCACAACCGT TCGGAGCCTT CGAGTTTGAG CTACTTCGGC GATGGAGCAC CGGCACAGTC
AAGGCAGGGG CGGCAGAGAA TCGCCTTGGC GAAAACCGCA TAGAAATCGC TGATTCCAGC
CTTCAAGAGC ATGGTCTCGG CACCGTCTTG GTCGAGAATG CATCTGCTAA TATTTCCAGA
ATTGTGGAAG CGAGCCGTCT ACGATGCGGA GCCCAGGGTC TGGGCCGCGC CTGTCAGCGT
CTCATCGATT ACGGACGGCA GGAATATCTC CGTGGGGTTC TGTTTCCTTC CGCGAAACAT
GCAAATGGAT CAGTCGAGAC GACAAACATT GACATGCTGC ACTATGTTGC TCCCGGTGTT
ACTCCTCAAA ATGCTGCACT GGTAGCATTT TCCGAAAAGA CAGGCCATAT ACCATGA
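
The gene sequence is printed in space-separated blocks of ten bases, so it has to be flattened before any analysis. The following is a minimal sketch of how that could be done, along with a GC-fraction calculation that can be compared against the GC content field of the record. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses only the first two blocks of the sequence.

```python
def clean_sequence(blocked):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence as they appear in the record.
first_blocks = clean_sequence("ATGGGAGAGC CTACCGATTT")
print(len(first_blocks), gc_fraction(first_blocks))  # prints "20 0.5"
```

Passing the full gene sequence through `clean_sequence` should yield a string whose length matches the Gene Length field.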
 
Protein sequence
MGEPTDLPLV SQIPPLPKEW ARCHAYIDNK RRYCRQHPIE FEDAPATDHK ADTGRPRYCG 
NHQHLLANRK RKRIPCPADA SHSVYEDCVA KHLSVCPMLK KQRGQEKQSF YRKNINTGGY
GPLGEIEEIA TSQIRSKLDH EMSQFAESVL RLHQKLFAGD NIQDPSVLSA EDIHQAIPTE
DLSSAEFDAG LAQAVSDYRI KSGGQKHLHQ QASLVGHLRR IGALPSLSKG SRHTDKIKSV
GKHLILEVGA GRGMTGLVAA GVSVVHGDPT DLIMIERAGS RGKADTVLRN APICVKQTTS
DVPPYLDLKG PLSWSRVQCD LSHLSLSSIL LNADLEENDR VTVLAKHLCG AGTDLALKAL
EPVKTSISSC ILATCCHGVC NWQDYVGRKF LVDAFQKNCP SQPFGAFEFE LLRRWSTGTV
KAGAAENRLG ENRIEIADSS LQEHGLGTVL VENASANISR IVEASRLRCG AQGLGRACQR
LIDYGRQEYL RGVLFPSAKH ANGSVETTNI DMLHYVAPGV TPQNAALVAF SEKTGHIP
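
The protein sequence can be related back to the gene sequence by translating it codon by codon. The record leaves the Translation table field empty, so the sketch below assumes the standard genetic code; the `translate` function and the table construction are illustrative only. The examples use the first five and the last three codons of the gene sequence.

```python
from itertools import product

# Standard genetic code (an assumption; the record does not name a table).
# Codons are enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna):
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGGAGAGC CTACC"))  # prints "MGEPT"
print(translate("ATACCATGA"))         # prints "IP"
```

These fragments match the start (MGEPT) and end (IP) of the protein sequence listed above.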