Gene PHATRDRAFT_11009 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_11009 
Symbol 
ID7197628 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011672 
Strand
Start bp1082486 
End bp1084393 
Gene Length1908 bp 
Protein Length535 aa 
Translation table 
GC content46% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002178359 
Protein GI219115127 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTCCGGCAGT ATCAACTGGA GGGAATCGCT TGGTTACGTT TTCTACATAC ACTCAGACTG 
AACGGCGCTC TGTGCGATTC TATGGGACTT GGGAAAACAC TACAGGCTTT GATTTGCGTT
GCCATTTCTC ACGACGTAGT CCACCATGCC GCACCAGATA GCAAACCAGT CTCCATCGTT
GTCTGTCCTT CTACTCTGGT TCGACACTGG ATCGCTGAAA TCAACAGATT TTTCAAAAGC
GACGATCCGG TTTTCTTTCC TCTCGAGCTT TCTGGTAGTA GCACGAGTCG TCGAGCAGTA
TGGGAAAAGG GCTTAGTATT TTGCAACATA ATTGTTACAA GCTATTCTGT TCTACGGAGC
GATATACGAA TGCTTGCATC GCAAAGCTAT CACTATTGTG TGCTCGATGA GGGCCACCTC
CTCAAAAATC CAAAGACAGG TACGCTGGAT TATCTCGAAT TTGAAAACTT GACAACCACG
TTCTGACTTC CGCATCGTTC CCTCGTTGCA CAGAGACGGC GAAGGCGTCT CGTCAGCTGC
GGTCGAAGCA CCGTCTCCTT TTGTCAGGAA CGCCGGTGCA GAATCATGTT CATGAACTGT
GGGCTGTATT TGACTTCTTA ATGCCAAATT TTTTGGGATC CTCGGTATTT TTCTCAGAAA
AGTATGCCAG AACTATATCG AAAGGACAAG CTCCTGGCGC ATCTGTGAGA GAGATCAGCG
AAGGAATAGA AAAGTTGAAG ACGTTACATC AGCAAGTACT ACCGTTCATA CTCCGGAGAG
AGAAACAACA AGTTCTTAGA GAATTACCAT CAAAATTAGT CACTCAGATT GAAGTTCCCA
TGAGTGATCT GCAGCGAAGG CTCTACACTG ATTTCTGCTC GTTTGCAGAT GTCCAACAGT
CACTCCGGGC TCTAGATCGC GCTGCGAAGG ACGATCTCGG CGATAGATGC CTGGAGCAGG
CAGGGCGTAG CTCGCTACAG GCCCTTTTGT TTTTAAGACT TCTCTCAACT CACCCATGGC
TTGTCAGATC CGCCATACCA GTCGCCTCGG AAATCAGCGA CAACGATTGG CTTCGTTTTG
ATACCTCCGG TAAAATCAGA GCGTTGGCTG ACTTACTCCG AGAGCTTAGT ATTTTCACCG
ACGACTTAAG CGCTGCCGAT AACGATTCGT CACTTTTGTA CTGCGAGGAC GACCATGTTG
ATGTAGATGT TTATTCCAGC CTCGTCAATT CATCAGACAA TCACATGCAA CCCGCACCCA
CAACCTCCGA AGTCCAATCG CAGACAAAGT GCTTGATCTT CGCTCAGTTC ATTCAAAGTC
TTGATGTTGT GGAAAAGCTT TTATTCAAGC CTCACATACC ATCGCTGAAA TATCTTCGAT
TGGATGGAAG AGTTCCTGCC AGAAGACGCT ATGCCATTGC CGAAGAGTTC AACCGTAACG
ATGAGATCAA GGTTTTGCTG CTAACAACAA GGGTCGGTGG TCTTGGACTA AACTTAACAG
GTGAGTAAGG GTCACGAGCT TCTGTCTCAC TGTCACGGGG ATATGTTTTT CTCATGGACA
TCGTACAATA TAGGAGCGGA CACTGTAATT TTTCTCGAAC ATGACTTTAA TCCTTTTGCT
GATCTTCAAG GTATGCAACA ATTGCGAGAC ATGGAGATAT TTGCATAGCA CTGGCTCAAC
CTTCTATATC ATTTTAAAGC AATGGACCGG GTCCACAGAA TTGGCCAAAA GAAGGCTGTA
TGCGTTTACC GGTTAGTTCT GGTCGACTCA ATTGACCAGA GAATTATGAA GTTACAAGAA
AAGAAGTTGG CTATGAGCGA GGCGATAGTG AACGCCGACA ATTCTACTAT GTTCAGCATG
GGGACTGATC GATTGCTTGA CATTTTCACG ATGAGAAGCG ACCAAGAG
 
Protein sequence
LRQYQLEGIA WLRFLHTLRL NGALCDSMGL GKTLQALICV AISHDVVHHA APDSKPVSIV 
VCPSTLVRHW IAEINRFFKS DDPVFFPLEL SGSSTSRRAV WEKGLVFCNI IVTSYSVLRS
DIRMLASQSY HYCVLDEGHL LKNPKTETAK ASRQLRSKHR LLLSGTPVQN HVHELWAVFD
FLMPNFLGSS VFFSEKYART ISKGQAPGAS VREISEGIEK LKTLHQQVLP FILRREKQQV
LRELPSKLVT QIEVPMSDLQ RRLYTDFCSF ADVQQSLRAL DRAAKDDLGD RCLEQAGRSS
LQALLFLRLL STHPWLVRSA IPVASEISDN DWLRFDTSGK IRALADLLRE LSIFTDDLSA
ADNDSSLLYC EDDHTKCLIF AQFIQSLDVV EKLLFKPHIP SLKYLRLDGR VPARRRYAIA
EEFNRNDEIK VLLLTTRVGG LGLNLTGADT VIFLEHDFNP FADLQAMDRV HRIGQKKAVC
VYRLVLVDSI DQRIMKLQEK KLAMSEAIVN ADNSTMFSMG TDRLLDIFTM RSDQE