Gene PHATRDRAFT_47967 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47967 
Symbol 
ID7203207 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011683 
Strand
Start bp572683 
End bp574505 
Gene Length1823 bp 
Protein Length587 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182424 
Protein GI219124256 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATACCGGACG CTGGGTGATG ACGACACGAG TCACTGGCAA CGAGAGAAGA GGAAGACAAC 
TTCTGCTACG ATTTTCCGGT TTTCTTTCAG CTGCCCTGAC GGTGATGGCT TTGATCACGG
CGACACGAGC ATTGGCGCCG GTTGTTGCCT ACATCGCACC ACAACGAGTA AGGTCACATT
TCATGAGGCG CGCGTTTGCT TCCGGGAACT GTCGCTCCAG GCGTTTCAGC GTTCTGCGTT
CGACAGTCGG AACGGAAACG CGCAACGAGA ACAACATCAC TACACCTTAC CATTCACCCT
TGCCGACGTC TAGATATGTG TACCAATCCT CTCGCACTCT CTCTTCACGT AATCCAGCGC
TGCCTTTAGA ATCCACGCTT ATGCACTCGA ACGCAAACGA ATTGACAGAT GTCGAACTGA
AGAACTTGGT TGCTCATTGG AGAGACCACC CTGTCTTAAA TCCGATGGTC ACTTTCCGCT
CATGGGTCGT TCCTATCAAA GGCAAGATGA TTCAAGCAAT TCTCAATTCT AAATCCCTGC
AACCCTATCT AGCAAGCAGA CACGAACTGC TGCAAGAAAT GCATGTGAGA CTGAAAATCG
TTCGCGATTA CACCGACTCT ACCGATGGCA CAGAAAAATT GATACTCTTG CATCCCGATA
CGCCACCACT GTCGGAACTT CCCGCAGATG TCCAGCAGCT ATTACGCAAT TGTCAGATTC
ATGAAAATGG TCCCGTCATG CCTACGAAGT TTACGTATAA AGACTTTACG GCATCGTACA
TTCTATCTCA GCTGTTGCCT ATAGCTGTCC ATCCGCCTCC GACGGCCTTT GAGACCATCG
GACACGTGGC GCATTTAAAT TTGAAGGAGC GTCACTGGCC TTACCGCTTT CTTATCGGCC
AAGTGCTGCT AGAAACGTTG CCCTTGATAG AAAGCGTCAT TAACAAAGTG GGGGAAGTCT
CGGGACCGTA CCGCACCTAC GATTTTGGAC TGCTTGCGGG TCGCAATGAT ACGCGGGTCA
AGCTCACAGA ATCAGGCGTG CAACTGCAAT TCGATTTAGC GGACGTCTAC TGGTGCTCAA
GGCTTTCGGA GGAACGACAA CGGCTCCTTC GTACCTTTCA GCCTGGGCAG ATCATTGCCG
ATCCCTTTTG TGGAGTGGGC GCTCTATGTC TGCTGGCTGC ATCGTTGCCA CAACGGAATT
GCACGATCTG GGCCAACGAC TGGAATCCAA AGGCGGTGGA ATACTTGCGC GAGAATGCGC
GGCGGAATCA TGTGTCCGAC CGCATAGAAC GGCTACAATG CGGAGATGCC TACGACTTTC
TTATGGATAT GGGTCTACAA CAACACCAGA AAGCATCGAC CAGATCAAGG AAAGAGGATG
TTACGAACAA GGATGGGAAC CATGTAACTC CAACCGAACC TATGCGACTC CCGGATCACG
TCGTAATGAA CTATCCAGTA GAAGCACCCA AATTTTTGGG TGCGTTACGG TGGTGGCCTG
TCCCGCCAAG CTCAAGAAGG GGTAGCACCA CACGCGATGG TGGTATCGGA TCGGTCATCG
TACCACGTGT TCACGTTTAT ACGTTTGCCA GGGCTGATCC CACAACAGAC CGAGATGCCG
AAGAGGTGGC GGTAGATCTG GTTGCGGCCA ATTTGTTGCC ATTGGGAAAC ACGATACACT
GTCGGACCGA AATGAACGAA GACTACGATT GCGACATCCA GGTCCATCCG GTGCGTGATG
TCGCACCCGG AAAGGTCGTC CTGTGCGTAA GTTTTTCTGC CACACCCAAG CTACTTCGGT
ATATGCAGGG CGACTTTCGA TAG
 
Protein sequence
MTTRVTGNER RGRQLLLRFS GFLSAALTVM ALITATRALA PVVAYIAPQR VRSHFMRRAF 
ASGNCRSRRF SVLRSTVGTE TRNENNITTP YHSPLPTSRY VYQSSRTLSS RNPALPLEST
LMHSNANELT DVELKNLVAH WRDHPVLNPM VTFRSWVVPI KGKMIQAILN SKSLQPYLAS
RHELLQEMHV RLKIVRDYTD STDGTEKLIL LHPDTPPLSE LPADVQQLLR NCQIHENGPV
MPTKFTYKDF TASYILSQLL PIAVHPPPTA FETIGHVAHL NLKERHWPYR FLIGQVLLET
LPLIESVINK VGEVSGPYRT YDFGLLAGRN DTRVKLTESG VQLQFDLADV YWCSRLSEER
QRLLRTFQPG QIIADPFCGV GALCLLAASL PQRNCTIWAN DWNPKAVEYL RENARRNHVS
DRIERLQCGD AYDFLMDMGL QQHQKASTRS RKEDVTNKDG NHVTPTEPMR LPDHVVMNYP
VEAPKFLGAL RWWPVPPSSR RGSTTRDGGI GSVIVPRVHV YTFARADPTT DRDAEEVAVD
LVAANLLPLG NTIHCRTEMN EDYDCDIQVH PVRDVAPGKV VLCGDFR