Gene PHATRDRAFT_46543 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_46543 
Symbol 
ID7201687 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011678 
Strand
Start bp658228 
End bp659928 
Gene Length1701 bp 
Protein Length532 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181046 
Protein GI219120623 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCATCAC AGAAGCAACC ACAGCGGAAG GACTCGGACG TTTCGGAAAA ATCGGACTCC 
GGCGGGGGCG ATCCCTACGC TTCCGCAGAC TATCAAGAAG CGCTCGAAGA TGTGCACACC
CGCTTTATTC TGAATTTGCC GCCCTCGGAA CTCGAAACAG CCGACCGACT TTTCTTCCAA
CTGGAACAAG CATGGTGGTT CTACGAAGAT TGGATCTGTG ATCCTCATCC AGAGAAAGTT
TTGCCTCGGT TTTCCAGTTT CAAACCCTTT GCCCAGAAAA TGTTCGCCTA TTCGGAAATG
CTACCGGAGT CCCACAAATT CGGATCCATG TGGGCGGAGT TTTCGCAGTA CAAGCGCGGA
ATATCTAACT ACGGATGCAT TCTCTTGTCA GTGGATTACA CTAAAGTTAT TTTGTGTCAA
GGTTAGTGTT GGGCGGGATC TCATGCGTCC CCGGTACGAG TTTTCTCTCT TTTGGTCGCT
TTTCTGCCTG ATTTCTGACT GCCGTTTGCT TCTTTCAATT TAGTATGGAA TGGAAAGACG
TTCACCTTTC CAGCTGGAAA GATCAACCAG GGCGAAGATG GATTGACCGC CGCAGCCCGT
GAAACCTACG AAGAGACAGG GTTCGATCCC AACTGTGTGT TTGGACAAAC CGCTTCCTGG
AAAGCGACGG ATCCTGCGAA GATTACCTGG AAATCTTTGC AGGAACAGGA CGCTCTGATT
TTTCAAGAAG ACAATGGTAA ACGCCGAACC TGCTATGTCT GTCACGGTGT TCCGGAAGAC
TTTCCGTTCC TGCCCGTGGC ACGTAAGGAG GTCGCCAAGG TAGCCTGGTA CCGCGTGGAT
AAGATACCGA AATCTTCGTA TGCGGTATTT CCCTTTTTGT CCCAACTGAG ACGATGGATT
GCTCAACGCA CAAAGTCTTC GCGCGACAAG TCGACGGGCC GATCGAATGC TCGTAAAAAG
GGTACACCCA AGCGTTCCGG CAACAACTCC AGAGGTCGCG ATTCTCGCGG TAAAGTGCGG
GATGGCGACG GCCTGGTCAC CAGTGGACTA GCTGCGCCCG GAGAAGTATC CAGATGGTCG
GAAGACGATA TGTTCGCCGT TAACGAACGT CTCTTGGGGC GAAAAATCAC GTACGACGGG
AACCCTCATT TGTTTGAACA AGGATTTCAA GGCCAGGACC CGCACGCCTT TCACGTCGTC
GGTGGGTCTT TCCTCAATAC CAATGATTCG ACTCTGGCCC CGCCGCCGGC AACTTCAAAA
TTGCAACCTT TGTTTCGTGG CAGTAATAAT GATACGGGGA AAGATGAGTT GCTGCCCTTT
TTCTCGGATG ACGGTGCTAC ACCGTGGGGA GAAACAGTGG AAGACGCCAA AGGAGCACCG
CCACCCAGAG CTCTCAAGGA CGACGCCGAC GCCTTGCTGG CACTTTTACA GCAAACGAAG
GATCCACCCA AAAGCTTGTT GACAACCGGG AACGATGTCG ATGTGGCATT CCTAACGGAC
GCGGAAGTAA CCGCTCGCAG TAATGAAACC AAAACAACTG ATCGGCGAGT CACTATGCGG
GCGCAGTACG AAGCCGATAT GGAGTTTATT CGCGAATGGG TTGCGAATCT ACCCAAACCA
GGACCTTCCA AACATTTTGG AACGTTCAAG CTTGACGCCG ACGCAATCAT GGCCAACGCG
TTGGCCAGTG TATCAAAATA G
 
Protein sequence
MASQKQPQRK DSDVSEKSDS GGGDPYASAD YQEALEDVHT RFILNLPPSE LETADRLFFQ 
LEQAWWFYED WICDPHPEKV LPRFSSFKPF AQKMFAYSEM LPESHKFGSM WAEFSQYKRG
ISNYGCILLS VDYTKVILCQ VWNGKTFTFP AGKINQGEDG LTAAARETYE ETGFDPNCVF
GQTASWKATD PAKITWKSLQ EQDALIFQED NGKRRTCYVC HGVPEDFPFL PVARKEVAKV
AWYRVDKIPK SSYAVFPFLS QLRRWIAQRT KSSRDKSTGR SNARKKGTPK RSGNNSRGRD
SRGKVRDGDG LVTSGLAAPG EVSRWSEDDM FAVNERLLGR KITYDGNPHL FEQGFQGQDP
HAFHVVGGSF LNTNDSTLAP PPATSKLQPL FRGSNNDTGK DELLPFFSDD GATPWGETVE
DAKGAPPPRA LKDDADALLA LLQQTKDPPK SLLTTGNDVD VAFLTDAEVT ARSNETKTTD
RRVTMRAQYE ADMEFIREWV ANLPKPGPSK HFGTFKLDAD AIMANALASV SK