Gene PHATRDRAFT_47384 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47384 
Symbol 
ID7202531 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011681 
Strand
Start bp482487 
End bp484283 
Gene Length1797 bp 
Protein Length491 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181566 
Protein GI219122468 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TCAGGATCAT CCAACGCAAG CAGTTGTAGC AAGCTGTATT CTTCTATACG CACGCACGCA 
CATACATTCC AGCGAAACGA ATTTCTCACT CGGGAGTAGG ATCCACGTCC ACCGCACGTG
TGTGAAAAAG TTCGTTCGCT CGATTGTCGA GCCATCGCCT CACACTGTCT CTTTGCTCTA
CTAATTGTCT TATGAGTTCG GAAAAACCCG AAGAAAAGAA CCCCGAGGAA TCTCCTCCGA
AAGTAGATTT GCCGATAGGC GAAGAGAAGG AGAAATTGGC AGAAAACTCG GATGTACTGG
ATGGGGTTGC GACGGTGGAT CAAAAGGAGG AAGCTGCCGA AACCCGCATC AACTCTAAAA
AGGACAGCGA TGAGCTTACT AAGAGATTGC AAAAGCTTGC TTTGGAACGT CAAAAAGCGA
GCACGGGGGA TCGTTTGAAA GTCATTCAAT CTGACACGAG CTCGCACTTG TCCAGCGTGA
AAACATTCGA CGAACTCAAT TTGCCGAAGC ATTTATTGGA CGCTGTTTAT GCCATGGGTT
TTGATCGACC GTCGGCGATT CAAGAAGAAG CTCTACCGCG AATTTTGGCC GACCCCATGC
GGAATTTGAT TGGACAAGCT CAGGCTGGTA GCGGCAAATC GGCGGCCTTC ACCTTGGGAA
TGCTCTACCG GATCGTGGTT GATTCACCAG CGACGACGCA AGCTCTGTGT GTCACACCAA
CACGGGAATT AGCCATTCAG ATTGTGGATA AAGCGGTCAA ACCGCTGGCG GCTAATATGA
AAGGTCTCAA AATATGTCTA GCCATCGCCA ATACGTTCAT TGACCGCGGA AAGACTGTAG
ATGCACACTT GGTCGTTGGA ACCCCGGGAA AGGTTTCTGA TTTCCTCAAG CGTAAAAATC
TAAATCCCAG AACGATTAAA GTCTTCGTCT TGGACGAAGC GGACCATATG GTGGAAGAAG
GTGGTCATAG AGCGAATTCG CTCGTTATCA GAAAGGTCAT GCCGCCAACC TGTCAGTCGC
TCTTCTTTTC GGCTACTTTT CCTCCCGAGG TCGTTCAGTT TGCGGAGAAG ATGGTTGAGA
AACCCGACAA GATTCTCATC GAAGATGGAC CCGAATTTTT AGTAAGTCAA TTCACAAGCC
ACTTTTCCTT CGTTCGTTCA TGGTGTCGCC GAGGATGCTC ACTCTTTGTT TCAAATCTTT
ACTTAAGGTC GTGGACAATA TTCGACAACT CTGGGTTGAC ACACGAAACT ACGAGGGCGG
AAAAATCGAG TTTTTGGCTG ACATTTATTC TCTCATGAGC ATCGGTCAAA GTATCGTCTT
TGTTGGTACT GTTGTTCAAG CTGACAAAGT GTACAACACG CTGACGAGCT CTGGGTACAC
CTGCTCCGTG CTGCATAGTA AAGTTGGCCC CGAAAACCGG GACACAACTA TGGAAGCCTT
TCGCAACGGC GAAAGCAACG TCTTAATTAC GACGAACGTT TTGGCACGAG GTGTGGACGT
TGATAATGTT GGCCTCGTCA TAAACTATGA TGTGCCAATA GACAAGGATG GCAATCCTGA
TCATGAAACG TACCTCCATC GCATTGGTCG CACCGGGAGA TTCGGACGGA AGGGAACAGC
AATCAATCTG ATTTCGGACG AAAAGTCCAT TGGGATATTG GCTGCCATTG AAAAGTTTTA
TTCGCCCGCC AAAGAAATGA TCAAACAAGT AGAGGCTGAT CCCGAAACAC TAGCGGACCA
CATCCAAATC TAACATTAGC GAGACAGGGA TCAAAAATAG GAAACAATTT GCTCCTT
 
Protein sequence
MSSEKPEEKN PEESPPKVDL PIGEEKEKLA ENSDVLDGVA TVDQKEEAAE TRINSKKDSD 
ELTKRLQKLA LERQKASTGD RLKVIQSDTS SHLSSVKTFD ELNLPKHLLD AVYAMGFDRP
SAIQEEALPR ILADPMRNLI GQAQAGSGKS AAFTLGMLYR IVVDSPATTQ ALCVTPTREL
AIQIVDKAVK PLAANMKGLK ICLAIANTFI DRGKTVDAHL VVGTPGKVSD FLKRKNLNPR
TIKVFVLDEA DHMVEEGGHR ANSLVIRKVM PPTCQSLFFS ATFPPEVVQF AEKMVEKPDK
ILIEDGPEFL VVDNIRQLWV DTRNYEGGKI EFLADIYSLM SIGQSIVFVG TVVQADKVYN
TLTSSGYTCS VLHSKVGPEN RDTTMEAFRN GESNVLITTN VLARGVDVDN VGLVINYDVP
IDKDGNPDHE TYLHRIGRTG RFGRKGTAIN LISDEKSIGI LAAIEKFYSP AKEMIKQVEA
DPETLADHIQ I