Gene PHATRDRAFT_47240 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47240 
Symbol 
ID7202618 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011681 
Strand
Start bp585 
End bp2368 
Gene Length1784 bp 
Protein Length544 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181466 
Protein GI219122259 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGCAAA TTGCAGTCGG CATCAAAGCG TTTTTCTTGG TCTCTCTTTG CTCCGGAGCG 
ACCGCATCTT CCAAAAACCG TGATCTTCAA GAATTTATTG CCATTGCATC TTACCAGCCA
AGAACAGATG TCTCGAACCA GGTAGGTATC AAGAAGGAAT TATGTCTCCT GAAGCAAGTA
TTTAATGTTG TTGCTCAACT CCTTTTTCAC TTTAAAGAAA AACATAGCTT TGGATCAGTA
TGAATTATCG AGCTTTCTCG GAAACCCAGC GAACGATGAT CTCGAGGCGG CCAAAGCAAT
TTACGAAAGA GGAGCGTTTG TCACTCCGAT CGCGCGCCTT ACTTTGACGA ACGAAAGTGG
TCTTCCCACT ATGATTACTT CAGACGAGAC ACTCGTAACA GGAAAGACTG CAAACGGCAC
AGAAGTAACT GGAATCGCGT ACGAATCGTT CAACCCAGGA GAAATGGAAA TTTCGGTCCA
GTACGCCAGC GATGCGCCAG ATAGCTGCGA AGTCGGTGGG CTCCTAGAAC CGTACATGCA
CGGATGCTTT GCAGCTGATG GTGAGCTGGA CATAGAAGGA GAGCGTGTTG CTTACAGATA
CGACCCCTCG ACCGATAATT ACAACGGGAG GACTTTGCAA CAATTCAGCA CTGGTGCATC
TTTTACATTC CGCGATCCTA ATGCGGGTAC CGAGTACTTT GATGAATTCG AAAAGTTCTT
CGACTACTAT GGGAAAGCCT CCTACGCCGA TATTTTGATT CAAGCCGCTT TCAATAAGAC
AAATACCGGC TTTCGAAATG GAAACTTGGA TTTTTCGACT TATCTTGACG GTGACGGACA
AAATGGTGAG TTCAGCTTTT TAGGCGTCGG CCTCCCCGAA TGTTGCTTTC GTGGCTGACC
ATGACCTTTT GGACACAGCG GCCATCGCCA CGGCGACGGC TTACATGATT TTAGGAATGG
AAATAATTGG CAAGTTGGAG CATGCTGTGG TGCAATGTGA CCTTCCCTGT GAGACTGACG
ACTGCAAGCT TGACCCCGTG CATAGTCTCG ACGAAGCCGT CGCATTTTGG ACCGGAGTGT
TGGAAGACTT TGATCAGGGT TCGGGGCGCA GCAACCTGTT GTACGGATTG GCTGACGAAA
CCTGTCGCCA GTTTCGCACT TGTGGAGTGA CCGGAGATTC AACGGAAGGC ACGTCGCGCG
TGAACATTGA TCTGTTTCAG CTTTTCCGAA CCATGCAGGA GCAGCTTCTC GGTAACCAGT
GCATCGAAGC TCGGGCTAGC AAAGATCGCA TGATTCCCCT GATGTTTCTC CCGTTGATCC
AAGCAACTCT CAGCAACGTC TTCTTGGCCA AGAGCATGTC TTTTGACGAA GTCGTCGACG
GCGAGGGTGC GGTACTAGCG GCGGCGTTGC TTCCACGCCT GGCCTCTTGC AATTTCGAGG
ATGCGCAGCG GTTGTATTTA CAAATGCGAG TCGGTCAGCA CGGAGTGGCA GGTTACTCGG
AAGTTCGTCA GGCTTTGGAG CGCAACTACG AGTGTTTGGG AGTCACCTGC GCCGACGTTG
GGGGTTTGTA CGATCGAGAC AGGGGCGAAT ATGAAGCCGA AGGGGCTCCT TGTGGCGGGG
TCGCGCAGGG AGGCGGCGGA ACGAATCCGG GTCTAGCCGT TGGCTTGTTC CTTGGTGGAA
TGGTGGCCGT GCTGTTGGGC TTTGTGCTAA TCCGCCACCG CCGCCGTAAC AAGTCGCTAG
GGGCCGCTGA GTTTGCAGTG GAGGGGGATC ACGTGATTGC GTAA
 
Protein sequence
MAQIAVGIKA FFLVSLCSGA TASSKNRDLQ EFIAIASYQP RTDVSNQKNI ALDQYELSSF 
LGNPANDDLE AAKAIYERGA FVTPIARLTL TNESGLPTMI TSDETLVTGK TANGTEVTGI
AYESFNPGEM EISVQYASDA PDSCEVGGLL EPYMHGCFAA DGELDIEGER VAYRYDPSTD
NYNGRTLQQF STGASFTFRD PNAGTEYFDE FEKFFDYYGK ASYADILIQA AFNKTNTGFR
NGNLDFSTYL DGDGQNAAIA TATAYMILGM EIIGKLEHAV VQCDLPCETD DCKLDPVHSL
DEAVAFWTGV LEDFDQGSGR SNLLYGLADE TCRQFRTCGV TGDSTEGTSR VNIDLFQLFR
TMQEQLLGNQ CIEARASKDR MIPLMFLPLI QATLSNVFLA KSMSFDEVVD GEGAVLAAAL
LPRLASCNFE DAQRLYLQMR VGQHGVAGYS EVRQALERNY ECLGVTCADV GGLYDRDRGE
YEAEGAPCGG VAQGGGGTNP GLAVGLFLGG MVAVLLGFVL IRHRRRNKSL GAAEFAVEGD
HVIA