Gene PHATRDRAFT_55114 details


Gene Information

Locus tag: PHATRDRAFT_55114
Symbol:
ID: 7198589
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011693
Strand:
Start bp: 146920
End bp: 149505
Gene Length: 2586 bp
Protein Length: 617 aa
Translation table:
GC content: 54%
IMG OID:
Product: predicted protein
Protein accession: XP_002184743
Protein GI: 219129117
COG category:
COG ID:
TIGRFAM ID:
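
The Gene Length value agrees with the Start bp and End bp fields if the coordinates are read as 1-based and inclusive. That reading is an assumption, since the record does not state its coordinate convention. A minimal sketch of the arithmetic, using a hypothetical helper named `gene_length`:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
print(gene_length(146920, 149505))  # 2586
```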


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.950731
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGCACG CCAGCAGCAG TAGCAGCATT GCTCCTGTCG AATCCCGCGC GGCTCGTTGG 
CGACAGCAGT TGAGAATCCG GCCCCAGCCG CATACCAACG TAGTTAGGTC ATCATTCGCA
CAACTCGTTG CCCAATTCCC CTACCCCTCC TACGACGACA ACGACGGCGA CCCCACCGCC
AACAAAGCTT CGCCCAACGC GACCGCATCG AAGGTTTCAG AGTTGGCACA CGGTGGCGTA
GTACCAGATC TGGATCCGCT CACGGCGTTG GTCCGGGAAA CATCGGAACA ACAACAACGT
CTGGAGTCGT TGGAATTAAA GTATCGCAAG GAAAAGGCGT TGCGCAATCG AGCCGGAAAA
GGTGGGGCTA GCGATCAAGG ACGGACTTTA GTCGAATCGG AAGACTATGA TGAAAATGCT
GTCACCTTGC AAATGATTGA CAAGGACCTG GCGAGATTGC CGCCCCCAAA AGGCTCCGGA
CAAAATGGAT CCCAAAATCT TGCTGGTGTT GTTGTTTCGA AGGACGAGGA TACGGCAGGC
ATACCCACCA GTAGCGGTAC TAGCGATGAG CGCATAAAAA CGTTGCGCCG TGTCTTGTAC
ATTTACGCCT GTGCTCATGC CGAGGCAATT GGCTATCGAC AAGGCATGCA CGAAATTGCT
TCCTACATTT TGTTCGCTTT GGAGTTGGAC CAGCAAGCAG AGGAGAGTCT CGTTGCCGTC
GCCACCAGCC AAGAGCAAAT TGCTTCCGAT GCGTACGAAC TACTCGAAAC TATTTTAACA
TCGATTGAGT GCGTCTACGA CGCAACGCCT CTACCGGGTC AACACGAAAA ACCACTCGAA
GCCAGCGCCC GACGTGTACT GCAAGGCGTG CAAACGTACG ATGCTGCCCT GGCGTTACGT
CTGTCTCAAT TAGGCGTACC TCCTCAGTTG TATCTGACCA AATGGATGCG ATTGATGTAC
AGTCGCGAAG TCACGGATGT TTTGTCTCTG TGGGATGAAC TTTTTGCTTA CGTAGGCGAA
GGCAGCACGC TGGTGACCGT TTTGGAAGCC GTGGCTGTGG GTCGTTTGTT GTCATGGCGT
GATCGTATTT GCACCGATCC AGATGCGTTA CACTTTCTCA TGAATTTGCC CATCGAGACA
AACGTGCAAC GGTGGCTGGA TTTATCTCGA AAGGTTATTC ATAAACAAGG CATACCGTTG
CCACCCATCA AGGCCACGAC ACCGGTCGCG CCAGCTACAT CAATACCCGC GTACGCCGTG
CCGCACTCCG CGCCAACCGG GAGCGTCAAT AGTAATTGGA GCCAGCCCAA TGCTTCTCTG
ATGTCGACCC CCCAACGAAC GTTTCCGTCA GAGGCTGGCA ACGAATCGGG AGTCTTTTCC
GTGGGTCGCT TTTCTTTATC AGCGGTGAAG GAAAAGTTTG AACAGGCGAA ACACACGACG
CAGTCGTTGA GCAAACGCTT GTACGACGAA TGGGAGCAGC AACAACATCA CCGCGCCACG
GATGCTTTCG AACGCCCTTA CTCCGACGCC TTTCCGGACA GTGAGCACGA CACTCCAACA
GCGATCAATG ACCCACTGAC TCAAGTTCCC TACCGCAACG ACGACACGGC GGTTCCCGCC
AATGTGTACA ACGGTCAGTC GACGCCGCAA CGGCAATCAC AAGCTTCCCC ACCTACTTTG
GAATCGCAGT GGGCTAGCCG TGTGCAATCC GACGTACACG TGTTGCAGAA CTATTGCATG
ACCATGGAAC GGAGTCAAGC GCACGTCCCG GGCACCGTGT GGGAAGCCTT GGCCGACTTG
GAAATGCTGC GACAGGATCT GTCACGTCGA GCAGCCGGGA CCAGGCGAAC GTGAGCGCGA
GCGAAAACCT CACGGTCTCC TGTATGGGTT GGTCGGCGGA GCCGGGCCGG CGATAAGGCC
GCCAGTGACG CAGTCATTGC TATTTTGGGA TGGTCTACAA CAGAGGGCGG ACTGGAAAAA
GTGCGTGCCT TGAAGAGGGG GTCGTACCGG AGAGCCATGG TGTGGTGCTT GGAGGGTGTG
TTTCGTTGGC AGCGACTATA CCGGTGCCGA GCCGGAAAGG GAATGGTAAG AGTGATTGTG
GTAGTAGTCG AGAAAACGTA ATGATTGCGA TGGAGAGCTG TTTGCCACCT TCTTTGTGAA
CGCCAAGGGA TTGCTGACAG TGAGATGCTG ATTGTACCGT ACAGGCGCTT CCCGAACTAA
CTCCAGGGCG AAGCCTACTA CGTATGTTCC ACTGGAAACG AGACCAAAAT TATGGAACGA
TGTTCTGGTC GAGCATTGAT CAATGAGTTC CCTTGTTGTG ATTAGGGGAA GTTTGGAGGA
GAGGAAACAA AATACGATTA TTGTGGACGC GCCGTCACCC TCGAGAATTG GACGGATGAT
CCGTTCGTGG ATCACGAAGT GTGCGTCTCG GTCCAACAAT CGATGTGCCA CAGATACAAA
TGTACCAACG AACAACGTTC CCGTGGGGAA GAAGAAATCG ATGGAGCGCC CTGTTGCCTA
CGGTTTCCGT ACCAGCTACT GTAGTATGAA TGCAATTCCG TAAAGTAGAA TGCTGCTCAC
AAGTCC
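
A GC content figure like the one in the record can be recomputed from a sequence listed in this layout. The sketch below is one way to do it, assuming the space-separated 10-base blocks are cosmetic and that only A, C, G and T are counted. The helper name `gc_content` is hypothetical, and the record does not say how its own percentage was calculated or rounded.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq."""
    bases = [b for b in seq.upper() if b in "ACGT"]  # drops spaces and newlines
    if not bases:
        raise ValueError("sequence contains no A/C/G/T characters")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First 10-base block of the gene sequence above: 3 G + 2 C out of 10 bases.
print(gc_content("ATGATGCACG"))  # 50.0
```

Passing the full gene sequence as one string (line breaks included) gives the percentage for the whole gene.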
 
Protein sequence
MMHASSSSSI APVESRAARW RQQLRIRPQP HTNVVRSSFA QLVAQFPYPS YDDNDGDPTA 
NKASPNATAS KVSELAHGGV VPDLDPLTAL VRETSEQQQR LESLELKYRK EKALRNRAGK
GGASDQGRTL VESEDYDENA VTLQMIDKDL ARLPPPKGSG QNGSQNLAGV VVSKDEDTAG
IPTSSGTSDE RIKTLRRVLY IYACAHAEAI GYRQGMHEIA SYILFALELD QQAEESLVAV
ATSQEQIASD AYELLETILT SIECVYDATP LPGQHEKPLE ASARRVLQGV QTYDAALALR
LSQLGVPPQL YLTKWMRLMY SREVTDVLSL WDELFAYVGE GSTLVTVLEA VAVGRLLSWR
DRICTDPDAL HFLMNLPIET NVQRWLDLSR KVIHKQGIPL PPIKATTPVA PATSIPAYAV
PHSAPTGSVN SNWSQPNASL MSTPQRTFPS EAGNESGVFS VGRFSLSAVK EKFEQAKHTT
QSLSKRLYDE WEQQQHHRAT DAFERPYSDA FPDSEHDTPT AINDPLTQVP YRNDDTAVPA
NVYNGQSTPQ RQSQASPPTL ESQWASRVQS DVHVLQNYCM TMERSQAHVP GTVWEALADL
EMLRQDLSRR AAGTRRT
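
The protein sequence can be checked against the gene sequence by translating codons from the first base until the first in-frame stop codon. The Translation table field in the record is blank, so the sketch below assumes the standard genetic code (NCBI translation table 1). The names `CODON_TABLE` and `translate` are illustrative only.

```python
BASES = "TCAG"
# Standard genetic code (assumed), with codons ordered T, C, A, G at each position.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first in-frame stop codon."""
    seq = "".join(dna.split()).upper()  # remove the spacing between 10-base blocks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate("ATGATGCACG CCAGCAGCAG TAGCAGCATT"))  # MMHASSSSSI
```

The listed protein ends at AAGTRRT, which corresponds to the codons GCA GCC GGG ACC AGG CGA ACG followed by a TGA stop codon in the gene sequence. The remaining bases of the 2586 bp gene sequence lie after that stop codon.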