Gene PHATRDRAFT_49529 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49529 
Symbol 
ID7195743 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011690 
Strand
Start bp529727 
End bp532267 
Gene Length2541 bp 
Protein Length846 aa 
Translation table 
GC content53% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184152 
Protein GI219127876 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.135063 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTGTTT CTGGAGACCA CAACAGTAAC AATCTGACTG TGAACAACGC TGCAGCTTCC 
ATACTCGACG ATGAAGTGGG TGTTCCGGAC GACGCGTGTT TCCCCCCATC CGGGACACTC
ACAGCATCGT TGTGGTCCAA TAGTAACAAC GACCAACAAC ACGAGACTAC CGATCCAGTC
GCCGTCATTC GCTTGGACGC TTCGACTTGT TTGCGCCAGC CCGCGCAGAA CGGCCGTAGT
CTCGTGGTGG GTCGACACGT TTCTGTCCCA ACCTCTGTCG ATACCAACAG CAACAGCAAA
ACAACGACCA CTAGACGTAA CACTCCCGCC AATTCGCTCG CCGACATTCG CATTCAACAC
AAGAGCATGA GTCGCAAACA CGCCGTCTTT TACTACCAGC GATCGATCGC GTCGTCCACC
GAGGTCTGCT GGACCTTGCG TCTGAAAGAT CTGGGATCCA AGTACGGGAC GACCGTCAAC
GGACAACGCA TCGCTACCGG GAACGACGAT GATAACCATA ACAACACAAT TGTACTGCAA
CACGGAGATA CCATTGTGTT TGGCAACGTA CGAGAAACGG TCTTTCGCGT GCACTGGAAG
CACGACCCGA CAATTGCCCC AACGTCCACG TCTACCGCCA CCACGGCTAC CACGACGATT
CCCGTCAGTG TGGACTCACA GTCAGTCGAG CTCGCCACGA ACGATCCGCA CCGTCAACCA
TTACCCGATA CAACACCCGA CATTCTCGAA CGAGCAGGAC AGGGATTGAC GGGTCGAGCC
AAGAGACAAG CGGAAATTGC CGCCATGATG GCGACTTTAG ACCAAACGCC GCATTATCAA
CAACAACAAC AACAACAACC ACAGTTGGAG GAAACGTTAC CGGATAGACA AACCGCTATC
CATCACCACA ACGACAACAC TACTGACGAT CACTTGTCGA TCACTGCCAA AAATAAAAAT
AACAACAACA ACAGTTGGAA GCCGCCGTTG CCACTGCCGG TAAGCCAGCG ACTCGTGCTC
GCTTCGGAAA GCGAACGTCA TAATGAAACC ACCTGTCTAG CAATGGACCC GTCCGGAGCA
CGCTTCGTTG TTGGAGGGCG GGACACAATG CTCCGGTTTT ACGATTTCGG TGGCATGGAC
CAATCCAAAA CTGGTGCCTT TCAATCAGTT CTCGTCGACG ACGGGCACTG GTTGGTCGAT
GTCTGTTACA GTAACACCGG AGACCGAATA TTAGTTGGTA CCGGCAGTGT CCAACCCAAA
GTACTCACCA GGGACGGTGA AGAAATCTTG CAGTTTGTCC GTGGGGATAT GTACGTGACG
GATCAAGCAA AGACCAAAGG GCACACAGCT GCCGTTACCG GGGTTTCGTG GCATCCTTTG
GAACGGGACC TGGTCTTGAC CTCCTCGCTT GACGGTAGCG CCCGCTTGTG GAACCTCAAC
GGCAAAACGT CATTTTCCAT GCTAGTATGC GACAAGGTCT TTTTGGTCAA GAATGAACGG
GGTCAGCGCA CTGCCGTGAC GGTGGTATGC TTTCATCCAG GGGGTCGGGA ATTTGCCGTC
GGCACCGCGC GTGGAAGCGT ACAAATTTGG AATCGCCACC GAGTCGCGGC CCGTCCCGAA
CGAGCGGTCT ATAACGCCCA CGGCAAGGGC AACCCGGTCT ACGCACTATG CTACAATGCT
GACGGTTCGC AACTGCTAAG TCGGAGTCCG AGCGATGATA CGGCGAAAGC TTGGAACGCT
CAGCGCCTTT CACGATCAGC CCAACCAACC GCTATCTTCC GGGGATTATC CACCATTCAC
GACCGTGCCA ACGCTGTCTG GAGTCCAGAC GGTAATTTTG TGTGTGCCGG TTCGGCGGAG
GTACAAAAGA TTGATGGGAA GCGTCGCGAA GTCGGGGCGT TACACTGGTA CCAATTAGGT
CTGATCAAAA ACAAAGATAG CCGCCAGAGT ATGGGTGCCG GTACCACATC GTCACTACCT
CATACCCTTG ATCCCGTTCA TTCGATTGAT ATGGTCGACA ATGCGGCTCC CGTAGTTGTG
AACTGGCACG CTCGCTTAAA TCAGATTGTC GTCGGGTGTT CCGACGGAAG TGTCAGTGTC
TTTTACGATC CGACGCTCAG TTCAAAAGGG GCGCTCATGC CAGCTTCCAA GGTCAGTCGA
GGAGTGGATA GTCTTTCCGA ACTGCTCAAG TCCAAGGCAC CAACCGGATC CGCTGCTTTT
GTGGGCGAAA TCGTTACACC ATTCGCTTCG TCGGATGGTA CCAACAAGAA GAAACGCAAG
TTTGACGAAC CAGTGCATAC CATGGAACCC GAACGGCCTA CATCGGGCAA GCACAAAACA
GGCAGCCAGG CCGGCGGAGT TACCAATTTT CAACAATTTG TGGCGGACCA GACGCAAGTC
AAGGGTAAGG TGATTGCCGG TAGGGATCCT CGAGAAGCTC TACTACAGTA CCAAAAAGGA
AAAAGCTATC TCGGTAAAGA AACTAAAATC TTGGCTGAGA AAACTGCCGA GGAAGAGGAG
GAGGAGTCAA AGGCAACTTA A
 
Protein sequence
MGVSGDHNSN NLTVNNAAAS ILDDEVGVPD DACFPPSGTL TASLWSNSNN DQQHETTDPV 
AVIRLDASTC LRQPAQNGRS LVVGRHVSVP TSVDTNSNSK TTTTRRNTPA NSLADIRIQH
KSMSRKHAVF YYQRSIASST EVCWTLRLKD LGSKYGTTVN GQRIATGNDD DNHNNTIVLQ
HGDTIVFGNV RETVFRVHWK HDPTIAPTST STATTATTTI PVSVDSQSVE LATNDPHRQP
LPDTTPDILE RAGQGLTGRA KRQAEIAAMM ATLDQTPHYQ QQQQQQPQLE ETLPDRQTAI
HHHNDNTTDD HLSITAKNKN NNNNSWKPPL PLPVSQRLVL ASESERHNET TCLAMDPSGA
RFVVGGRDTM LRFYDFGGMD QSKTGAFQSV LVDDGHWLVD VCYSNTGDRI LVGTGSVQPK
VLTRDGEEIL QFVRGDMYVT DQAKTKGHTA AVTGVSWHPL ERDLVLTSSL DGSARLWNLN
GKTSFSMLVC DKVFLVKNER GQRTAVTVVC FHPGGREFAV GTARGSVQIW NRHRVAARPE
RAVYNAHGKG NPVYALCYNA DGSQLLSRSP SDDTAKAWNA QRLSRSAQPT AIFRGLSTIH
DRANAVWSPD GNFVCAGSAE VQKIDGKRRE VGALHWYQLG LIKNKDSRQS MGAGTTSSLP
HTLDPVHSID MVDNAAPVVV NWHARLNQIV VGCSDGSVSV FYDPTLSSKG ALMPASKVSR
GVDSLSELLK SKAPTGSAAF VGEIVTPFAS SDGTNKKKRK FDEPVHTMEP ERPTSGKHKT
GSQAGGVTNF QQFVADQTQV KGKVIAGRDP REALLQYQKG KSYLGKETKI LAEKTAEEEE
EESKAT