Gene PHATRDRAFT_54602 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_54602 
Symbol 
ID7201709 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011678 
Strand
Start bp759320 
End bp761809 
Gene Length2490 bp 
Protein Length740 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181068 
Protein GI219120669 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTTAAAGAAA ATGTAAAGCG AACAGTTTGG GGATGGGGCA ATTTGTGCCT TCTCTTCGAG 
AGGCCCTTAA CTGATCTAAA GTAGCTCCCG CAACTGATGT AACGTCCAGC TCCGAAACAA
AGCCGAAAAA CAGGATTGGA AATCCCCACG GTCGATGGAG CGCGATATTT AGTCGAACGT
TGGATTCTTG TCATAAAATG TTGCGCTAGT CCAAGAGCTT TAAGATGCTT ACCACCCTTC
ACATCGGCGT GCCAAATTCG ATCCCGTCGA TTGCTAGCGA AATGAAGGAG CTCAAGCTGA
CGAGAGCGAT CGCACTTCCT CAGCATCTGG AAAGCGTCAC CGATAAATGC AATATTGTGT
CTGGTCATAC GGGGAGTCCG TTACTAGCTC AAGAAAACCA TTTCGTCAAC AGAAGACAAC
ACTACGATTC AATCGCAAGC TTAAACAACG CAGGTATTCA ATTTGCCGAG TCCTCTAGGA
TGAGTGCCGC ATTACAGTGC TTCCGCCAAG CTCTTTGCGA TGCGGAAGCG ATAGCTACGG
TGCACGTTTT GCCTGCTGCG GATACTCGAC CGACCAGCAA CGCCTACCAA AAGTGGAGTG
GTGCAAAGGG GCCGACATTG GGTATCAATC GAGAGACTGT GTCCGGTTTC CGAAGACGTG
AGTATGACGA AGGAATGCGA GTCTTTGCCG CATATCTTCG CCTTCCCATG CCAAAAAGAG
ACACGGCCGA AACGCAGAGC TCAGAGATTG CGACTGTACT GTACAATATG GGTCAACTTC
AGGTAGACCG ACTCGACTAC GAAACTGGCC ATGGACTGTT TGTCGACGCC CTAGTCATGA
CAGAGTACAT GTCGAGAGAC CAAGCGCGCC AAAGTGTTGT GCCGATCCTG CACGGCATTG
GATACACGCG TTATAGAAAC GGAGATTTTG AAAAGGCCAT TGAGACCTTT CAAGAAGCGC
TGGTATTTAG CTGGGAGGAT GGAGACAGGC GACACCTGGC AGCGACATTG AACTGCCTTG
GGGTTCTGTA CTTCCACCTC CCCGAAGCCA AGCCCAAACG TTCTATGGAA TTTTTGACAA
GGGCCCTCAC GCTACAGAGA AACATACCCG GAGAACTGGT CGCGATTGCT ACTACTCTTA
ACAATCTCGG AAGAGTTTAC TACATGGAGA AACGTTACAA GGAGGCTCTC TCTGTCTATT
CAGAAGCGCT GAACATTCGA AGGAATTCAC TGGGAGCAGA AAATTTGGAT GTCGCCGCAA
CTGTGTATAA CACTGGACAA ACGTTTCAGC AACTGAAGGA AATGGATACT GCAATCTACT
ACTACAAAGA CTTCCTACGG ATTGCCATTC CCAAATTAGG TCGAGAGCAT CGAGACGTCT
GCACTATCTT GAAATGCATG GCGCAGATAT TCCACAAGAA GCGAGATTTT CCAAGAGCAC
TAAGCCTGTA CCACGAAGTA CTCTCAGGAT ATCGCTCTTC GATGGGGGAA CATGCTGAAG
TCGCATCAAT AATGAACAAG ATCGGAAATC TACATTATGA AGCCGGAGAT TTTGACTCTG
CCATCGACAT GTACCTACAA GGATTGTATA TGGAACGTGA AGTGCTGGCG GATGCACATC
CAAACATTGC CGTAACGCTA TCAAACATTG GACAAATTTT TAAGCAGCGT GGAGAGTACG
ATTCGGCTTT GAGGCTCTAC GAAGAAGCGT TTTCGCTCCA AGTCCGAGCC TTCGGAAAAT
GCGACCCAAA TGTTGCCCTG ACTTTGTCAA ATATTGGGCT TATTTACTAC CAAAGTGGAA
ATTTTGCTGT TGCATTGGAG ATGTATCAAG AGGCTTTGGC GATTCGTCGC AAGCTGTATA
CTGAAAGCAA TCTTGACGTG GCGTCCTCCC TTAATTCCAT AGGGCTTGTT TTCTTCAAGC
TTGCTCAATT TACCAAGGCT CTGACTAGCT TTGGTCAAAG TTTGAACATT CGGCGAAATG
TTTTGGGTGA TTCTCACCAA GATGTTGCCA TTATCCTTTA CAACGTCGCA ACAGTGTATA
TGGAACTAGG GCAAGAGGAT GAAGCGGTAG AATTCTATCG GGAAACGATA CGAGTGGAAA
AAACTGCTCT CGGTCCAACC CATCCTGATG TCTGCCTAAC GCTTCGGTAT GTCGGTCAGA
TCTATCAGCA GCGAGGAGAT CTCCAGGATG CATTGAGCTG CTTTCGTGAA ATCCTGCAGA
TACAACGGGA CAACTTCTTT GAAGAAGATT TGTGCATCGC CAGGACGCTC AATAGTATTG
CCAACCTCGA ATTACAAAGA GGGAATACCG ACGCCGTGGT TGAAACAATG TCCGACGCTG
CACGTATTTC CAAGCGAGCT GGGGGCAGTG AGTTTGATTT TCGTTTATGC GGGTTCCATT
TGTACGGGTT CGCTAAACTG CACCCTCAAG GAGCCGCAGC GGCATGATAA CTGTTGGCAA
TAATGAATAA AGAGCGTTTG AGTCGTCTGA
 
Protein sequence
MLTTLHIGVP NSIPSIASEM KELKLTRAIA LPQHLESVTD KCNIVSGHTG SPLLAQENHF 
VNRRQHYDSI ASLNNAGIQF AESSRMSAAL QCFRQALCDA EAIATVHVLP AADTRPTSNA
YQKWSGAKGP TLGINRETVS GFRRREYDEG MRVFAAYLRL PMPKRDTAET QSSEIATVLY
NMGQLQVDRL DYETGHGLFV DALVMTEYMS RDQARQSVVP ILHGIGYTRY RNGDFEKAIE
TFQEALVFSW EDGDRRHLAA TLNCLGVLYF HLPEAKPKRS MEFLTRALTL QRNIPGELVA
IATTLNNLGR VYYMEKRYKE ALSVYSEALN IRRNSLGAEN LDVAATVYNT GQTFQQLKEM
DTAIYYYKDF LRIAIPKLGR EHRDVCTILK CMAQIFHKKR DFPRALSLYH EVLSGYRSSM
GEHAEVASIM NKIGNLHYEA GDFDSAIDMY LQGLYMEREV LADAHPNIAV TLSNIGQIFK
QRGEYDSALR LYEEAFSLQV RAFGKCDPNV ALTLSNIGLI YYQSGNFAVA LEMYQEALAI
RRKLYTESNL DVASSLNSIG LVFFKLAQFT KALTSFGQSL NIRRNVLGDS HQDVAIILYN
VATVYMELGQ EDEAVEFYRE TIRVEKTALG PTHPDVCLTL RYVGQIYQQR GDLQDALSCF
REILQIQRDN FFEEDLCIAR TLNSIANLEL QRGNTDAVVE TMSDAARISK RAGGSEFDFR
LCGFHLYGFA KLHPQGAAAA