Gene PHATR_2089 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATR_2089 
Symbol 
ID7204728 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011679 
Strand
Start bp709570 
End bp710970 
Gene Length1401 bp 
Protein Length449 aa 
Translation table 
GC content53% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185770 
Protein GI219121078 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
AGCTTGACCA AAACACTGCA AGACTACTTT GGCTACCCCG CTTTTCGCCC GGGACAGTTC 
CCAGTTATAG AGGTGGTCTT ACAAGGCCGT GATGCGGCAG TATTCTGGGC TACCGGCTCG
GGTAAGAGTC TCAACTACCA AATCCCAGCC CTCCATACGG ACACGGTGGC AATCGTAGTC
AGTCCCCTCA TTTCCCTCAT GCAAGATCAA ACACACAAAC TCAATTTTTT GTCGGCGTCC
TCCGCTAGTG GCACGCAGAA ACCCGTGGCG ACATTCTTAG GGTCATCCCA AACCGATCCC
GACGAGGAAG CCAAGGCACT CCGCGGTGAA TACAATTTGA TCTACGTCAC GCCCGAAAAG
CTAGTCACCA GCGAATTTCT GCAGGCTCTA GAGAAATTGC ACAAAGATTA CAAGCCGATT
CGACTCATTG CGATTGACGA ATCTCACTGC GTATCGGAAT GGGGGCACGA CTTTCGTCCC
AGCTTTCGCT CGGTGGGCCC GTCGCTCCGC ACACACGACG TCCTACGTCA GATCCCCCTA
CTGGCCCTGA CGGCCACCGC GGTACCCCGG GTGCAGGAGG ATATCCTTAC TTCTTTACAA
ATGGAAAATC CCTTGGTAGT GCGGCAGTCT TTTGATCGGA CCAATTTAGA AATTATTGTC
AAACCAAAAT CGACGGGCGG GACTGGCAGT ATCCCTAGCG CATTCCAATC GCTCTTGGCA
GAGTGGCAGT CTTTGTCTAC AAGTAGTAGC AGCAGCAACC GAGTAGCTTT GCAAAGCACA
ATTGTATACG CGCCCACTCG TTCCCAAGTT GACAATATTG CGTCCTACTT GCAAACGCAT
GCTCCGTCAA ACGTTCGCAT TGAGGCTTAC CACGCGGGGA TGAACGCAGA AGACAGGACG
ACGGCGCACC GGAACTTTTT GACCGGCGTC ACGACCGTAA TCGTAGCCAC TGTCGCGTTC
GGTATGGGCA TTGGCAAGCC CGATACGCGA CGGGTCATTC ATTTTGGACC ACCCAAAACG
TTAGAGGAGT ACTACCAGCA AATCGGGCGT GCGGGACGTG ACGGTCTGCC CGCGACCTGC
ATTTTGTACG TCGCGAGTAG CGATTTGGAC AGATACCAGT CAGACTTTTA CCTAGGGGGC
TTGCATGGGA AAGCGAAGGA GGCCACATTG GAGAGTATGG AGGCCATGAA ACGATTTTCT
TTGGACGCGG AAACCTGTCG ACGCAAACAA CTGCTACTCT ATTTTAATGA GGAACCTGCA
TTCGGGGACC GCTGCGGTAC TTGTGATGTT TGCAAGAGTG TCGAAAAGTA CGGAGATGAT
GCCCAGCGTG ACTTCGGCGG CGAAGCGAGA ATTGTTTTGC ACGCTGTTGA CGCTTTGAAT
CAACAAAGCA TGTCTCAAAT T
 
Protein sequence
SLTKTLQDYF GYPAFRPGQF PVIEVVLQGR DAAVFWATGS GKSLNYQIPA LHTDTVAIVV 
SPLISLMQDQ THKLNFLSAS SASGTQKPVA TFLGSSQTDP DEEAKALRGE YNLIYVTPEK
LVTSEFLQAL EKLHKDYKPI RLIAIDESHC VSEWGHDFRP SFRSVGPSLR THDVLRQIPL
LALTATAVPR VQEDILTSLQ MENPLVVRQS FDRTNLEIIV KPKSTGGTGS IPSAFQSLLA
DTIVYAPTRS QVDNIASYLQ THAPSNVRIE AYHAGMNAED RTTAHRNFLT GVTTVIVATV
AFGMGIGKPD TRRVIHFGPP KTLEEYYQQI GRAGRDGLPA TCILYVASSD LDRYQSDFYL
GGLHGKAKEA TLESMEAMKR FSLDAETCRR KQLLLYFNEE PAFGDRCGTC DVCKSVEKYG
DDAQRDFGGE ARIVLHAVDA LNQQSMSQI