Gene PHATRDRAFT_47203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47203 
Symbol 
ID7202190 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011680 
Strand
Start bp805466 
End bp808162 
Gene Length2697 bp 
Protein Length702 aa 
Translation table 
GC content57% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181445 
Protein GI219122213 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGAGAGGCAT CCCTGCCTCC TTCCCTACCT GTACACACAT TCGCACACAC ACACACATTC 
ACGTGAGAGA GACACATTGA CACATCCGTG TGCATCGTGC ACCAAAAGCG AGCCCTCGTG
TTTCTTGCTG GAGTCGCCGT TGTCGTAGAC GTTGTTGTCG TTGTTGCCTC CTCCTGGGTT
TTTCGTTTGT GGGAACAAAA CCAGATCAAG CACGACCCTT TGCTGCGGGT TGCCTTTCAC
TGGTTCCGTA TTGGTGTTGC TGTGCGGGTT GTGTAGATTG TGTGTTGAGT GTACCGATTG
ATCGCAACGA AAGGACTGTA CGAGTCGCCT TCCGTGGCGT GCCTCCTGGT CCAGTCGGGT
ACATTCGGAA GTGCGGGTCA GTGGTGATAT CGAATACGTG CACACGTTTG TCTGATCGAT
TCTGCTCGGC AGCGTCCATT TCCAAGTTCC AAAAGGGTCG ATCTTCCAGA CCGGATTCCT
GTATACCTAC CTACAGATTC ATATCCCACA CCTACTTTCC CTATATCGAT GCTTCACTCC
ATCGCGGAAC AACGCCAAAA GCGGGAGCTT CCCCCGCTCT TCTGGTACAT GGAGTTGGGA
GAATGGGAAA AGGCGTCCGA TCGGGTACGT CGACACCCAC GGGAAGTCAA GACTTGGGCT
ACACTCCGTA CCAAAAACAA CACGGACGAA CCCGCCCCGT TGCCTTCGTC CGCCAGCATT
GCCAGTTACG CCACCAAGGC ATCTTCGTCT TCCGGAACCA AGCGACTCGC TCTCCACCAC
TCGTGCTTCA AACTGAGGAC TGCTGGCTCT TGTCCCGTCG CGTCCAAGGC TTCGGAAGAT
CCCTACGTAC AAGTCTGTCG ATTCATTCTC ATGCTCCTCC GACTCTATCC GGAAGCAGCG
GGACAGCGAG AGTCGCGACA CGGTTGTTTG CCCCTACATC TAGCCTGCTT TGCCTCCTGT
GCGCCTAGGG CCGACGACGA TACCCTCCAC AACAACAGCA ACCACAGTCG CAACGCGACC
TCACCCTCCG CCATTGCCAA GGCACCCGTC GCTCGCCCCA ACATGATTGC GCGGCGCATA
GCGTCCGACG CCACCTCCGC CACCACGGAC ACCAACCTTT CCGCCGTGCA CGCCGAAGAA
ACATACACCG GAAACATGGC CGACAAGCAA ATACGTCGCG ATCACACGGT ACACGTCGAT
CCCAACGTAT CCGTATCCAC CAAGAAACAT CTATTGATCA GTTCCAAACG GGAAGAAATG
GCCGTACAAG TACTCAACGC GCTGCTCGAC GCCTACCCCA AAGCCATACG TACCGATTCC
GAAGGCGGGC GTTTGCCGCT ACATACGGCC TGTGCCGGAC GGGCCACGCC CCGCGTCATT
GCCACCCTCG TCACGGCCTA CCCCTCCGCG GCACGACACC GGAACAAGGA TGGATTCCTA
CCCCTACACC TTACCGCACA CTGGGGCGTC GCCCATCCCA ACGTCGTCGT TACCCTTCTC
AAGGCATACC CCGACGCTAC CGTTGGACGC AATCGCTGGG AACGAACTCC ACTGGAAGAA
GCATTGTGCA TGGCGGGAGA AAACGGTCGA CCACATCAAG CCGCCATGGT CAGGGCGCTA
CGGAAGCACC CCTCCTACTG GCAACGAGCG ACCGCCGAAA TTATTCAGGG CACGCGACGT
CTACGCCAAC CCGGCAGTAA CGTCGTGGAT GTGGACGAAA GCTTGCCCTC CAACGACAGT
ACGTCACTAG AAGAGCAACG CCAAGGTCAT TTCGCTCACG GACACAATCC ACTCGTTGAT
CAAGTCGAAC AGGAACATTC CAAAAAGCCC GCGGGCAAGC TATCCCCAGA AGCGGCCAAA
AAGAAGCCCA TGGATCATAA ACTGGACGAA CTTATTCGAC AGCACGACTG GGACGCGGTC
ATTCGTCGGG TCGAAACGAA TCCCCTCGAG GTGGAGACGG AATTGGCGGT CATGACCCGT
GGCGGATTCC TCAGTTGCTC GGGTGTCACC CCACTCTACT ACGCCTGCGA ACGCCAACCC
CCCGTCGCCG TTGTACAAGC CCTCATCCAT GCCCATCCCC TCGCCGTTCT CACGCGCGCC
ATGCCTGGTG GGAGTCTACC ACTCCACGTA GCCTGTACCT GGCACGCCTC ACCCGACATT
ATCTGGGCCT TGTTGGCCGC CGATCAGGGG GCGGCCAAAG TCACCGACGA ACTCGGCAAC
GTGGCGCTCC ACTCAGCGCT TTTTTCCGGA GCCGATGTCC GGGTGATCCA AGCGCTCGTC
CAAGCCGATC CCGAGGCCGT ACTCTCACGG AATCATCAAG GATCCCGACC CGCCGATATC
GGCAAACGAC TTCGGCACGA AAATCGCAAA ATGGTGCTGC CAGTACTCCA AACAACCAAG
GCACACCTGT TGGCGTCCCA TCGTCGGTCG CGCTCGTCGG GGACATTGGA GGACATTGCT
CAACAAGCGG AAGAATTGAA TCAAAGGCAG GGCACGCCCT TGGGCACTCC GCAAAGTCTT
CATCGACTTG CGAAGGATTT TCCGAAAGAA GGCAATCCCA ACCTTCACAC CGACGAGGAG
CAGGCGATCG AAGTCAGTTA CGGTGCCCAA GAGAAAAAAG AGCTCATGTG GGTGTAATTG
GACTGAAAAT AACAGCTCCT TCTTGGGTAA ACGCTATGCT AACAGCATTT TGGCAAC
 
Protein sequence
MLHSIAEQRQ KRELPPLFWY MELGEWEKAS DRVRRHPREV KTWATLRTKN NTDEPAPLPS 
SASIASYATK ASSSSGTKRL ALHHSCFKLR TAGSCPVASK ASEDPYVQVC RFILMLLRLY
PEAAGQRESR HGCLPLHLAC FASCAPRADD DTLHNNSNHS RNATSPSAIA KAPVARPNMI
ARRIASDATS ATTDTNLSAV HAEETYTGNM ADKQIRRDHT VHVDPNVSVS TKKHLLISSK
REEMAVQVLN ALLDAYPKAI RTDSEGGRLP LHTACAGRAT PRVIATLVTA YPSAARHRNK
DGFLPLHLTA HWGVAHPNVV VTLLKAYPDA TVGRNRWERT PLEEALCMAG ENGRPHQAAM
VRALRKHPSY WQRATAEIIQ GTRRLRQPGS NVVDVDESLP SNDSTSLEEQ RQGHFAHGHN
PLVDQVEQEH SKKPAGKLSP EAAKKKPMDH KLDELIRQHD WDAVIRRVET NPLEVETELA
VMTRGGFLSC SGVTPLYYAC ERQPPVAVVQ ALIHAHPLAV LTRAMPGGSL PLHVACTWHA
SPDIIWALLA ADQGAAKVTD ELGNVALHSA LFSGADVRVI QALVQADPEA VLSRNHQGSR
PADIGKRLRH ENRKMVLPVL QTTKAHLLAS HRRSRSSGTL EDIAQQAEEL NQRQGTPLGT
PQSLHRLAKD FPKEGNPNLH TDEEQAIEVS YGAQEKKELM WV