Gene PHATRDRAFT_34367 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_34367 
Symbol 
ID7199782 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011673 
Strand
Start bp398061 
End bp401660 
Gene Length3600 bp 
Protein Length837 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002178993 
Protein GI219116396 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00490098 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGAAT CGACACGCCG CTCTTCTCGC CGCCGCGAGA AAACTGTGTA CAGCGTCGGA 
GATCTTGTCG AGGTGAGTTG GTGGTGCGAT ACTTTTTCAT AAACCTTAGC CGCGCTCTTA
TTTCACTCCA TTTGAATGCA TTCTTCAGGT GACCCGCGAC GAGTCAATTG CTACGGGTAG
ACTTGCATCG AAACAGACCG ACACTGCTAA ACCCCGTTGG CTTGTAAAGT TTGACGAATC
TTCATGGCCA AGCATGGAGC TATTGGAGAC TGAACTTGGG CCTATTCTTG ATAGAAGCGA
CGACAACGCG TCTCAAAAAG AAAAGCAAGT AAGGCAAAAA TCATCGTACG AGTTAGGAAC
CACTTTGGGT GGGCGCGGCT CCCCAAGTAA GCGATCTTCC TCGCCGATGG TATCAGGCAA
GAGCGAAAAC GGAAAGGCGT CCCCGATATC TACGAACAGT TCAGATTCCA AGAAGAAAGT
AGAGTTCATC GCTCTTCAGG AAGAGTCTGA CATGTCTGGG TCCAAGAAGA GACCCGGTTC
TTTATCCAGG GAGGAGCGAA GCAAACGTCG TCAGGCCATG ATTGAGCAGG ACAAACTGAA
TTGGTCGAAG CCAGTTATGT CGCGACCCCC GAAAAAGAAA AAGGCGCAGC GCGATGAAGA
AGTTGTTCGG GTACCAATGC TTACGGGTAC ACTTTTGCTA TATCGCGGTG CTCATCGCCG
GGCCGAATTT GTGCGTAAGT TTTGAATGAC ACTGAGTTAA AAGCTTACAT ACGCAGATCA
TATGGTAAAT TTGATTTGTA GGCGGATTTA GATCAACGAA ACATATTTTA CATATGATGG
CAATTGAATA GTAGTGTGCT AGAGACGCTA TTCCTTAGAC TTTATATTCG CGAATATAGC
GCATTCGGAT AGGCGCGTTG CAATAGAAGT ATTGGAGACA AAGAAACTGC AGTAACGTTC
TTAACTCCCA TGTCTTGTCA CGAGCGCTGT GTTCGGAGGT TTTCCTCGGA AGAATGTCGA
TGCGAAGGCT GTAGAGGAGG AGTGTGTGAG ACGTTCGTTC ACAGTCATGT CAAGGAAATA
TGTGAAACGA AAACACAACA TTTTTCAGTT TTTTGCATTC ATCATTTTTC TAAAGGACGT
TTTAAAGCGT TAATTTGAAA AACTGTCGAG ACATATACAT AGAAGAAATC CCAAGGAAGA
GCTAGGTAAA GTAAAGACTT TTCACTTTTA CATTCTTTCG ATGTCGTATC TTGTAGTTAC
AGATAGTGAT ATCAACCCAC CTGCGAGGCG AGTCGCTTTC ATCAGTTTTG GCGAAAATAG
CGGCGGAGTG GCAGTGATTG TGCCGTCTGT TTTAAATTTC GCCCACATGA ATCTACAGAA
GTAAGCTTGA GACTTGGCAT GCATGCCCGT TTCCGGACCA TTCAACAAGT GCTTTATTTC
GATTTTTTGA CTTATGGTAC TAAGCAACAG CAAGGACGAA CACGGAGACT GGGTGGCATT
CCGTGACACG CCAGCTTCTT TCGAAGCCAG GAGTGTAAAC GAAAAGAGAA GTGTACTCAA
TGTTGCTTCC AAAAATATCT CGCGCAGAAA CTCCGGTACA GGTTCTTGGA TGGGAAAACC
CATCAATGCT GATGCATCGT GGGTCCCGGT GGCCTCCCCA GCACCAGGAA CGTATTCGCC
TCGTCCGTCC AGTCCACAAG GTTCTCGCAT ATTGTTTAGA GAGTTCGTGG GGAATCGCAA
GAAAGAGCGT TCTGATATTT TGACCCCATC TCAACCATTT ATGTGTCGAA GCAGCGCAAC
ACAGTCATCC AGCTTACAAG CTTGCAAGAG TGACATCTTG TCAGGAAAGA AACTCAAGAA
AGGTAGATCG GGCTCGTCCC CAGTGCCTCG TATTGTAATC CCTCTGAAGC CGGACCCTTC
GATACTTCGT ACCCGTACAC GTAGTACCAC AGTGTCAAGC GGGGAAGACG AAGATCAGAA
TCGATGTTCC CAAGCTCAAA ATGGTATTAG GGCTGCTAGT GCACATGAAA GGCGAGGACG
GTCCAAAAGC CGCAGTCGGC ATTCGCGATC ACCGTCGCCT AGAACTAGGT CGAAGTCAGT
TTCTGGGAAT AGTCAATCAA ACAACCGACG TGGGCGATCC AGTAGCCGCC CACCTGTAAC
ACGGCCGAAC ACTGTGCGTG AGTCTCGCGT TAAATCTCGA TCAAGCAGCC TGACTAGGAT
CACAAATCAC AGGGGACCGA CAGTGGTTCC ATCACCTTCA GCAAGGTCAG TAATTAACTT
TCCCCATGCA TCTCTGTCGA CCACGTGTCA TCGTAGAGAA AATGGCAGTA CCGGACCAGG
TATGCTCCCC TGTTCCAAAC AGAGACGGGA CCCTAAGATC GGACGAGATA TCAGCTTTGG
AACGGTGCAT TTATCTGACT CATCATCGGT ATCAATCAGA AGTGAAAAGA GTGGACTGTT
CGAGAAAGTT TTTGGCTTCC CAGGCGGACA AGCTTTGCAG AAACCTCAAG TAAAGCACTC
CATTTCGACA CGACCCCGTA TTCTTCTAGC TGCAACAGTG TACCACAATA CAGCGACTGG
TCTATGGATC ACAACAATCA ATACAAATCA ACGAGGAGTA TCAAAAAATC CTGCACAAGC
GAATAAATTT CTAAAAGCAT TCTCGTTTCC TACAGAAAAG GAAGCTCGAG AGTCAGCTAT
CGCAAACGCC CCACCGAAAA TGGTCTCCTT TCAAGAGTCA GCTAAATGCT TCCATTGCAG
GAAACTTTTC GCAGTTTTCA AGCGCGCCTG TCATTGTCGA AACTGCGGTG TGTGTATTTG
TGCCAGCTGC TCAATATCTT GGCCTGCTAA GATGCTTCCA GAAACATACA ACCTGAAAAA
TGAAGCTTCC TTGAAAGTTT GTACAAGTTG TGATACTCTT AGTTCTCTCT TCAAAAAAGC
GCTCTTGGAA GCGAAATATG AAGAAGCGAT AGCAATATAT GAGACTGGTA ACGTCAACCT
GCGTACTCCT TTTCCTCCTG CTAACAAAAA GGATGAAGTT CTTTATCCCA TCCATGCTGC
TATTGAGGGC GGCAACCTTA AGCTTGCGCG TTGGCTTGTC GAAGACCGCT TCTGCCCTCT
AAAGCAAATC AGAGCCGGGC GATCGAAGTC AGATAAAAAC GCACTTATTC AGACGTCGAA
AGGGCGAACT GTCTTGAGTA TTGCTATGGA GTTCCTTCGC ATCGGTATTC TACGATTTTT
GGTTGTTGAA AGAGGAATTT CCGTATTCGA AGCTACGGAT ACAAGAAGCG CCCTTCGGAC
TATTGAGGCG GCCTTGGTTG CTCTGCCCTG CTCTTCAGAA GGAGATGGAA TTCGAGAAGA
CGGGGCTTCC ATAGCGCGGT GGGACCAAGC CTACTTCGAC GATATGTCGG AACCGAGTAG
CCTCGGAGAC GATGATAATG TCACAATTGT AAGCCGATCG GTTCGAACAA GAACGAACAC
GGGCGACTGT TGCATAATTT GCATGGATCA CAAAATTAAT TGTGTTGCGA CTCCCTGCGG
ACATCAGGTA TGCTGTTTGG GTTGCAGTGC GAGCCTTTCG GCATGCCCAG TTTGCAATAA
 
Protein sequence
MTESTRRSSR RREKTVYSVG DLVEVTRDES IATGRLASKQ TDTAKPRWLV KFDESSWPSM 
ELLETELGPI LDRSDDNASQ KEKQVRQKSS YELGTTLGGR GSPSKRSSSP MVSGKSENGK
ASPISTNSSD SKKKVEFIAL QEESDMSGSK KRPGSLSREE RSKRRQAMIE QDKLNWSKPV
MSRPPKKKKA QRDEEVVRVP MLTGTLLLYR GAHRRAEFVL TDSDINPPAR RVAFISFGEN
SGGVAVIVPS VLNFAHMNLQ NKDEHGDWVA FRDTPASFEA RSVNEKRSVL NVASKNISRR
NSGTGSWMGK PINADASWVP VASPAPGTYS PRPSSPQGSR ILFREFVGNR KKERSDILTP
SQPFMCRSSA TQSSSLQACK SDILSGKKLK KGRSGSSPVP RIVIPLKPDP SILRTRTRST
TGTDSGSITF SKRRDPKIGR DISFGTVHLS DSSSVSIRSE KSGLFEKVFG FPGGQALQKP
QVKHSISTRP RILLAATVYH NTATGLWITT INTNQRGVSK NPAQANKFLK AFSFPTEKEA
RESAIANAPP KMVSFQESAK CFHCRKLFAV FKRACHCRNC GVCICASCSI SWPAKMLPET
YNLKNEASLK VCTSCDTLSS LFKKALLEAK YEEAIAIYET GNVNLRTPFP PANKKDEVLY
PIHAAIEGGN LKLARWLVED RFCPLKQIRA GRSKSDKNAL IQTSKGRTVL SIAMEFLRIG
ILRFLVVERG ISVFEATDTR SALRTIEAAL VALPCSSEGD GIREDGASIA RWDQAYFDDM
SEPSSLGDDD NVTIVSRSVR TRTNTGDCCI ICMDHKINCV ATPCGHQCEP FGMPSLQ