Gene PHATRDRAFT_34362 details

Gene Information

Locus tag: PHATRDRAFT_34362
Symbol:
ID: 7199779
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011673
Strand:
Start bp: 374349
End bp: 377951
Gene Length: 3603 bp
Protein Length: 837 aa
Translation table:
GC content: 47%
IMG OID:
Product: predicted protein
Protein accession: XP_002178990
Protein GI: 219116390
COG category:
COG ID:
TIGRFAM ID:
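
The length fields above can be cross-checked against the coordinates with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the bases not needed to encode the 837 aa protein plus one stop codon are intronic (the record marks the gene as spliced but does not list exon boundaries). The variable names are hypothetical.

```python
# Values copied from the record above.
start_bp = 374349
end_bp = 377951
protein_length_aa = 837

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Bases needed to encode the protein, plus one stop codon.
coding_bp = (protein_length_aa + 1) * 3

# Assumption: the remainder is intronic, since the gene is marked as spliced.
non_coding_bp = gene_length_bp - coding_bp

print(gene_length_bp)  # 3603, matching the Gene Length field
print(coding_bp)       # 2514
print(non_coding_bp)   # 1089
```

Under these assumptions, roughly 1.1 kb of the 3603 bp gene span would not be part of the coding sequence.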


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000328786
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGGAAT CGACACGCCG CTCTTCTCGC CGCCGCGAGA AAACTGTGTA CAGCGTCGGA 
GATCTTGTCG AGGTGAGTTG GTGGTGCGAT ACTTTTTCAT AAACCTTAGC CGCGCTCTTA
TTTCACTCCA TTTGAATGCA TTCTTCAGGT GACCCGCGAC GAGTCAATTG CTACGGGTAG
ACTTGCATCG AAACAGACCG ACACTGCTAA ACCCCGTTGG CTTGTAAAGT TTGACGAATC
TTCATGGCCA AGCATGGAGC TATTGGAGAC TGAACTTGGG CCTATTCTTG ATAGAAGCGA
CGACAACGCG TCTCAAAAAG AAAAGCAAGT AAGGCAAAAA TCATCGTACG AGTTAGGAAC
CACTTTGGGT GGGCGCGGCT CCCCAAGTAA GCGATCTTCC TCGCCGATGG TATCAGGCAA
GAGCGAAAAC GGAAAGGCGT CCCCGATATC TACGAACAGT TCAGATTCCA AGAAGAAAGT
AGAGTTCATC GCTCTTCAGG AAGAGTCTGA CATGTCTGGG TCCAAGCAGA GACCCGGTTC
TTTATCCAGG GAGGAGCGAA GCAAACGTCG TCAGGCCATG ATTGAGCAGG ACAAACTGAA
TTGGTCGAAG CCAGTTATGT CGCGACCCCC GAAAAAGAAA AAGGCGCAGC GCGATGAAGA
AGTTGTTCGG GTACCAATGC TTACGGGTAC ACTTTTGCTA TATCGCGGTG CTCATCGCCG
GGCCGAATTT GTGCGTAAGT TTTGAATGAC ACTGAGTTAA AAGCTTACAT ACGCAGATCA
TATGGTAAAT TTGATTTGTA GGCGGATTTA GATCAACGAA ACATATTTTA CATATGATGG
CAATTGAATA GTAGTGTGCT AGAGACGCTA TTCCTTAGAC TTTATATTCG CGAATATAGC
GCATTCGGAT AGGCGCGTTG CAATAGAAGT ATTGGAGACA AAGAAACTGC AGTAACGTTC
TTAACTCCCA TGTCTTGTCA CGAGCGCTGT GTTCGGAGGT TTTCCTCGGA AGAATGTCGA
TGCGAAGGCT GTAGAGGAGG AGTGTGTGAG ACGTTCGTTC ACAGTCATGT CAAGGAAGCA
TGTGAAACGA AAACACCACA ACATTTTTCA GTTTTTTGCA TTCATCATTT TTCTAAAGGA
CGTTTTAAAG CGTTAATTTG AAAAACTGTC GAGACATATA CATAGAAGAA ATCCCAAGGA
AGAGCTAGGT AAAGTAAAGA CTTTTCACTT TTACATTCTT TCGATGTCGT ATCTTGTAGT
TACAGATAGT GATATCAACC CACCTGCGAG GCGAGTCGCT TTCACCAGTT TTGGCGAAAA
TAGCGGCGGA GTGGCAGTGA TTGTGCCGTC TGTTTTAAAT TTCGCCCACA TGAATCTACA
GAAGTAAGCT TGAGACTTGG CATGCATGCC CGTTTCCGGA CCATTCAACA AGTGCTTTAT
TTCGATTTTT TGACTTATGG TACTAAGCAA CAGCAAGGAC GAACACGGAG ACTGGGTGGC
ATTCCGTGAC ACGCCAGCTT CTTTCGAAGC CAGGAGTTTA AAGGAAAAGA GAAGTGTACT
CAATGTTGCT TCCAAAAATA TCTCGCGCAG AAACTCCGGT ACAGGTTCTT GGATGGGAAA
ACCCATAAAT GCTGATGCAT CGTGGGTCCC GGTGGCCTCC CCAGCACCAG GAACGTATTC
GCCTCGTCCG TCCGGTCCAC AAGGTTCTCG CATATTGCTT AGAGAGTTCG TGGGGAATCG
CAAGAAAGAG CGGTCTGATA TTTTGACCCC ATCTCAACCA TTTATGTGTC GAAGCAGCGC
AACACAGTCA TCCAGCTTAC AAGCTTGCAA GAGTGACATC TTGTCAGGAA AGAAACTCAA
GAAAGGTAGA TCGGGCTCGT CCCCAGTGCC TCGTATTGTA ATCCCTCTGA AGCCGGACCC
TTCGATACTT CGTACCCGTA CACGTAGTAC CACAGTGTCA AGCGGGGAAG ACGAAGATCA
GAATCGATGT TCCCAAGCTC AAAATGGTAT TAGGGCTGCT AGTGCACATG AAAGGCGAGG
ACGGTCCAAA AGCCGCAGTC GGCATTCGCG ATCACCGTCG CCTAGAACTA GGTCGAAGTC
AGTTTCTGGG AATAGTCAAT CAAACAACCG ACGTGGGCGA TCCAGTAGCC GCCCACCTGT
AACACGGCCG AACACTGTGC GTGAGTCTCG CGTTAAATCT CGATCAAGCA GCCTGACTAG
GATCACAAAT CACAGGGGAC CGACAGTGGT TCCATCACCT TCAGCAAGGT CAGTAATTAA
CTTTCCCCAT GCATCTCTGT CGACCACGTG TCATCGTAGA GAAAATGGCA GTACCGGACC
AGGTATGCTC CCCTGTTCCA AACAGAGACG GGACCCTAAG ATCGGACGAG ATATCAGCTT
TGGAACGGTG CATTTATCTG ACTCATCATC GGTATCAATC AGAAGTGAAA AGAGTGGACT
GTTCGAGAAA GTTTTTGGCT TCCCAGGCGG ACAAGCTTTG CAGAAACCTC AAGTAAAGCA
CTCCATTTCG ACACGACCCC GTATTCTTCT AGCTGCAACA GTGTACCACA ATACAGCGAC
TGGTCTATGG ATCACAACAA TCAATACAAA TCAACGAGGA GTATCAAAAA ATCCTGCACA
AGCGAATAAA TTTCTAAAAG CATTCTCGTT TCCTACAGAA AAGGAAGCTC GAGAGTCAGC
TATCGCAAAC GCCCCACCGA AAATGGTCTC CTTTCAAGAG TCAGCTAAAT GCTTCCATTG
CAGGAAACTT TTCGCAGTTT TCAAGCGCGC CTGTCATTGT CGAAACTGCG GTGTGTGTAT
TTGTGCCAGC TGCTCAATAT CTTGGCCTGC TAAGATGCTT CCAGAAACAT ACAACCTGAA
AAATGAAGCT TCCTTGAAAG TTTGTACAAG TTGTGATACT CTTAGTTCTC TCTTCAAAAA
AGCGCTCTTG GAAGCGAAAT ATGAAGAAGC GATAGCAATA TATGAGACTG GTAACGTCAA
CCTGCGTACT CCTTTTCCTC CTGCTAACAA AAAGGATGAA GTTCTTTATC CCATCCATGC
TGCTATTGAG GGCGGCAACC TTAAGCTTGC GCGTTGGCTT GTCGAAGACC GCTTCTGCCC
TCTAAAGCAA ATCAGAGCCG GGCGATCGAA GTCAGATAAA AACGCACTTA TTCAGACGTC
GAAAGGGCGA ACTGTCTTGA GTATTGCTAT GGAGTTCCTT CGCATCGGTA TTCTACGATT
TTTGGTTGTT GAAAGAGGAA TTTCCGTATT CGAAGCTACG GATACAAGAA GCGCCCTTCG
GACTATTGAG GCGGCCTTGG TTGCTCTGCC CTGCTCTTCA GAAGGAGATG GAATTCGAGA
AGACGGGGCT TCCATAGCGC GGTGGGACCA AGCCTACTTC GACGATATGT CGGAACCGAG
TAGCCTCGGA GACGATGATA ATGTCACAAT TGTAAGCCGA TCGGTTCGAA CAAGAACGAA
CACGGGCGAC TGTTGCATAA TTTGCATGGA TCACAAAATT AATTGTGTTG CGACTCCCTG
CGGACATCAG GTATGCTGTT TGGGTTGCAG TGCGAGCCTT TCGGCATGCC CAGTTTGCAA
TAA
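
The sequence is listed in space-separated blocks of 10 bases, 60 per line, so the whitespace has to be removed before computing anything from it. A minimal sketch of that clean-up and of a GC-fraction calculation follows. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as input, so the result is not expected to reproduce the 47% whole-gene figure.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the listing above.
first_line = "ATGACGGAAT CGACACGCCG CTCTTCTCGC CGCCGCGAGA AAACTGTGTA CAGCGTCGGA"

seq = clean_sequence(first_line)
print(len(seq))                        # number of bases in the excerpt
print(round(gc_fraction(seq) * 100))   # GC percentage of the excerpt only
```

Passing the full listing through `clean_sequence` first would allow the same function to be applied to the whole gene.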
 
Protein sequence
MTESTRRSSR RREKTVYSVG DLVEVTRDES IATGRLASKQ TDTAKPRWLV KFDESSWPSM 
ELLETELGPI LDRSDDNASQ KEKQVRQKSS YELGTTLGGR GSPSKRSSSP MVSGKSENGK
ASPISTNSSD SKKKVEFIAL QEESDMSGSK QRPGSLSREE RSKRRQAMIE QDKLNWSKPV
MSRPPKKKKA QRDEEVVRVP MLTGTLLLYR GAHRRAEFVL TDSDINPPAR RVAFTSFGEN
SGGVAVIVPS VLNFAHMNLQ NKDEHGDWVA FRDTPASFEA RSLKEKRSVL NVASKNISRR
NSGTGSWMGK PINADASWVP VASPAPGTYS PRPSGPQGSR ILLREFVGNR KKERSDILTP
SQPFMCRSSA TQSSSLQACK SDILSGKKLK KGRSGSSPVP RIVIPLKPDP SILRTRTRST
TGTDSGSITF SKRRDPKIGR DISFGTVHLS DSSSVSIRSE KSGLFEKVFG FPGGQALQKP
QVKHSISTRP RILLAATVYH NTATGLWITT INTNQRGVSK NPAQANKFLK AFSFPTEKEA
RESAIANAPP KMVSFQESAK CFHCRKLFAV FKRACHCRNC GVCICASCSI SWPAKMLPET
YNLKNEASLK VCTSCDTLSS LFKKALLEAK YEEAIAIYET GNVNLRTPFP PANKKDEVLY
PIHAAIEGGN LKLARWLVED RFCPLKQIRA GRSKSDKNAL IQTSKGRTVL SIAMEFLRIG
ILRFLVVERG ISVFEATDTR SALRTIEAAL VALPCSSEGD GIREDGASIA RWDQAYFDDM
SEPSSLGDDD NVTIVSRSVR TRTNTGDCCI ICMDHKINCV ATPCGHQCEP FGMPSLQ