Gene PHATRDRAFT_49571 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49571 
Symbol 
ID7198190 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011691 
Strand
Start bp76296 
End bp79396 
Gene Length3101 bp 
Protein Length1000 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184300 
Protein GI219128187 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.822968 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGAAGTTTGT TTTCGTTCTC TTGCTGTTTT CCGTTACACT TGCTTTCCTA CGCCAGTCTT 
CGTGATCTGT CGTTCCATTC TGAGTACAAG CCCATAAGAT GCGCGGTATC TTCTTCGCAG
CATCCACGCT GGTTCTCGGC ATCGCTTCCT TTCGGGCGAC CGATGCCCAG GACGTCAGCT
GCCAGTTGGT TCGTCAACCC AATGGCAGCG TTGAATACGT ATGTGACCAA GTAAGGGAGC
CAATGCGGCC TCCCCTCGTC CCAGACGGCC AAACCTATCA ACAAGGCCCG GTAGAAAAGT
ACTACCTTCA ACAACCGGCG GAAGATGTGA ATGCATCGGC AAACGCTACC ATTCCAATCT
TACGTCGTGT AGGCGTCCCG CCATTTGATA CTCCTCTACA GGCTTGCCAA GGAAGCTGTG
GAAGTGACGC AGATTGCGCG ACTGGACTCG TGTGTCTTGA TCAGTTGGAT CGGCCGCGAG
ATGGAAGCGT CCAAGGATGT TCCGGTAACG GTCGCCTGAC TATGAGCGTG TGTGTGGTAC
CGGGTTCGGA AGTTGATCTC AAGATTGGAT CGTTGTCGTT AGAAAAGTGC CAGGGGACCT
GTTCTTCAGA CGATGATTGT GCTGGTGGCC TCGCTTGTTT CCGGCGCCAA GGAACGGAAA
CCGTACCAGG TTGTCTCGAA CGTGATGTGA GCGAGTCTAA CTTCTGCCAT GATCCCAACG
AAACCTTGGC AAGATTAGCT TTCGTGGGTG CGGCCCCTCA CTCACAGTCA TCGCTACTTC
CTACATGCTC TGGATCCTGT TCATCAGATT GGGACTGCGT TCGCGGCAGT AATTGCTTTC
GTCGAGATGG CACGGAATCC GTTCCTGGAT GTGAAGGAAG AGGACTCAGT GGAGCCAATT
ATTGTTACAT TGCTCCGGAG GGGTCTTTGC TTCTGGTCGG CGATGGTGTA ACATACATAC
AATATCCATT GAGACAATGC GAAGGACATT GTATTGCCGA TATCGATTGC GAAAGTGGTT
TGAAATGCTT TCGACGCGAA GGTTCGGAAC ATATTCCTGG TTGTGACGGA GAGGGCGCGG
ATCGAACCAA CTACTGTTAC CAACCATTTC CGGATACTTC CGGTCCAACG GATGGCCCTA
CACTGGCACC GCTTTCAGTC TCCCCATCCA ATTCTCCTTC TTTAATCCTA ACGCTTGATC
CCACCAATAG CTTGGGATCT CGAGGTGGCG AATCCGAGTC CCCTTCGACA CCGACATCAG
ATATGCCGTC GATATCACCA TCAGATGAAC CTTCAATGAT ACCTTCTGAT ACGCCGTCAA
TGGTACCGTC TGACGTGCCT TCGATGCTGC CGTCCGACGC ACCCTCGATG ATTCCATCTG
ATAGTCCCTC AGACACGCCA TCTGATGTTC CTTCGGACGT GCCTTCGGTG TTGCCCTCTT
TTTCACCTTC TATGAGGCCT TCTGATACTC CTTCGACAAT GCCATCGGAT ACTCCTACCG
ACGTCCCCTC AGATGCCCCC TCTGATTCAC CGTCAGATGT TCCCTCCGAC GTTCCATCAG
ATGCACCTTC TGATGTTCCC TCCGACGTCC CTTCAGATGC ACCTTCTGAC GTCCCTTCAG
ATACCCCCTC TGATGTTCCC TCTGACGTCC CTTCAGATAC CCCTTCTGAT GTCCCCTCTG
ACGTCCCTTC AGATACACCT TCTGCCGTAC CCTCTGATGT CCCTTCAGGT AACCCTTCTG
ACGTCCCTTC AGAAACCCCC TCTGACGTCC CTTCAGATAC CCCTTCTGAT GTTCCCTCTG
ACGTTCCCTC TGACGTTCCC TCTGATGTCC CCTCCGACGT CCCTTCAGAT GTGCCCTCTG
ACGTCCCTTC AGATACCCCT TCTGATGTTC CCTCTGATGT CCCTTCTGAT ACTCCCTCCG
ATCTCCCATC CGACGTGCCG TCTGATGTAC CGTCCACGAT GCCGTCTTCT ACGTTAGGTG
GTACCTCATC TAATGGGCCA GTAACGGGAA AGGGCAGTCT AGCACCAACC GGTAGCTTTT
CGGGTGAGTC GAGTGGAAGG CCTTCTAAGA CTCCTGGTGT GATGGTTGAA TTGAGCGCAA
GGCCAAGCGA GTCTTTTACC CCATCTCCGA CTACTTTCCC AGTACTCCTT CCTTCAAAAG
TACCGTCAAT AGCTCCATCT CGCACTGAGA CCGTTGCTCC GAGCACAGCC TTCCCCACTG
AGACTTCGTC ATCACTGCCC ACTTTGACTA CGACTGTGTC TTCAGCTCCG ACACAAATGT
GCAGTATGTC CGCATCAGAG CGCGCAAGCA CCATCATGGG AATTTTGGAA GAGGACTTTA
ACGATGTGAG CAGTCCTCAG TTCCGGGCGG TGGAGTGGTT GGTGCAAATA GATCCTCTTT
CGCTTTGCCC AGGAGATGAG AACTTGGAGC AACGATACAT TTTGGCTGTG CTGTATTTCC
AAACCGGTGG AGAATGGTGG ACACGATGTT CACCTATGAG TGCTGAAGTT TGCGACCAGG
GTGAAGCTTT TTTGAGTGGC GCAAATGAAT GTGCCTGGGG TGGAGTCAAT TGCGATTCTT
CGAGTCGTGT GACGGCTCTT CACTTGGATT CAAACAACTT ATCGGGTAGT TTGCCCAGCG
AGTTGGGTCG TTTGGCATAT TTGGTCGAAC TGGACATGGA CGACAACGAG CTGACGGGTT
CCATACCGCG GATTCTGGGA CAGCTTTCTT TTTTGGAAAT TGTTGACCTG GACGACAACC
AATTGACGGG AAGCATCCCG GAAGAATTGT ACAGTGTCAG CTCGCTGGAG ATTTTGGATC
TGGATATTAA CCAATTGACA GGAACTATTT CTACCCTCAT TGGCAATCTG GTAAATCTGT
ACTATTTGCA GATTGACTCG AACAAATTTA CGGGCAGCAT ACCGTCCGAG GTGGGCACTC
TGACTCGTCT CGAATACTTC TCCATGACCG ATATTCAAAT AGCCGAGGCT CTACCCGACT
CGTTATGCAG CCGCGATACC TTGCTTTTGT TCGGAGATTG TGAGGTTTGT GTGGTAGAAG
ACTGCTGCAC CGCTTGTCTC GCCAAGGAGA CAACTCCGTA A
 
Protein sequence
MRGIFFAAST LVLGIASFRA TDAQDVSCQL VRQPNGSVEY VCDQVREPMR PPLVPDGQTY 
QQGPVEKYYL QQPAEDVNAS ANATIPILRR VGVPPFDTPL QACQGSCGSD ADCATGLVCL
DQLDRPRDGS VQGCSGNGRL TMSVCVVPGS EVDLKIGSLS LEKCQGTCSS DDDCAGGLAC
FRRQGTETVP GCLERDVSES NFCHDPNETL ARLAFVGAAP HSQSSLLPTC SGSCSSDWDC
VRGSNCFRRD GTESVPGCEG RGLSGANYCY IAPEGSLLLV GDGVTYIQYP LRQCEGHCIA
DIDCESGLKC FRREGSEHIP GCDGEGADRT NYCYQPFPDT SGPTDGPTLA PLSVSPSNSP
SLILTLDPTN SLGSRGGESE SPSTPTSDMP SISPSDEPSM IPSDTPSMVP SDVPSMLPSD
APSMIPSDSP SDTPSDVPSD VPSVLPSFSP SMRPSDTPST MPSDTPTDVP SDAPSDSPSD
VPSDVPSDAP SDVPSDVPSD APSDVPSDTP SDVPSDVPSD TPSDVPSDVP SDTPSAVPSD
VPSGNPSDVP SETPSDVPSD TPSDVPSDVP SDVPSDVPSD VPSDVPSDVP SDTPSDVPSD
VPSDTPSDLP SDVPSDVPST MPSSTLGGTS SNGPVTGKGS LAPTGSFSGE SSGRPSKTPG
VMVELSARPS ESFTPSPTTF PVLLPSKVPS IAPSRTETVA PSTAFPTETS SSLPTLTTTV
SSAPTQMCSM SASERASTIM GILEEDFNDV SSPQFRAVEW LVQIDPLSLC PGDENLEQRY
ILAVLYFQTG GEWWTRCSPM SAEVCDQGEA FLSGANECAW GGVNCDSSSR VTALHLDSNN
LSGSLPSELG RLAYLVELDM DDNELTGSIP RILGQLSFLE IVDLDDNQLT GSIPEELYSV
SSLEILDLDI NQLTGTISTL IGNLVNLYYL QIDSNKFTGS IPSEVGTLTR LEYFSMTDIQ
IAEALPDSLC SRDTLLLFGD CEVCVVEDCC TACLAKETTP