Gene PHATRDRAFT_38156 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_38156 
Symbol 
ID7202976 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011683 
Strand
Start bp209043 
End bp212377 
Gene Length3335 bp 
Protein Length1023 aa 
Translation table 
GC content59% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182178 
Protein GI219123743 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00847311 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCATATC GTCGCTTTTG TCTTACCAAT GGCACAGCAG ACGTCGACTC GACTGGCGGA 
TCTCGGTCCC GTTTCGTTTG GAACTACGGA AGCAAATATG TTGATCCAAC TCCTAGTAAG
GACAACGTCT CACCAGAAGT TCAATCGCCT ACTGCTAAAC GCCAAGTCGC TTCCGTCCCA
CACACAACGA ATCACAGCCC CACAGTCCCT ACGAATATCC AAGAGTACAA TGTAGACTCA
TAATCTTCGT AGTTCATCGT TTTCCTCAAC TGTGATCTTC ATCGTCGTTT TGTGTTCTTC
AATAGAACTA CGAAGCAACC CCACCGTCTT TACGCTCCCA GCCCTGCCCT ACCGCCCATC
CGCCATCTCT TCGGTCCCGA TGTCGACCTC TGCTCACTTC AAACTGAGCG ACTTTCCTCA
CAAAGTCCTG GATCCGATTG CCACCCTCAC CGTCCCACCG ACTTACGCGA CCATCAAACA
TGCCCAACGT CAGCTCATGA CCAACGCAGC CGCCATCCCC ACGCTCAACG GTGGCGGCGC
CCACGGCCAC ATGGCCTTAA CCCTCACCGC TCTCGCGTAC GCCGACATCA GCGACGTCCC
GTTCGTCATT CCCGTCGCTC CCCCTGCCAA TCCGCCCCCC GGCGCCACGC AACCCCAAAT
CACCGAGAAT AACCGCGTTC ACCAACGCGA CGCTGACATT TACAACCTTT ATGTCGCTGT
TAACAACGCT CTCCGCCAAC AGCTTCTCGA TGCGATTCCC CGCATCTACG TACGCGCCCT
CGCGCATCCC ATGTTCGAGT TCAGCAACGT CACGTGCCTT GACTTGCTTT CGCACCTCTG
GACCAAATAC GGTACAATCA AGCCCGCTGA GCTCCAGAAA AATTTCCAGT CAATGTTCAC
CCCGTGGAAT ACAACAGAAC CGATTGAATC CGTTTTTCTC CAGCTCGACG AGGCCATCGC
CTTCTCCGTC GACGGTAACG ACCCCATCTC CGAAGCTGCC GCCGTACGAG CCGGCTATGA
AGTCATCGCG CACTCTGGCC TGCTCCTCCT CGACTGCAAA GAATGGCGCA AATTACCCCT
TGCTTCTCAC ACCCTTGCCA ACTTTCAGCA GCACTTTTCC CTTGCCGACG ACGACCGGCG
CCTTACGGCC ACCACTGGTT CCCTCGGCTA TGCCAACGTT CTCGCTGCAA CCCCCTCTCT
GACTCCAGCC ACGGTTTCCG ACACCCTCAG CCTTCCCTTC TCCGCGCTCT CTGTGTCACA
GACTTCCGTC TCCTCTCCGG ATATGACCTA TTGCTGGACC CATGGGACCA GCAAGAACCG
ACGCCATACG AGCGCCACGT GCAAGAACAA GGCCCCTGGC CATCGCGACG ACGCGACCGC
CACCAACACT CTCGGCGGAT CCACCAAGGT TTGGACCGCT CCCAAGCCCC CTGAATAGGA
AAGAGGGACG GCTACGCCGA TGGTTAACTC TAGTAATACC GATTATTTAA ATCATATTAC
TAGTCTTAAT TCATCTGTAG CCCCCTCCCC GCCTAGTTCC CATACCTCGG CCATTGCCGA
CACCGGTTGC ACCGGCCATT ACATCACCGT CAACTGCCCC CACACCCACA AACTTCCTGC
ACGCCCCAGC CTTGCCGTCC GTGTCCCTAA CGGCGCCGTC CTCCGCTCAA GCCACATTGC
CACCCTGGCC CTCCCTGGCT TCTCCCCTTC TGCTTGCCAG GCCCACATCT TCCCCGGGCT
TACCTCGCAC CCACTCATTT CGATTGGACA ACTTTGTGAC GACGGCTGCA CTGCCACTTT
CTCAGCCACT CGCCTCGAGA TCCACCGCGA CACTACACTA CTCCTCTCCG GCACTCGTGC
ACCCACTACC GGCCTCTGGC ACCTTGATCT TACCCCTGCC AAGCCTCCTG CCACAGCCCA
CGCTCTAGTT CCCAACACTC CCCTCGCTGA CCGCATCGCT TTTGTTCATG CCTCGCTCTT
CTCCCCGGCG ATCTCCACAT GGTGCCAGGC CCTCGACTCC GGCCATCTTG CAACCTTTCC
TGAACTTTCC TCCCGCCAGG TCCGCAAGTA TCCACCTCGT TCCCCCGCCA TGGTCAAGGG
CCACCTCGAC CAACAACGCG CAAACCTTCG ATCCACCAAG CTTCCCCCTG TCGGTTCCCC
CATCACGACG GCACCCCCTG CCGCCGCTGT GCCCGACCTT GACCCTCCCG ACGCCCACCC
CGTCACACGC ACGCACCATG TCTTTGCTGC TCACCAGCGC GTCACCGGCC AAATATACAC
GGACCAACCT GGCCGTTTCC TCACTCCTTC AAGTTCAGGC CACAACGACA TGCTTGTTCT
TTATGATTAC GACAGCAACG CTATCCACGT CGAACTCATG AAGAACAAGT CCGGCCCCGA
GATTCTGGCC GCTTATAAAC GCGCTCATGC TCTTTTCACC CAGCGAGGCC TCCGTCCCCA
ACTCCAGCGG CTTGACAACG AAGCCTCTGC CGCCCTCCAG TCCTTCATGA CCTCAGAGCA
CGTTGACTTT CAGCTGGCAC CCCCCCATCT ACACCGTCGT AATGCAGCCG AACGGGCCAT
CCGCACCTTC AAGAACCACT TTATCGCTGG CCTATGCACC ACTAACCCGG ATTTTCCATT
GCACCTTTGG GACCGCCTCC TCCCACAGGC CCTTATCACC CTCAATCTTC TTCGTCGCTC
CCGCATCAAT CCTAAGCTGT CCGCCCACGC CCAGCTTCAT GGTGCTTTCG ACTACAACCG
CACCCCGCTT GCTCCACCTG GCACTCGCGT CTTAGTTCAT GTCAAGCCGT CCGTCCGCGA
AACTTGGGCC CCCCATGCTG TTGAAGGTTG GTACCTTGGC CCCGCCCTGC ACCATTACCG
TTGCCACCGC GTCTGGGTCA CAGAAACACG TGCCGAACGC GTTGCTGACA CCCTTTCCTG
GTTCCCGACC CGCATTCCCA TGCCCGCAGC TTCGTCCACC GACCGCGCCC TGGCCGCCGC
CCGCGACCTA GTCCATGCCC TCCAGAATCC TTCCCCTTCG TCTCCGTTCG CCCCCCTCGA
TGCCACCCAG CACCAGGCAC TCACAGATCT TGCCACCCTC TTTGCCACCG TGGCCACCCC
GACCGACGAT CCCCCTGCCC CCGCAACTCC CCTTGCTCAG GTCCGTTTTG CCGTTCCTCT
TGTCACGGCC GAACATGCCC CGGCACTTCC GAGGGTGCCC ATTCCGGCCC CAGCACTTCC
GAGGGTGCCC ACCATGGCCA CCTATCACTC TCGCACCGGT AACCCAGGCC GTCGCCGCCG
CAAAGCACGC AAACAACCGG CAACCCCAAC CCTAG
 
Protein sequence
MAYRRFCLTN GTADVDSTGG SRSRFVWNYG SKYVDPTPKL RSNPTVFTLP ALPYRPSAIS 
SVPMSTSAHF KLSDFPHKVL DPIATLTVPP TYATIKHAQR QLMTNAAAIP TLNGGGAHGH
MALTLTALAY ADISDVPFVI PVAPPANPPP GATQPQITEN NRVHQRDADI YNLYVAVNNA
LRQQLLDAIP RIYVRALAHP MFEFSNVTCL DLLSHLWTKY GTIKPAELQK NFQSMFTPWN
TTEPIESVFL QLDEAIAFSV DGNDPISEAA AVRAGYEVIA HSGLLLLDCK EWRKLPLASH
TLANFQQHFS LADDDRRLTA TTGSLGYANV LAATPSLTPA TVSDTLSLPF SALSVSQTSV
SSPDMTYCWT HGTSKNRRHT SATCKNKAPG HRDDATATNT LGGSTKERGT ATPMVNSSNT
DYLNHITSLN SSVAPSPPSS HTSAIADTGC TGHYITVNCP HTHKLPARPS LAVRVPNGAV
LRSSHIATLA LPGFSPSACQ AHIFPGLTSH PLISIGQLCD DGCTATFSAT RLEIHRDTTL
LLSGTRAPTT GLWHLDLTPA KPPATAHALV PNTPLADRIA FVHASLFSPA ISTWCQALDS
GHLATFPELS SRQVRKYPPR SPAMVKGHLD QQRANLRSTK LPPVGSPITT APPAAAVPDL
DPPDAHPVTR THHVFAAHQR VTGQIYTDQP GRFLTPSSSG HNDMLVLYDY DSNAIHVELM
KNKSGPEILA AYKRAHALFT QRGLRPQLQR LDNEASAALQ SFMTSEHVDF QLAPPHLHRR
NAAERAIRTF KNHFIAGLCT TNPDFPLHLW DRLLPQALIT LNLLRRSRIN PKLSAHAQLH
GAFDYNRTPL APPGTRVLVH VKPSVRETWA PHAVEGWYLG PALHHYRCHR VWVTETRAER
VADTLSWFPT RIPMPAASST DRALAAARDL VHALQNPSPS SPFAPLDATQ HQALTDLATL
FATVATPTDD PPAPATPLAQ VRFAVPLVTA EHAPALPRVP IPAPALPRAV AAAKHANNRQ
PQP