Gene PHATRDRAFT_47901 details


Gene Information

Locus tag: PHATRDRAFT_47901
Symbol:
ID: 7203108
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011683
Strand:
Start bp: 389421
End bp: 391384
Gene Length: 1964 bp
Protein Length: 538 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002182384
Protein GI: 219124172
COG category:
COG ID:
TIGRFAM ID:
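
The listed gene length is consistent with inclusive start and end coordinates, that is, End bp - Start bp + 1. A minimal sketch of that cross-check in Python follows; the function name `span_length` is an illustrative assumption and is not part of the record or of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates copied from the record above.
gene_length = span_length(389421, 391384)
print(gene_length)  # 1964, matching the listed gene length
```

The same check applies to any feature whose coordinates are reported inclusively; a half-open convention would need the `+ 1` removed.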


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGTCAC AATCTGCGAT CTCAAAAGTG ACATGACCAT TTACGTTTCG TTTCGCTTTA 
TTTGTATTTA CAGTTAGTCT TTTACCACTT ACATTATTAC GGTTAGGAAA AATGCTCTCT
GGCAGTCAGC TCCGCTTCGA AGCCTCTTGT CCAGCGTTCG CCTTGAATTT CCGAAATGTG
GGCGTCTACC TTTGGTCATT CTTTCTCGAC CGCTTAGCCG TGATGGTGAT TCTTTGAGTG
GTCGTCCACC AGCGCTACCC CAGAATTCTC GTCATGAGAA GAACATATGC CGACCGTTCC
GCTATGCATC GGACAATCCG CACTTCACTG TCGAGAACTC GGCCAAATTT AAAGGCGCCA
AAAAAGATGC CTTATTTCGT TTTTGTTATG CTTTTCATGC TCCTATTCTT CTTTCAACAA
TGGGCGAGTC GAGCTGCTCG ACGATCGCAC CAAATACCGT TAGAGAACTT CGTCGAAGAA
GTCGCTACTC ATGACGGTTT GATCCCCTCA AAAGAAGGAC CATCCCAGAA ATTACCTCAA
TGGATTCAGA GTTATCTTCG TTGGCATCAA TCTGTTCGAG CTCAGTTTCC AGGTGACCTT
CTTTTTACGG ATCCCGCCGC TCCAAATTTG CTGCTGAGGA CCTGTTTGGG ACTTTGCGGA
GGACTGAATG ATCGACTTGG TCAGCTTCCG TGGGATTTGT ACTTGGCAAA CCAAACAAAC
CGGATTCTAT TGTTGTATTG GCATCGACCG GTACCCCTCG AATCTTTTCT CATTCCGAAT
GAACTAGACT GGACCGTCCC GAAAACGCGC CCGGGATTCT TTCCATCTCC CGGATCCAGA
ATCGTTTCTC GCGGAGACAT GGTCTTGGCT CGAGATATTC CTGAGCTATT TGCAGACTTC
AACTCGGAAC AGCCAACGGA CCAGTTTTGG AGCACTCATA TGGATGTGGC AATAAATAGA
GCTACCGCCG GTCACTACCG GAACCATAAG GTCCTCCGGC ACCGTTTGTT GGGCCATTTG
AACGAAGATC AGCTGGAAGT AAGGCTCCGC ACGCTTGGCG AAACTGACAT GATTCACTGG
ACTGAATCAT TCGGCAATAT TTTCCGAATG TTTTTCCGTC CCGCTGCCGC TATCCAGGGG
GAATTGAACC GGGTATTCAG TGATCTACAA ATCACTGCAG GATCTTATTC TGCCGTTCAC
TGCCGAGTGC GACATCCTAA AGCCTCGCCT GCCCATGTTT TCGTGAAAGG AAAGAATGAT
GCTTATCCAG CAGACAAAGC AGGACTACCG TGGATAGGAG AAACACAAGC TTTCGCAATT
GCCACTGCTA CGAAGGCGCT GAAATGTGCC CGCCAGGCAG CACAAAACCT TTCTGAACCG
ACGTACTTTT TATCGGATTC TAATGATTTG GTTCGATATA TAGCGCACGA GTTGACAAGT
TCCAAATTCG TTTCCGCCAA TGCTACAATA CTCCACGCTG ACCCTGTTCA CAGTTCGGCG
CTCCAAACTG TGGACTCCAT GCGTATCGTA GCCAGGGAAT CATCTCTGGA AAACGCCCAC
ATCGACCTCC AGAAAGGACG GGAACCTGCG GCGTACTATG CTACATTTGT GGATCTGTTG
CTCGCCGTCA ACGCACGATG TGTGACGTAC GGTATTGGCT ACTATGCAGT CCTCGCGACC
AAGATTTCTG GCACGAAATG TAAGAACCTG TACCAAGAAG AAGCGTGGGG AGGCAGCGAA
AACAAACGAA ACAATACACA TGTATGTCGC CTCTAAAGAA GAATGCCTCT TACAATTTCC
ATTGTTGTCC ACACTGATGT GGCTGCTCCA TCTAGGCATC CAAGTCTGCT GAAGTTTGGT
TACATATGCT GATGCGGCAG CAGTTTGGGC GTAAAATCAG GAGGGCGACG GATTGGAGTA
TGCGGGCTGC TGATACAGAC GCTAGTCTTA AGGTCGGTAT TTGA
 
Protein sequence
MRRTYADRSA MHRTIRTSLS RTRPNLKAPK KMPYFVFVML FMLLFFFQQW ASRAARRSHQ 
IPLENFVEEV ATHDGLIPSK EGPSQKLPQW IQSYLRWHQS VRAQFPGDLL FTDPAAPNLL
LRTCLGLCGG LNDRLGQLPW DLYLANQTNR ILLLYWHRPV PLESFLIPNE LDWTVPKTRP
GFFPSPGSRI VSRGDMVLAR DIPELFADFN SEQPTDQFWS THMDVAINRA TAGHYRNHKV
LRHRLLGHLN EDQLEVRLRT LGETDMIHWT ESFGNIFRMF FRPAAAIQGE LNRVFSDLQI
TAGSYSAVHC RVRHPKASPA HVFVKGKNDA YPADKAGLPW IGETQAFAIA TATKALKCAR
QAAQNLSEPT YFLSDSNDLV RYIAHELTSS KFVSANATIL HADPVHSSAL QTVDSMRIVA
RESSLENAHI DLQKGREPAA YYATFVDLLL AVNARCVTYG IGYYAVLATK ISGTKCKNLY
QEEAWGGSEN KRNNTHASKS AEVWLHMLMR QQFGRKIRRA TDWSMRAADT DASLKVGI
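
Both sequences are printed in groups of 10 characters, 60 per line, so the spaces and line breaks have to be removed before counting residues or computing base composition. The sketch below shows one way to do that in Python. The helper names `clean_sequence` and `gc_fraction` are illustrative assumptions, and the example uses only the first line of the gene sequence rather than the whole record.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence in this record, used as a small example.
first_line = "ATGACGTCAC AATCTGCGAT CTCAAAAGTG ACATGACCAT TTACGTTTCG TTTCGCTTTA"
dna = clean_sequence(first_line)
print(len(dna))  # 60 bases on a full line
print(round(gc_fraction(dna), 2))
```

Applied to the full gene sequence, `len` can be compared with the listed gene length of 1964 bp and `gc_fraction` with the listed GC content of 48%; the protein block can be cleaned the same way and compared with the listed 538 aa.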