Gene PHATRDRAFT_45974 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_45974 
Symbol 
ID7201039 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011676 
Strand
Start bp820811 
End bp822252 
Gene Length1442 bp 
Protein Length452 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180127 
Protein GI219118720 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTTCATACGA GGTAGTATCG TTGGAAGCCC CTAGTGATGG AACAGAATTC AATGCATTCC 
GAAGTGAGTC CGCTTCTGTT CCGTCAAGAA AGCTCCACAC TTGCGGAAAA GCGCCATGGA
AACGCGACGG AAATAGAGAC CGTTCTTAAT TTGATGAAAA CATGTATGGG AACTGGTTGT
CTCGGTCTCG CCTTTGCTTG TCAGCAAGGG GGCATTTTTT TGTATTCAAT AGGTCTCCTA
GTCATTGGAG CTTGGAATAC CTATGCGGTA CAAAGACTAT GTGCCTGTCT GGATTACATC
CCGGGAGATT CCTTTTCTCT TTCAGGAAGT TGTCCCGAAG GTGTCGTAGA GGATACACCA
CCGATCCGGT CGAGAAACCC TCGACCAAAA GCACCGACAC CACCGCCCTC TGGTACTTCT
ACACTGAGCG TCGTCGCATG GCATGCATTT GGTCCAATCG GCTTAGAAGT ATTGGACGCA
ATGATGATCA TCTTACTTTT TGGTATCATT ACTGCCTACG TCGCCGCCGT CATAACATTT
TTATCCGACA CGCCATTTTC GCTGGGACGC TGGATGGACG CCACAATCTC CGCAATATTA
TTCGCTATTA TTGCCATGGT TCCTGATATG GGCCATCTTT CCAACAATTC CGCGATTGGA
CTCTCGATCC TCGCCTTTAC CTTTCTTGTG ATTGCCGGAT ACGGTGACAT GTCACATGAT
CACCAAGCGG TATCTCACTT GCATGGTTGG CCGCAGTCGT TGGCCGGCGC ATCGCAATTT
TTTGGAATTT GCGTTTATGG ATACGGCATT GTACCGTTGA CGTACAACTT TCGGTCCTCC
ATGGCCCAAC CCGAACGTAT GGTGCCGGCA ACGATGCTGG CGGTCCTACT CGTTGCGTCG
GCTTACATTT TTGTTGGGAT TGGTCTGTAT ATCTTGTATC CTGATTTGGA AGGGGAACTT
TTACACGAAC TGCCCCACTA CGGTTTTCTT CCCGTACTGA CGCGACTCGC CATGGTAGTT
GTTGTTTTCA TGACGGTTCC ACTGCTGGTG GTACCGTGTG GGGAATTGCT CGAAGGCAAG
TTTCAAACGG ACCGTCGCGG CCTTGTGCGG TTTTGTGTTT GTGGCATTAG TGCCTTATTG
GCTACGTCTT TGCCAAGTTT TGTGCAAGTC TTGTCCTTGG TTGGCTGTGC CTGTGTTGGT
ATGGTGGGTT TTGTACTACC GCCACTGTTG CATAGTCGTC TTTTGTGGCT GTACCAAAGA
CGTCTTGGTC AAAATCTTTG TTTCGGTGGC CACCATTGTA GGTTTCTTGT TGTGGACGCG
CTTTTGTTGA CTTGGGGCGT GATAGCAACC GTCATCAGCA CTGTCTATAC GCTACGAGAA
GTCAACGCGG TATAGAATCA TGTCTACGGT ACACTACTTT ATGCGCAATT TTATACAAAG
TC
 
Protein sequence
MEQNSMHSEV SPLLFRQESS TLAEKRHGNA TEIETVLNLM KTCMGTGCLG LAFACQQGGI 
FLYSIGLLVI GAWNTYAVQR LCACLDYIPG DSFSLSGSCP EGVVEDTPPI RSRNPRPKAP
TPPPSGTSTL SVVAWHAFGP IGLEVLDAMM IILLFGIITA YVAAVITFLS DTPFSLGRWM
DATISAILFA IIAMVPDMGH LSNNSAIGLS ILAFTFLVIA GYGDMSHDHQ AVSHLHGWPQ
SLAGASQFFG ICVYGYGIVP LTYNFRSSMA QPERMVPATM LAVLLVASAY IFVGIGLYIL
YPDLEGELLH ELPHYGFLPV LTRLAMVVVV FMTVPLLVVP CGELLEGKFQ TDRRGLVRFC
VCGISALLAT SLPSFVQVLS LVGCACVGMV GFVLPPLLHS RLLWLYQRRL GQNLCFGGHH
CRFLVVDALL LTWGVIATVI STVYTLREVN AV