Gene PHATRDRAFT_55097 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_55097 
Symbol 
ID7198341 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011692 
Strand
Start bp167199 
End bp169349 
Gene Length2151 bp 
Protein Length454 aa 
Translation table 
GC content46% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184577 
Protein GI219128768 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGCTCACCAG AACCTGGAGC TCTCACCAAT AATGCTTTCA GATAGACAAC TTTACTGTTA 
AGGCTACGTA CTGCCCGGTC CGGTCCGGTT GCGAATTCCT ATACGTGCCG AGAACACAAC
TCTAGTAGTA GGGCCATTTT TCCTAGACAG CTTTCACAAG AAACTGGAGC TCTCACCAAT
GCTGCTTTCG GATAAATAAC TTCACTGTTA AGGCTATTTA CCGCCCATTC CGATTGCGAA
TTCCCTATAC GTGCCTCAAA CCGTGGATCT TTTGAAGGTG CAGATGCCAG TTCACTGTTA
GCGTGTATGA TTTGGTTCGC TGTTAACGCA AGCAGTTTCT ATACCAGTGG TCGGTGAAGT
TTTCTTGTTT GCAAACCACT TTTCACTTCT TGACACACTT TCCGCATCCA AACTTTTGAG
CTGTCGACCA AGATCATGAA ATCTCTCAGC CTTGTCGCTA TCTTGGCCGC AGGAGTATTC
GTCCCCGGCT ACGACGCGCT CAAGCCTTCA AAGTGTGGCG GAAAGCTTAC GAGTCCTTGC
CTCAGTGCAT CCGATACAAG GTACGATGCA AACTTTCCCA AATCAATCAC TCTGCAAAAT
CCGGCATGGA AGCAATTCGA GGGCTTGTGG AAGACGACCT CAATCAATTT CCAAGGAAAC
GGAATCGTGG CGCAGCCTCA ACCACATATT CCTGCATTGA AATACGCGAC TCTTCCGTAC
ACTCTGAATG AGGTTGTAAC CTTCTACAAT CATACCATTG TCGGATCGAG AATGTCGCTT
TATGCATATT TTTTCTATTC CCCGGCCCCA GAATCATTCT GCAACCAAAC ATTCAATCCG
CCTTTCGAAA ATGTTATTGG GTCAGGAGTT TGCGGTGTGA ATGGATTTAC CACTGCCGTT
GCCCAGTTTG GGACAAGTAC CCACGAAAAT CAAGGTGAGC AAAAACAGTA TGCAAGCTTT
GGGTTCCCTG GCGCTACAAG TCCCTCTCAC ATGTCCCATA TTTAACTTGC TGATCAGGCG
ACGTTGATTT TTTTCGTCTT CGGACTTCTG CTGCTCTCGG TCCGGTCACA ATTGATTTCG
ATTCAGGACT ATTCACTTGG ATTGATTCCA ACTCTTTGCT TGCAACGAAT ACACTTGATG
GCCTTTTCAG TCAAAGCAAT CCATACACCT TCCTTGACAA CAGCTCAGCC TTTGTCAACT
TCAATGTAAT TGATCTTGTA AGGAGGACTA GAGACACCAA TGCCCTTGCT CAGATGACTC
GAATGGAGGA GAGTGAGTGG CTCGCGGCTA TCGAGGAAGC GTACCAAGAT GTCAACATTG
CAGCTGCAGA CAAAATCCCT GTTCCCTTCC AGACTTCTTC TTCGGATCCA GAATGGTATC
CAACAGAAGA TGAATGGTGC GGTGGAGTTG GTAACGACCC AGAGTGCACT GTATCCCCAT
ATCAAGAACC AGATGCCAAG TTAAAATCTA GTGCTTTGGT AGGGTTCGTT ATCCTTGGCC
TCGCTGTGTT CTGCATTCCT TTGTATGCAC TATACCGATA CCGAATTGGC CAACAGGAAC
GCCGCATCAA GGACAAATTC ATTCGAGGTA TTGCCAAAAA CATGTCCATT GCTCCTAGCG
CTGGGGCGAT ATCCCGCGAC AAATTGGTGG AAGAATTCCA GCGCATTGAT AAAGACAAGG
GCGGGACCAT TGAGAAAGCC GGTAAGTCAC GGATAATGAA CAACTTTGAC GATATTCCCT
GTGACAAAAG TAGGATGCAA CTTACCCAGT ACTCTCTACA GAACTCAAGG ACTGGATAGA
CGAGGGGAAG TTGGGAACAA TTTCAGACGC TGATTTTAAT GCATTGTGGA GTGCTTTGGA
TAGGGACGGT TCCGGTAATA TTGATTTCAT GGAGTTCTGC ACTTTCCTCA GTGGTTGCAG
CGAGGCGTTC GACAATGTTT ACGACGAGCA GCAGAAAATG TGAGTTTCCT CTCTTAGGGA
GGGACGCCTT CTCTTGTCAA CTTTTATCAA TGGTGAATCA AAGGACCATC AACTCTAGGT
CTTCATTAGG AGGGTAAAAC GAAGAAGAGC AATTTATGTA TGAGTGAACA TGGAAGGATG
TCAAAAAAAC CTAATTTAAA GTCTGTAATA ACGAAATTGA GAACTAGTAG T
 
Protein sequence
MKSLSLVAIL AAGVFVPGYD ALKPSKCGGK LTSPCLSASD TRYDANFPKS ITLQNPAWKQ 
FEGLWKTTSI NFQGNGIVAQ PQPHIPALKY ATLPYTLNEV VTFYNHTIVG SRMSLYAYFF
YSPAPESFCN QTFNPPFENV IGSGVCGVNG FTTAVAQFGT STHENQGDVD FFRLRTSAAL
GPVTIDFDSG LFTWIDSNSL LATNTLDGLF SQSNPYTFLD NSSAFVNFNV IDLVRRTRDT
NALAQMTRME ESEWLAAIEE AYQDVNIAAA DKIPVPFQTS SSDPEWYPTE DEWCGGVGND
PECTVSPYQE PDAKLKSSAL VGFVILGLAV FCIPLYALYR YRIGQQERRI KDKFIRGIAK
NMSIAPSAGA ISRDKLVEEF QRIDKDKGGT IEKAELKDWI DEGKLGTISD ADFNALWSAL
DRDGSGNIDF MEFCTFLSGC SEAFDNVYDE QQKM