Gene PHATRDRAFT_11843 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_11843 
Symbol 
ID7200130 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011674 
Strand
Start bp655304 
End bp656902 
Gene Length1599 bp 
Protein Length533 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179481 
Protein GI219117373 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGGTC CCAAATCGGA TACGCTTTTG CTTCTGCTAG CGACTGCATT AATTACGCCA 
GTCTGCAAGC GTCTCGGAAC GAGTCCAATC CTCGGTTTTC TTGCTTCGGG CATGCTTCTA
GGACCTAATG GGTGTGGACT CATTTCGGGT ATTCACACCA CGGAAACCCT GGCCGAACTC
GGCATCGTTT TCTTCTTGTT CGAAATGGGG ATTGAGCTTA GCTTTGACCG TCTCTTGTCC
ATGCGAAAAG ATGTGTTCGG GTTGGGTTTT TTGCAATTTT CCATCACTGC CGTTGCCGTC
GCGCTTGTGG GCAAATTGGC TGGCTTGCCG GCCAACGCTT TGGTCGTACT TGGTGGAGGG
CTCGCATTGA GCTCGTCCGC TTTTGTGTTA CAATTACTCA AAGATAAAAA TCAGCTCGCC
ACGCGGTTTG GCAAGGCAGC TTTTGGTATA TTACTTTTTC AAGATTTAGC CGTTGTGCCA
TTGCTCGTGG TGACACCAAT TTTGGCCGGA TCGGGACAAG GATTGGCGTC GGCAGTTGGT
TCAGCCGTTG TTAAGGCCGC TATGGCTTTG GGGTCGATCG CAGTTGCTGG TCGTTTTGTA
TTGAATCCAC TCTTCAAGAC CGTGGCTCAG GCACAGTCAC AAGAAGCCTT TCTCGGGGTT
GTGCTACTGA CCGTGCTGTC AATGAGCTTT ATGACGGAAG GTTTGGGATT GAGCAATACA
CTGGGAGCTT TTTTGGCAGG GGTCCTGTTG TCGGAAACCA AATATCGATA CCAGGTAGAA
GCTGACATTG CACCGTTTCG AGGAATTCTG CTGGGATTCT TTTTTGTGAC GGTCGGTTTC
GAGATTGACT TGGCCTTGAT TTGGTCACAG CTCCCCCTGG TTGGTAGCTT AGTGATGGGC
ATCATACTTA TCAAGGCCGC TATAACGACC GCCTTGTCGC TAGCGTTCGG GTTGAGTCTA
TCGACCAGTC AACAAACAGG GTTGATTCTT AGTCAAGGTG GAGAATTTGC TTTCGTTGCC
TTCGGTCTAG CACGGTCCTT GGGGATTTTA GACGTGGCGA CAACCAAGCT CCTCTTGACA
AGTGTGAGTT TGACCATGGC ATTGACCCCT GCAATGGCCA CAATCGGTGC CAAAGTAGCC
AAGCGCCTCG AAGAATCTAG TGATTTTACA CATTATTTGG GACAGGATCG TGACGCCAAC
GAGATTAAGG AGAGTGACGA CTTTGCGGTT GTGATTGGAT ATGGGGTTGT GGGTAAAGTT
GTTTGCGACC TGCTTGATCG GAAATTCATT AAGTACGTCG GTTTGGATAT TGATCCGAAC
AAAGCGATTC AAGCGAGAAA TGCTGGTCTT CCCGTATTCT ACGGAGACAT TGGTAGACAA
GAAGTTGCCG AAGCCTTCAA CGTTGGTAAA GCGAAAGCCG TGATTGTATG CATTGCAAAT
CGGGCGCAAG CAAATCGGTG TGTGATTGCT TTACGACGAT GGTACCCTAA TCTGAAAATA
TTTGCCCGCG CAGCAGACGC AGATCACGCC AATCGTCTGC AGACTACACT CAATGTTGCA
GCCATGGTAC CCATTCTACC GGAGGACAAT TTATTACTG
 
Protein sequence
MSGPKSDTLL LLLATALITP VCKRLGTSPI LGFLASGMLL GPNGCGLISG IHTTETLAEL 
GIVFFLFEMG IELSFDRLLS MRKDVFGLGF LQFSITAVAV ALVGKLAGLP ANALVVLGGG
LALSSSAFVL QLLKDKNQLA TRFGKAAFGI LLFQDLAVVP LLVVTPILAG SGQGLASAVG
SAVVKAAMAL GSIAVAGRFV LNPLFKTVAQ AQSQEAFLGV VLLTVLSMSF MTEGLGLSNT
LGAFLAGVLL SETKYRYQVE ADIAPFRGIL LGFFFVTVGF EIDLALIWSQ LPLVGSLVMG
IILIKAAITT ALSLAFGLSL STSQQTGLIL SQGGEFAFVA FGLARSLGIL DVATTKLLLT
SVSLTMALTP AMATIGAKVA KRLEESSDFT HYLGQDRDAN EIKESDDFAV VIGYGVVGKV
VCDLLDRKFI KYVGLDIDPN KAIQARNAGL PVFYGDIGRQ EVAEAFNVGK AKAVIVCIAN
RAQANRCVIA LRRWYPNLKI FARAADADHA NRLQTTLNVA AMVPILPEDN LLL