Gene PHATRDRAFT_41220 details


Gene Information

Locus tag: PHATRDRAFT_41220
Symbol:
ID: 7199051
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011696
Strand:
Start bp: 236009
End bp: 237271
Gene Length: 1263 bp
Protein Length: 420 aa
Translation table:
GC content: 49%
IMG OID:
Product: predicted protein
Protein accession: XP_002185238
Protein GI: 219130156
COG category:
COG ID:
TIGRFAM ID:
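
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that the gene length counts the terminal stop codon (the gene sequence in this record ends in TGA). The helper names span_length and protein_length are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 236009
end_bp = 237271
gene_length_bp = 1263
protein_length_aa = 420


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


# Both comparisons hold for the values listed in this record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions the span 236009 to 237271 covers 1263 bp, which is 421 codons, or 420 residues plus the stop codon.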


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.132681
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGGTTAC GCTGTGTGTA TGTGTGCTCT TCACTTTTGG CGCTAGGTTC CAGTTTTTCG 
GGCGCCTCCT TTATAGGGTC GCTCCAAGCC ACGAAGGAAA TTTCGCGCCA TCCTTTCAAA
TACACCAAGA CATTTCTCCC CATGGTCGGA GGCGGTGGTT TAAGTGATGG TGCAGGATCT
GAGCTTACCA ACACACTAGC TCGTTTGGAT CAGCAGTGGA AGATTCAGCA AAAGTCAAAG
CCTACTTCTC GTTGGTCGAA AATTATTTTG GATCGTGACA CACAGGAAGT CTCCGAGGAA
CCACCAGAAA CGTATGTTCC TCCCCTACAA GAGAGGCAAG ATTTCGTATA CTTGCTAGAA
CCACCCAGTA AGTCGAACCC TTCTTGTGTA ATCTTTTTTG TTGGCGGTGC CGGCCTAGGA
CAATTCCCCC AAATAGCCTA CAACGAATTC TTGTTGCGTC TTTCGGACCG GCTGAACGCT
GCGGTGATTG CGGCGCCTTA CGCTGTGGGA TTGGACCACT TTGGACTGGC GAAAAGCGTC
GGGGAACTTA TGCGCAAGGC AAAACTTCAC TGTGAAGAGG ACTCGTCAAA ACTGTATCCG
AAAACTTTGC CAACCTATTG CATTGCGCAT TCATTGGGGT GCAAGTTGTC CAGCATCTAC
ATGGCAGCGA CAGAGCAAAC GTATGATGGC ATTGGTTTTA TGAGTTTCAA CAATTTTGGA
TTTAGCCAAA CCATCGGTAT GGCCAAAACA TTTGCCGATC AACTGCAAAA AAATATTGGT
ATCGGCCGTG GTATTCGACC TGAAGTGCTG GATCAGGTAT TTTCATTCGC AGAAATGGCG
GTGGGTTCGA TTGGGTTGGA CTTCACTCCG AACCCCATGG AGACAGAGAG GTTACTAACG
TTGAAGTATG ATGAAGAACA GCAGGAACGT ACGCGCCTGT TTGTTTTCGA TGACGACATG
TTGGATTCGA CGCAGAACTT TGTGCAAGCT TGCAACGGGG CAGGTCCCGA TGTGTCGGGT
TTGCCAGGGT CGCATTTGAC ACCCGTCTAT TTCAAGTTGG GCCTCGATGA ACTACCTGAC
GAAGTGCGAG GCGTCGCTAA GGAGGCGTCA GGCGGGTTGG AATCCGCATC ATTTGGAAAT
GAGGAAGAAC TCAACGCTTT GGTGACCGAA GTCAGTGGCT GGATTTTGGG AAAAGGTCCC
TCGAGAAAGC CTTTGTGGCA AACCGAGCGA CCAACAATTT CTGGTTCGGC AGAAGATCAG
TGA
 
Protein sequence
MRLRCVYVCS SLLALGSSFS GASFIGSLQA TKEISRHPFK YTKTFLPMVG GGGLSDGAGS 
ELTNTLARLD QQWKIQQKSK PTSRWSKIIL DRDTQEVSEE PPETYVPPLQ ERQDFVYLLE
PPSKSNPSCV IFFVGGAGLG QFPQIAYNEF LLRLSDRLNA AVIAAPYAVG LDHFGLAKSV
GELMRKAKLH CEEDSSKLYP KTLPTYCIAH SLGCKLSSIY MAATEQTYDG IGFMSFNNFG
FSQTIGMAKT FADQLQKNIG IGRGIRPEVL DQVFSFAEMA VGSIGLDFTP NPMETERLLT
LKYDEEQQER TRLFVFDDDM LDSTQNFVQA CNGAGPDVSG LPGSHLTPVY FKLGLDELPD
EVRGVAKEAS GGLESASFGN EEELNALVTE VSGWILGKGP SRKPLWQTER PTISGSAEDQ