Gene PHATRDRAFT_37297 details


Gene Information

Locus tag: PHATRDRAFT_37297
Symbol:
ID: 7201944
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011680
Strand:
Start bp: 656588
End bp: 658673
Gene Length: 2086 bp
Protein Length: 527 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002181417
Protein GI: 219122154
COG category:
COG ID:
TIGRFAM ID:
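
The gene length listed above matches the start and end coordinates if both are read as 1-based and inclusive (end - start + 1). The following is a minimal sketch of that arithmetic; the coordinate convention is an assumption inferred from the numbers in this record, and the function name `span_length` is illustrative only.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Start bp and End bp as listed in this record.
print(span_length(656588, 658673))  # 2086
```

Because the gene is marked as spliced, this span covers the whole gene region, not only the coding bases that translate into the 527 aa protein.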


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0209176
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCAGTGA GGGATCCTAG ACTTAGAATT GGAGGGAAGG TGACGGCAAA GGCTTGTCAT 
GTTGTGCATC TGAGCGAGTG CGCACGGAGA TATGGCGTCA ACAAGCACTC CAAGCGGCTT
GTTGGAACGG TTCTAGACGT CACGACCACC CCTGTATCCA TTACAACCGG GCGTACCTCT
ACTTTGATAA CAGCAGTTTA TGATTTTGGA GAGAGTTTGT TCAAGGAAAA AACACTGAAC
ATTCGGAGTG TAAAGGCATT TGTACCGCCA GAAGATGAAG GAATGTCCTT AATTGAGGAA
TTAGCAGCAG AGGCTTTGCA GGCAGCAGAA GCAGACATGG AAGCCGGAAA CTTGATGGAA
GAAAGTGTCG AAGCCCCGGT AGCCGAAATG GTTGAAACCC CGGCTGACAT AGAGCCCGAT
ACCTTGGTCG ACACAGAGCC CAATACCCCG GTTGACACAG AGCCCGATAG CCCGGTAGCC
GAAATTGTCG AGACCCCGGT TGACACTGAT ACCTCGGTTG ACACAGAGTC CGAAAACCCG
GTAGCCACAG TGCACCAAAC AGAGTGGTAT GTGAATGAAA GAAAAACCCG GCTGGATGTG
AATGGCCATG TCTATGTTAG GCACTTCCAT ATCCGTACTT CAGTTGGTGA CCTTATTGGT
CAAGACTCTG ACAATGGGGT GAGATTTTCG CGCCTCGAAT ATTTTCTGCT CATGTTTCCG
CCGACCCAGC TGACTACTAT GTGTCGGCTT ACAAATACTA TGCTTGCACA GCAAAACAAG
AATCCAATCA CAACCGGAGA ACTTCTTCGG TTCTTTGGAA TGCTCATACT CACTACAAAG
TTTGAGTTCA GTAGCCGGGC CCAACTATGG TCCACCACTG CACCCTCCAA GTACATTCCT
GCCCCTTCAT TTGGACGCAC AGGAATGTCC CGGCAACGGT TTGACAATAT CTGGAAATAT
ATCCGTTGGA GTGAACAATG TCCAGTCCGA CCCGATGGTA TGAGCACTCA TGTTCACCGA
TGGCAACTTG TTGACAACTT TGTCACAAGG TTCAATGAGC ATCGTAGCGA AAACTTTGTA
CCTTCCCATC TGATTTGTGT GGATGAATCT ATCTCAAGAT GGTATGGGCA GGGTGGGATT
GGATAAACCA TGGTCTACCA AATTATATTG CAATTGATTG AAAGCCTGAG AATGGGTGCA
AGATTCAAAA TGGTTTTGTG TGGACAATCC GGTATTATGC TTCGATTGAA ACTTGTAAAG
GGAAAGACGA TAACTGACGA CGAAGAGGGT GACGAGGAGG ATGAGTATCT ACCGCATGGT
GCAAAAATTA TCAAAGAACT TGTTTGTCCT TGGTGGGGGA GTGATCGGAT TGTGTGTGCT
GATTCTTATT TTGCCTCCGT TGTGACAGCT GTCAAGCTTA AGAGGATTGG CTTGAGATTC
ATTGGGGTTG TGAAGTCGGC AACGAGAAGA TATCCAATGG CCTACCTTTC ACAGTTGGAA
ATGACAAGTA GAGGAGAATG GAAAGGATTG GTGACAGACA GAATCTCGGA TGGAAGTTGT
GACCTGATGG CTTTTGTATG GGTGGACCGA GACCGTCGAT ATTTTATATC AACAGCATCC
AATCTGAATA GAGGCTGGAG TCCAGTTTGC TACCGGTGGA GACAGGTGGA TACATCACCT
GATGCAGACC CTGAGAGGGT GGAGATCAAT ATTGCGCAAC CAGTTGCAGC AGAAGTGTAT
TATTCTTGCT GTGCAATGAT TGACAGACAC AACCGGAGTC GGCAGGATAC ACTGATGCTT
GAAAGAAAAC TTGGCACATG GGATTGGTCG ACACGAGTCA ACTTATCAAT TTTTGGTATC
ATTGTTGTGG GCACATGGTT AGCCTACAGC CAATGTACAG GAATAGGAAA GTCTGCTGGA
CGAGAAGAAA AGCAGAAGGA TTTCTACAGT GCCTTAGCCG AGGAGCTGGT GGACAACCAG
TACGATAGTG TTGGAAGTCG GAAAGTTGGG AGGGGTGAGT TGGACAAGGA TAGCCCAACC
ATCTCCAGAA CTGGAGAGCC GCGATGTGGT CTCTCCGCAC ATCTAA
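
The GC content reported in the gene information can be checked against the gene sequence once the grouping spaces and line breaks are removed. The following is a minimal sketch, assuming GC content is defined as the fraction of G and C bases over all bases; the helper name `gc_fraction` is illustrative and is not part of the source database.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G/C bases in a sequence; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First two 10-base blocks of the gene sequence, used only as sample input.
sample = "ATGCCAGTGA GGGATCCTAG"
print(f"GC fraction of sample: {gc_fraction(sample):.2f}")
```

Passing the full gene sequence text, with its spaces and newlines intact, to the same function gives the value to compare with the listed percentage.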
 
Protein sequence
MPVRDPRLRI GGKVTAKACH VVHLSECARR YGVNKHSKRL VGTVLDVTTT PVSITTGRTS 
TLITAVYDFG ESLFKEKTLN IRSVKAFVPP EDEGMSLIEE LAAEALQAAE ADMEAGNLME
ESVEAPVAEM VETPADIEPD TLVDTEPNTP VDTEPDSPVA EIVETPVDTD TSVDTESENP
VATVHQTECK TRIQSQPENF FGSLECSYSL QSLSSVAGPN YGPPLHPPST FLPLHLDAQE
CPGNGLTISG NISVGVNNVQ SDPMGKTITD DEEGDEEDEY LPHGAKIIKE LVCPWWGSDR
IVCADSYFAS VVTAVKLKRI GLRFIGVVKS ATRRYPMAYL SQLEMTSRGE WKGLVTDRIS
DGSCDLMAFV WVDRDRRYFI STASNLNRGW SPVCYRWRQV DTSPDADPER VEINIAQPVA
AEVYYSCCAM IDRHNRSRQD TLMLERKLGT WDWSTRVNLS IFGIIVVGTW LAYSQCTGIG
KSAGREEKQK DFYSALAEEL VDNQYDSVGS RKVGRELESR DVVSPHI