Gene PHATRDRAFT_42437 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_42437 
Symbol 
ID7196642 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp49681 
End bp51216 
Gene Length1536 bp 
Protein Length428 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177006 
Protein GI219110509 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.138953 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGGGTC TTGTTCCCAA ACCTTCCGCC AAAGATAGAC ATTGCAAAGG GTTCCAGTGT 
GCCGCTCTTC TTCTTTACAC TTTGCGCACT TGTAGGATCG GCCAATTTGT ACCTACGCGA
AACATGGATG ACACTACGAT GCATCGAGCG GCCAGCCAGG ATTCGACAGA CATACGCTCT
CCGCGGTTAA AGGAAACGTT TCCCTCAGTC GCCAACGAGG GAAGCCCTTC ACTAGCCGAA
CATGCATCAA CCACTAAGCC GCTGACATTT CCTAAACCAA CCCATAGTCG CGCGGATAAT
TGGATGATTG TCTCGTACCG ACGGACACGA TTCAGCAAGT TTTGCAAGGT TGATGTGCAT
CCCGAAGACT ATTCCAAGCG CAATAATTTG CCCAGGTGTC GCCCAAGTCT CGTTGACCCC
TTACCAAGTA CTCCGCTTTC GGATTGCGAT ACGTGTGATG CTGACAGCAA TTTGCATCGA
CAGAAGGACT CTTTACCGGA CCGCGTCGAT GTGATCGTAG AGGTAAGAAG CCTCTCGCCC
ATGCGTCGTG TTGGACTTTG TTCAAGAAGT CCATTCTCAT GCCAGATATA AATTCTTTGG
GTTTTAATTC ATCCTAGGCG ACTGCACTTT GTGATCAAGA TTTGCATATA CGGCGCAATG
CTTCCGAAAT ACATAAATGG TGGCCCTGCG CGTTCGGTAG CAGCTTTGTC GGACGCGCCC
AGGCCTGCGG CCCCGAAGCG AAAGAAGCCG GGATCAAACC AGGATCGCGC GTGGCTGTAA
TAGCCAAAAG TGGACCTATT GCCCGCTACG TTCCCGCTCG CGCACAGGAT TTAGTGACTG
TACCGAAAGA ATTGGATGCT GCCGACATTG CCTGCCTAAT TGCTACATAC TTACCAGCCT
TCCAAGCTTT GCATCACGGA AGGATCCGCC CCTACCGGTA TTCTCGAACA TGTTTCAAAG
GAAGGAGGAT TTTGGTCACT GGTGGAGCTT CACCGGAAGG ACTGGCAGTC GTTCGATTGG
CTCAATTGGC GGGAGCCAAG GACATTTTTG TAACGGCCCC GAGAGCGCAC TTTGATGTTA
TCAAAGCGCA GCGTGCAATA CCGGTCGACG ACAATGCAGA GGCATGGTTG GATCAGCTCG
AAGGGCGTAT CGACATTGCC ATCGATTTGA ATTTTCCAAG AAATTTTGTC TTTGTACGTC
AGTCTTTGGC ACGGAAGGGG AGATTGGTAT GTCGCCCAAT ACATCATTGT AGCAGCGCTC
CTGAGCAACA CTGCATGACA ATGACTAAAA ATCTATTTGG TCGCTTCCGT TTGTGTATGA
TGAAACGTGC AACAATCTTT GATTTTACGG AAAACCTGGA ATTGCATCGG AATGAAACAG
GAACAGGACT TTGGGTTTCT CCTACGTATG CTAGCGCTTC GAAAGATTCG CCCACACATT
GATTGCTTCA TCCGACTGGG TGATGTCCCT GAAACCCTCC TAGACCTACG TGCCAAGCTA
TCAACTGGCA CCATCATTTG TGAACCTTGG AAATAG
 
Protein sequence
MMGLVPKPSA KDRHCKGFQC AALLLYTLRT CRIGQFVPTR NMDDTTMHRA ASQDSTDIRS 
PRLKETFPSV ANEGSPSLAE HASTTKPLTF PKPTHSRADN WMIVSYRRTR FSKFCKVDVH
PEDYSKRNNL PRCRPSLVDP LPSTPLSDCD TCDADSNLHR QKDSLPDRVD VIVEATALCD
QDLHIRRNAS EIHKWWPCAF GSSFVGRAQA CGPEAKEAGI KPGSRVAVIA KSGPIARYVP
ARAQDLVTVP KELDAADIAC LIATYLPAFQ ALHHGRIRPY RYSRTCFKGR RILVTGGASP
EGLAVVRLAQ LAGAKDIFVT APRAHFDVIK AQRAIPVDDN AEAWLDQLEG RIDIAIDLNF
PRNFVFVRQS LARKGRLEQD FGFLLRMLAL RKIRPHIDCF IRLGDVPETL LDLRAKLSTG
TIICEPWK