Gene PHATRDRAFT_39147 details


Gene Information

Locus tag: PHATRDRAFT_39147
Symbol:
ID: 7194887
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011686
Strand:
Start bp: 444278
End bp: 445441
Gene Length: 1164 bp
Protein Length: 387 aa
Translation table:
GC content: 53%
IMG OID:
Product: predicted protein
Protein accession: XP_002183097
Protein GI: 219125669
COG category:
COG ID:
TIGRFAM ID:
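
The length fields above follow from the coordinates by simple arithmetic. The sketch below is illustrative only and is not part of the record: it assumes the Start bp and End bp values are 1-based and inclusive (an assumption that is consistent with the reported gene length), and that the unspliced CDS ends in a single stop codon. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(444278, 445441)
print(length)                     # 1164
print(protein_length_aa(length))  # 387
```

Both results match the Gene Length and Protein Length values listed in the record.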


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGACGC CCACGAGGCA GCGTCGCAAG TTACGGCCTC TCATCATTAC CATGGGAGGC 
CCGCGACGGG AAAGCTTGGA AGCCTTGTTC GCCGAACCCG CCATGGCCGC AAATTTTGAA
CCTCCCATAT TTTCCCCCGG CGTACCCAGC CGCAGTTTGC GTTCGCGGTA TCAGTTTTTG
TCTCAGGCGT ACCGAGCGGG ACTCTTGCCG GAAGCCGAGT GGGAAGCCGT GCGGGACCAC
GATTGCGCGC CGGACGAAGG CGACACTTCG ACCGATGCGT TTTTTGCAGG TCTCGGCGAC
GTGCCGGTGA CGACGGGGCG ACGAGGTAGT GCAGCCGACA TCCGCTTGCA CTACTCTAGG
GAGTTGTGGC AAAAAGCCAA GGGTATCAAT CGAGGTCGGG CGGTGTTGGG TTGCACCTTT
GCACATCTAA TTGCTTTGCG AGTACTGGTA GATCAAGAAC TGGACTTTGT ATTGGAAGAC
AATGTCCGTG TCCCCCTTAC TTCGTGTGCC GATCGAATTT GGGAGCTGCT CGAGGCTACC
TCGAATCGAA AGTGCCACCA TCGGTACTAC GGCTGGTTAG GTTCCGTGCC TAATTTGCGT
TGGATTTACG ATTTTCACGC TCCCAGGTTC TCGCATGCAT CGGACATCTT CGAGCACTTC
GCAGCTTTTC CCTTTCCCAG TAACGAGGAT ATTGGAAACG ACCTCACCGC AAAGGAAGCC
AATAGCCAAA GCGAGATCAA TGAGAGGGAT AGTGAGACCG ACCATAGGCA GCTTGACGAA
CGCAAACCCG GAGGAAATCC AGTTTGGGGT TGTTACGCCT ACTGGATCTC GAAAGAAGCG
TTTGCCGAGC TAATGGAGAC ATTGCGCAAC GACGTGGGAG CTATGCTGTG GAAAACGAAA
CGTGCCCGCC ATTACATAGT CAAGCCCATC GATAAGATTC TTCCGCGACT AGTTATGCGA
ACGTATGGAC AAGAAGCCGT CCTGCTACCC TCTCATCCAG CGTTTTTCCG AGCCCCAATG
TTGACCAGTA AAATCCATAC AAAGTGGGAC GCTGAATTCT GTAAAAGTAC AAAATTCCAA
CTAGAGCATT CTGGTTTAAG TTGGTCCGAT TTGTGGCTCA CGGCAATGGA AAAGGCAGTA
GTAGCATATC ACGAGCAAGA GTGA
 
Protein sequence
MSTPTRQRRK LRPLIITMGG PRRESLEALF AEPAMAANFE PPIFSPGVPS RSLRSRYQFL 
SQAYRAGLLP EAEWEAVRDH DCAPDEGDTS TDAFFAGLGD VPVTTGRRGS AADIRLHYSR
ELWQKAKGIN RGRAVLGCTF AHLIALRVLV DQELDFVLED NVRVPLTSCA DRIWELLEAT
SNRKCHHRYY GWLGSVPNLR WIYDFHAPRF SHASDIFEHF AAFPFPSNED IGNDLTAKEA
NSQSEINERD SETDHRQLDE RKPGGNPVWG CYAYWISKEA FAELMETLRN DVGAMLWKTK
RARHYIVKPI DKILPRLVMR TYGQEAVLLP SHPAFFRAPM LTSKIHTKWD AEFCKSTKFQ
LEHSGLSWSD LWLTAMEKAV VAYHEQE
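
The two sequence blocks above can be cross-checked against each other and against the GC content field. The following is a minimal sketch, not the database's own method. The record leaves the Translation table field empty, so the standard genetic code (NCBI translation table 1) is assumed here. The sketch also assumes the gene sequence is shown in coding orientation, as its leading ATG suggests. The helper names clean, gc_percent, and translate are hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered over T, C, A, G.
# Assumption: the record does not state which table applies.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-letter block layout."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks and final line of the gene sequence shown above.
print(translate("ATGTCGACGC CCACGAGGCA GCGTCGCAAG"))  # MSTPTRQRRK
print(translate("GTAGCATATC ACGAGCAAGA GTGA"))        # VAYHEQE
```

The two fragments translate to the first ten and last seven residues of the protein sequence listed above. Passing the full gene sequence to gc_percent and translate would allow the same comparison against the GC content and Protein Length fields.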