Gene PHATRDRAFT_39921 details


Gene Information

Locus tag: PHATRDRAFT_39921
Symbol:
ID: 7195707
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011689
Strand:
Start bp: 368109
End bp: 369662
Gene Length: 1554 bp
Protein Length: 517 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002183979
Protein GI: 219127515
COG category:
COG ID:
TIGRFAM ID:
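
The listed lengths can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, which the record does not state but which matches the listed gene length, and that the unspliced CDS ends in a single stop codon. The variable names are illustrative only.

```python
# Coordinates copied from the record above.
start_bp = 368109
end_bp = 369662

# Assumption: 1-based, inclusive coordinates, so both end points are counted.
gene_length = end_bp - start_bp + 1

# The gene is listed as unspliced, so the whole span is coding sequence.
# Assumption: the final codon is a stop codon and is not counted as a residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1554
print(protein_length)  # 517
```

Both values agree with the Gene Length and Protein Length fields.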


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTTTGA GAAGAAGAGA GAGGCACGTA GGCCGCAGTC CTCTTGATGG AGCTGGTCGG 
CTGCACTCGG TCCTGCTCGA TGCCGACGAG AAACGACCCC GTCGGCCGTA TTGGCGAAAG
GCGTTACGGC AGCTTATCAT TTTTTGTCTA TCTTTGATGA TTTTTTGGGT TTTCTACAGT
TCTTTACCAT TAGCCAGACA ATCTTCTTTA CTTGACTCGA TTGGGCGGCA ATTTCTTTCC
GAAATGACCT GGTCGATCGT TCGACAAGAA CGTAACGCGA TTGCTGGGTT GGTTGTTCAG
ATTGTACCGA AAGCGACAGT TTTGACCGAT TTTAAACCAA TGGCTCTACC ACTCTTGGCT
CTGCCAATCA CCACAGACGA TGCTGAAGAC TTTGGTGCTT TAAACATAAA TTCGCTGGGT
ACTTCAGATT GGGTTCGTTC GATTGCCGCT GACGAATATG AAAAATATGA AGCGGAACGC
GGTTCTTGGA TGGATCACAT GGCTCAGCTG CAGCCCAATC TTCCCGACAA ACTGTTGTAC
AACGACGACA TCGTGGGAAA GCCTACCTGT CGAAGGAACA ACTGGGCTCG CGTGTCTCAT
CCCACATGTA ATATTTTGCA CGAAACCCGA TTCGACCAAT CCTACGAACC AACGGAACTA
TTTCAAGAAT ACAAAGTCAA GTTTGCGGGC GATGGTGCAT ATCGGAGTGT GTGGATTCTT
GAGCGGCCGG CCGTATCTAC GTTTGCTCTG AAACAGTTTC AGCTAGAAGA ATACGAGTTA
GGTGTTCGTG AGCACTTTCA AGTTCAGAAA GAAGCCTCGA TCCTGGACGC GTTGTCCGAC
AGCCCTCGTA TCATTAATAT CCACGCCCAT TGTGGAGTAT CTCTTTTCAT CGAATCAGCG
GTTGGTACGC TAGAAGCAGA ATTGGCGTCC ACAAACGGAA CGATTGAGTT GCATGAACTC
GGCCAGTTGC AACGATTAGA CGTGCACCCC CTGAACAATT TGACACTGGC AGAAAAGCTG
GACCTCGCCT TAGCTATGGC AGAATCTTTG GCAGATATAC ACGGTTTTGA AGGTGGGGCA
ATCGGCCACG GCGATATACA TCCCTCACAG TGGCTTCAAA TGGCAAATGG CGGCGTCAAA
CTAAATGATT TCAACTCCGC CGAAATATAC GAGTACAATG TTGACGAAGG CGTCTATTGC
AAAACTTATC ACAACTTTCC AGGAGCATTC CGAAGTCCTG AAGAAGTCCA GCATCGCCCC
TCCAACGAAA AGATTGACGT TGTGCCTTTG GGAAACAGCA TTTACGTCCT CGTAACGGGA
CTTTTTCCGT ACTACGAGTT GGGCGACAGT GAAAAGGAAG CAAATCGCAA GGTCAAGCAA
GGAGTCCATC CTTACGTTGA TACGCGCTAC CGCAACCGAT CTGTCGTCGA ACGAGAACTT
ATTGACGTCA TGGAACGTTG CTGGGAATTT GATCCAGATA GCCGAGTGTC TTCATTCGAA
GTTGTGTCAC GATTGAGAAA TCTCAAAGCA ATGGTCGCGG AGAAGCAAAT CTAG
 
Protein sequence
MSLRRRERHV GRSPLDGAGR LHSVLLDADE KRPRRPYWRK ALRQLIIFCL SLMIFWVFYS 
SLPLARQSSL LDSIGRQFLS EMTWSIVRQE RNAIAGLVVQ IVPKATVLTD FKPMALPLLA
LPITTDDAED FGALNINSLG TSDWVRSIAA DEYEKYEAER GSWMDHMAQL QPNLPDKLLY
NDDIVGKPTC RRNNWARVSH PTCNILHETR FDQSYEPTEL FQEYKVKFAG DGAYRSVWIL
ERPAVSTFAL KQFQLEEYEL GVREHFQVQK EASILDALSD SPRIINIHAH CGVSLFIESA
VGTLEAELAS TNGTIELHEL GQLQRLDVHP LNNLTLAEKL DLALAMAESL ADIHGFEGGA
IGHGDIHPSQ WLQMANGGVK LNDFNSAEIY EYNVDEGVYC KTYHNFPGAF RSPEEVQHRP
SNEKIDVVPL GNSIYVLVTG LFPYYELGDS EKEANRKVKQ GVHPYVDTRY RNRSVVEREL
IDVMERCWEF DPDSRVSSFE VVSRLRNLKA MVAEKQI
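
As a consistency check between the two sequence blocks, the following minimal sketch translates the first and last rows of the gene sequence and compares them with the start and end of the protein sequence. It assumes the standard genetic code (NCBI translation table 1), since the record's Translation table field is empty. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tool.

```python
# Assumption: standard genetic code (NCBI translation table 1); the record
# leaves its "Translation table" field empty.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, ignoring the whitespace between 10-base groups."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First and last rows of the gene sequence, copied from the record.
first_row = "ATGTCTTTGA GAAGAAGAGA GAGGCACGTA GGCCGCAGTC CTCTTGATGG AGCTGGTCGG"
last_row = "GTTGTGTCAC GATTGAGAAA TCTCAAAGCA ATGGTCGCGG AGAAGCAAAT CTAG"

print(translate(first_row))  # MSLRRRERHVGRSPLDGAGR
print(translate(last_row))   # VVSRLRNLKAMVAEKQI*
```

The last row can be translated on its own because the 25 preceding rows hold 60 bases each (1500 in total, a multiple of 3), so it begins on a codon boundary. The trailing `*` marks the TAG stop codon, which is not part of the 517-residue protein.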