Gene PHATRDRAFT_41350 details

Gene Information

Locus tag: PHATRDRAFT_41350
Symbol:
ID: 7199209
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011697
Strand:
Start bp: 248013
End bp: 249665
Gene Length: 1653 bp
Protein Length: 550 aa
Translation table:
GC content: 53%
IMG OID:
Product: predicted protein
Protein accession: XP_002185296
Protein GI: 219130279
COG category:
COG ID:
TIGRFAM ID:
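
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes 1-based inclusive coordinates, an unspliced gene (as the record states), and that the gene length includes the stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(248013, 249665)
print(length)                  # 1653
print(protein_length(length))  # 550
```

Both values match the Gene Length and Protein Length fields of the record.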


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCATCGC AACAGGACGA TCCTGGGTTC ACCTTTCCTC AGGAGGTGGC GTACACCGTC 
GCCCCGATCA TTTCAGGTCT CTTGTCGACC TTGGGATCAA CGGCGATTAT CTGGATGATC
TTGACGGATT GGAATCGAAA AATTCGTCGT GTCAAGTACC GAATATTGTT GGGCTTGAGT
TTGTCGGACG CCCTGAGTTC GATAGTACAG ATGTTCTGGG GAATCATGCT GCCCAAGGGG
ACACCGGGTT CATGGGGGGC TATTGGGAAC AAGGCAACCT GCAGTGCGCA CGGATTTATT
CTGCAGTTTG GCCTTTCGGG GAGCTTTTAC AATACGGCGT TGTCAGCCTA TTTCTACAAA
TCCATCTGCT TCGGCATGAC CGACGCAACG TTCGCCGCCA AGTACGAACT CTGGATTCAT
CTGACGTCCG TTCTCTTTCC GTTGGCAACC GCCGTAGGCG CATTGATTCT GGACATTTAC
AGTGTCACCG GAGGAGGGTG TTACATTGCT CCGGACCCGC TACGATGTCA CCGCCGCGAC
GACGTAGAGT GCCTGCGGGG AGAAAACGCC TACAAATACT CCTGGGCAGT CGCCGGAGTA
CCCGTGGTCA TTTTCCTCCT CTATATTACC TACACGATGT TCCGGATTTA TCAAAAAGTT
CGCCAAGTCT CCCGACGCTC GGAACGATTC GAGTTTCGCT CAACGCGCGT GTCCTACGAA
ATGCCCGATT CGGAAAATCC GTCGGAAGAA CAACGGCGAC AGTCCGTGAA CGGCCAAGAT
AGCAACGACA ACTTTTCGCT CCGCGAATTG GTTGAGCAGC ATGGGGATAC CACGGAACTG
CAGCGGCCTC CCGCGGTTTC ATTCCAGCTC ACTGGTGGCA ATACTGGCAC GTCATCCGGA
ACCGCTTCAC CTCCTCCGAC TCACCCTCTT CCTTCGGCTT CGTCGCGGGG AAGGACTTCT
TTCAACAGCA GAAGATCCAT CCAGGATTCC AACCGTGTCA GCCGTATCCG AGAGACAGCC
ATCCAAGCTT TCCTATACGT CGTTGCTTTC TTTGGAACCC ACTTTCCGTC CTTTATTCTC
AACAATTTGG AAATGTTTGG GGGCACCAGT CCCTTTTATT TGGTCTTCTT GGCTTCATTC
GCCTGGCCCT TGCAAGGTTT CTTCAACCTC TTTGTGTTTT TGCGACCGAG AATACGTAGC
TGCCGTCGAC AAGAGCCATC CTTGTCCTAC TGCAAGGCCG CCTATCTGGC GTTGTTCCAC
TACGACGAAG CTCGCGGTCG TGTCAACGAG TCGCAACTCA CGGACGCCAC CCCGGATGCG
GCGAAGTTTC CTAGTGGCTC GGACTCGTGC AACGGCAGCC AAGCTCTGCA AATGATACGT
GTATCACGTC TCGAATCCAT TGACGACAAT GACTATCGAG AAGAAAGCTT GTCTTCTGCG
TCCGCCTACG AAGACGAAAC CAACAACGGG CCCTCCACGC TCGATACCGT TGAAAGCTAC
CCAGCGAGTC AAGTCCCCGA CACGGCAACG ATGGCGGACG AAGAAAGCTA CTCGCCGGAC
TCTTTCGAAG CTTCCAAGGA AATCAAGCAC GTTGTTTCTC TTTTACGGAC AAAAGAGCAC
CCGGAAGAGA ATCAGGACCA TGACCGAAAT TAA
 
Protein sequence
MASQQDDPGF TFPQEVAYTV APIISGLLST LGSTAIIWMI LTDWNRKIRR VKYRILLGLS 
LSDALSSIVQ MFWGIMLPKG TPGSWGAIGN KATCSAHGFI LQFGLSGSFY NTALSAYFYK
SICFGMTDAT FAAKYELWIH LTSVLFPLAT AVGALILDIY SVTGGGCYIA PDPLRCHRRD
DVECLRGENA YKYSWAVAGV PVVIFLLYIT YTMFRIYQKV RQVSRRSERF EFRSTRVSYE
MPDSENPSEE QRRQSVNGQD SNDNFSLREL VEQHGDTTEL QRPPAVSFQL TGGNTGTSSG
TASPPPTHPL PSASSRGRTS FNSRRSIQDS NRVSRIRETA IQAFLYVVAF FGTHFPSFIL
NNLEMFGGTS PFYLVFLASF AWPLQGFFNL FVFLRPRIRS CRRQEPSLSY CKAAYLALFH
YDEARGRVNE SQLTDATPDA AKFPSGSDSC NGSQALQMIR VSRLESIDDN DYREESLSSA
SAYEDETNNG PSTLDTVESY PASQVPDTAT MADEESYSPD SFEASKEIKH VVSLLRTKEH
PEENQDHDRN
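
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch of how the blocks could be joined, the GC fraction computed, and the gene sequence translated. The helper names are hypothetical, and because the Translation table field of the record is empty, the standard genetic code (NCBI table 1) is an assumption here.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
# Assumption: the record does not state which translation table applies.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to print blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGGCATCGC AACAGGACGA TCCTGGGTTC"
print(translate(first_blocks))  # MASQQDDPGF
```

Applied to the full gene sequence, `gc_fraction` could be compared against the GC content field and `translate` against the listed protein sequence.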