Gene PHATRDRAFT_54342 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_54342 
Symbol 
ID7200297 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011674 
Strand
Start bp185127 
End bp186882 
Gene Length1756 bp 
Protein Length585 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179379 
Protein GI219117169 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGAATA GCGACCATGG TGGCAAACAG CGGATCCTTG CACCAGCGGT CCAAGCGCAT 
TCCTTATCAA GGCCAGCTGT CGTTCTCTTT GGAACTCATG ACATCCGAAT TCACGATAAC
GAGGCCTTGC TGTTGGCCTG CCATCACAAC CACGTATTGC CGGTTTTTCT ATGGCAAGTA
CCGGTCCACC ATTGGGGAGC TCGTGGCGCC CTGCAAGTCG TGTTGAAAGA AGCATTGCAC
CAGCTTTCAC TACAGCTTTC ACAAGAAAGT ATCAATCTAC CTCTGGTGTG CGGCAATACG
GCGGACAGTG TTTCGGAGCT CTGTAAAATT GCTTCTGAGA TTGGCGCGAG TGCAGTCTAT
TGGAATCGCG AGATGACACC TGAAAGCAGG GAAATGGAGA GGCACAGAGC CACAAGTTTG
AAGCAACTCG ATATCGCAGC TGTAGCATGT CAATCGGCCC TACTCTACGA TGTCGAGAAA
CTTGAACTTG ACGAGGGGTT TCATGGTGGT CACTGGGGCA CGCTGATGCC GTTTAAGAGG
GCTTGCGAAA AGCAACTTGG AAAACCGGAT CGACCGATTT TGATGAAAGA GTCGCTGGCT
TGCCTATACT CCGTCGCTCC GCCTCCGCAA GACTGTAAAT CCGTACCCAT CGAGGAACTC
GGCTTCGAGA CCGTGCCGCC TTCCACAAAG TGGGATGAGC CCATTCGAGA GCGATTCCCA
ATGATACATT ACTTGGCTCA GCGGAGACTG GACCACTTTC TTATAAAAGG GCTACCTCTG
TATGAAAGTG ACCGAAGTCG GGCAGACATG GAATACGCGA CTTCGCAGCT TTCAGTGTAC
CTTCGTATTG GTATCATATC ACCACGAGAG CTATACTGGA GGATTGAGGA CAGCTCACTG
AGCCCTGAGG CGAAGAAAAC GTTTGCCCGT CGACTAATCT GGCGTGAGCT GGCGTACTAT
CAACTATTCT GTTTTCCAAA GATGAGGGAC AGATCGATAC GCAAGCACTA TGAAGCATCG
GAATGGGTCA CGGGTGACGA AGAGAAAGGC AGATTCAATG CATGGAAGAG AGGGTTGACT
GGCTATCCGT TGGTGGATGC TGGGATGCGT GAACTCTATA CAACTGGTTA CTTGACCCAA
TCCGTGCGGA TGGTTGTCGC GTCATTTCTT GTCGAGTATC TTCGAGTCGA CTGGACCAAA
GGAGCAGAAT GGTTCCACTA CACTTTGGCC GACGCCGATA GCGCGATCAA TTCGATGATG
TGGCAGAACG CTGGGCGGAG CGGCATCGAC CAGTGGAATT TTGTTTTGAG TCCTGAGAAT
GCATCCCAAG ACCCATACGG AGAATATACT CGCAAATGGG TCCCCGAGCT TTCTCCGTTG
CCATTGCAAT ACTTACAGCG ACCTTGGCAG ACGTTTGAAG GTGATCTTCG TATGGCCGGT
ATCGTCCTTG GTGAAACATA CCCACATAGA ATTGTTCAGG ACCTCAAGGG TGAACGACAA
AAAAGTGTCG AGAGCGTTCT TGCAATGAGA AGGCGATCGC AAGAAAAAAA TGATGAAAAT
GGATACGACT TGATCGACCT TCCTTCGGGC ATCGAAACGG TCGTTTTTAC GAAGAAAGAG
TACCGTATTG ATCGGTTGGG CAAAGTGCTC CAGGGAAAAC CAAAGACCGC TACTTCAACA
GTCAAGCGCC GAAAAACAAA ACGTACAACG AAAACAGATG GGAGAAGAAA GAACCGGCTG
CCATCAAGTC TCGCAT
 
Protein sequence
MSNSDHGGKQ RILAPAVQAH SLSRPAVVLF GTHDIRIHDN EALLLACHHN HVLPVFLWQV 
PVHHWGARGA LQVVLKEALH QLSLQLSQES INLPLVCGNT ADSVSELCKI ASEIGASAVY
WNREMTPESR EMERHRATSL KQLDIAAVAC QSALLYDVEK LELDEGFHGG HWGTLMPFKR
ACEKQLGKPD RPILMKESLA CLYSVAPPPQ DCKSVPIEEL GFETVPPSTK WDEPIRERFP
MIHYLAQRRL DHFLIKGLPL YESDRSRADM EYATSQLSVY LRIGIISPRE LYWRIEDSSL
SPEAKKTFAR RLIWRELAYY QLFCFPKMRD RSIRKHYEAS EWVTGDEEKG RFNAWKRGLT
GYPLVDAGMR ELYTTGYLTQ SVRMVVASFL VEYLRVDWTK GAEWFHYTLA DADSAINSMM
WQNAGRSGID QWNFVLSPEN ASQDPYGEYT RKWVPELSPL PLQYLQRPWQ TFEGDLRMAG
IVLGETYPHR IVQDLKGERQ KSVESVLAMR RRSQEKNDEN GYDLIDLPSG IETVVFTKKE
YRIDRLGKVL QGKPKTATST VKRRKTKRTT KTDGRRKNRL PSSLA