Gene PHATRDRAFT_42574 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_42574 
Symbol 
ID7195955 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp474293 
End bp476744 
Gene Length2452 bp 
Protein Length705 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002176590 
Protein GI219109672 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
AAGGAAATCC CCAAGCATCA CATAATAAGC AAGTCTCCCA GCAAAGAACT GTTCGAGGCA 
TTATCAATCA ATATGAAACT TGTTGTCTCT GCCTTGTTCT TCTTGTCCCT GGCGGAAGCC
GATTTGTTCA ACTATGGCAA CACGGACACC ACTGTCGATG GCGAGAAAAG TTACGGAATG
CCGAATTGGA ATCGAGTTGA ATGCAGCAAC GAAGACACCT GTGTGAGTGC GCAATTTTCT
TAAAACATTT AACATATGGG TCTGTTGCTC ATTACTTGCT TTTTTTTTTG CCTTTACAGC
GAGGCTGGCC CGACAAGTTT CCCTTTTTGG TAGGCTGGGA CGCGGGCAGA AACACCTGTC
AGTGGTGCCC TGAAGGTGGA CCAGAGAATT GTGGTTTTCA CAGACAATCA CCGATTGATT
TGAAGCGCGA CCGGGGGGTG ATCGGCGGAA GCAACGAGAA ATCATGCCCA GATTGGCATT
GGATGGCGTA CAAGGACGAT ACTTGTGCAT GGAAAGATAT GGTGGATGAA TTTTCGATTG
AACGTCACGG TTTGCGTCTG TCAGTCCCCA TTGAATCAGA CGGTGAAATT TCTTGTGTTG
AAAATCGAAA TGGACAAGAT GTGAGGAGGT TTCCTCGTCT AGATTACAGC AAAGGTTTCC
CGGATTGGTG GTGGCTCCAA TCGACCGACA TATCTGTACC CAGCCATCAT ACACAAGAGG
GAAAGAGATA CGACGCCGAA GTCACACTTA AGCATTTCTA CGAGATTGAA CATGACAAAA
ATCAGGTATG AAGCGCATCC AATACTTCTG TACTTAGTAG TACCGAAACC TAACGTGAAT
CTTTTCTGGT CTTATCAGCT CGGCTATGTA ACTCTCTTTA TGGAAGCCTA CGACGATGCA
GAGTCATGGC CGTTTTTAGA CAAACTGATT TGTGCCTGGA GAGAAAAGGA GGAGAAGGTC
CGGCGTGAAT GTGGACTTCC GCCTTCAGCT CCGTACGGTA GATGTCGAAT CTTTTCTGAA
CGTGGTCAGC AGCCAACTGA TGCTTGGAGA TTCGAAGCTG GTGAGCTACA GTCTTTTGGA
GAGGGTGAAC CCGCCCCGGT TCCATCCGTT GCCCCCACCA AATCACCAAC TGCCTCGATC
ACTACCTTGT CTCCTGTAAT TCCGACTACG TCACGTCCAA CCGAGTGGGT CTTACCTCCT
ATCTTTGCAC CCCCTTCCGT CACCGTCACC ACCGACACGC CAACCATCAC TGAGCAAATT
GTTTCGGCAC CTCCGCCTAT TGACTGTGAT GCATTTGATA TGAACTATGA TAGGCTTTGC
TACTCCAACG ATCCTTGTTG TGAAACCCAG AGGTCAACTT CCGAGTATTG CTGGGACGCT
TATGAGAACA TTTTCCCAGG AAATGCCATC TACTCGGCTT GTCACCATTG TTGTGGAGGA
GAACGAAAGA GCGTTGGACC GCCTAGTCCC ATCAACCCCA AAATTCCAAA AACGTTACAA
TGTTCGTCGT TGTCGAACGA ACCAAACCGC ATGTGCAATC TTGAAAGCTG TTGCGATGGT
TCCGACTCCA GCTACTGTCG GGATGTTCAG AAGCAGTTCG GAGACAAAAT GACTGAAATC
TGCGTACGTA TTGATAACAG TTGATGGAGT TGATTCGAAG TATTCCGCTG ACCGCGTGTT
TCCTTTTCTT GCAGTGGTAT TGCTGCTCAG AGCCGAAGGA GTATGATTCC AACAGACGAA
CGCTTCGCGG AACAAGTTTT GGCATGGAGG TCGGCGAAGA CATCAAAGGT GTGCAGTCTT
TCCCAAGTGG TACTAAGTTT ATGGAGGTGG ACGGCCGCCG GCTTGTTTTG CGGAAAGAAA
ATTTTGAAAG GGATGAAGAG AGCGAGGAAG ATTATTTCAA TCGGATCTAT TCAAATTACA
AGCACCGGTC CTTACAAGCC ACTGTCCATC AAGAGGACTA TGCTGACATC GAGTACTGGC
CGTACGAATG GATGCTGAAG GTAAATACCG AGTATTACTT CAGATACGAA GGTACTCAGG
TAGTTGCCCC GTGCGCAGAA ACTGTCCATT GGAGAGCTAT GAAAGATCCT ATTAAAATTC
ATCCTCGCCA GCTTGCCGAG TTGACAAGGC TTTTGAAAGA AAGAATCGCC CCCACAGGAG
ATCCCAATTC TTGTCAATCG GACACTGCAG GCGTTTCCGG AAGTGACGGT TCACTCAAAC
TGAACAGAGA CCTTCAGTAC TACCACAATG TCCATCGCAA GGTATTTTGC GAGTGCAAAG
ACTGGCCCTC AAAGTTCGAA AGTGACAAGC AGTGGTGCCG CAATTGGCAA GATGACACCA
ATTACGAGCG GTTTTACCAG CGTCCTTATA GTTTCGATTC AAATGGAGAG TGGTAAAGCA
GGTTGTTTGA TAAAATACTC TATATAACGT TAAATGGAAT GATGTAAGCT TT
 
Protein sequence
MKLVVSALFF LSLAEADLFN YGNTDTTVDG EKSYGMPNWN RVECSNEDTC RGWPDKFPFL 
VGWDAGRNTC QWCPEGGPEN CGFHRQSPID LKRDRGVIGG SNEKSCPDWH WMAYKDDTCA
WKDMVDEFSI ERHGLRLSVP IESDGEISCV ENRNGQDVRR FPRLDYSKGF PDWWWLQSTD
ISVPSHHTQE GKRYDAEVTL KHFYEIEHDK NQLGYVTLFM EAYDDAESWP FLDKLICAWR
EKEEKVRREC GLPPSAPYGR CRIFSERGQQ PTDAWRFEAG ELQSFGEGEP APVPSVAPTK
SPTASITTLS PVIPTTSRPT EWVLPPIFAP PSVTVTTDTP TITEQIVSAP PPIDCDAFDM
NYDRLCYSND PCCETQRSTS EYCWDAYENI FPGNAIYSAC HHCCGGERKS VGPPSPINPK
IPKTLQCSSL SNEPNRMCNL ESCCDGSDSS YCRDLMELIR SIPLTACFLF LQWYCCSEPK
EYDSNRRTLR GTSFGMEVGE DIKGVQSFPS GTKFMEVDGR RLVLRKENFE RDEESEEDYF
NRIYSNYKHR SLQATVHQED YADIEYWPYE WMLKVNTEYY FRYEGTQVVA PCAETVHWRA
MKDPIKIHPR QLAELTRLLK ERIAPTGDPN SCQSDTAGVS GSDGSLKLNR DLQYYHNVHR
KVFCECKDWP SKFESDKQWC RNWQDDTNYE RFYQRPYSFD SNGEW