Gene PHATRDRAFT_35054 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_35054 
Symbol 
ID7199994 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011674 
Strand
Start bp965847 
End bp968336 
Gene Length2490 bp 
Protein Length621 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179549 
Protein GI219117509 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACGTG TCGAAATTGG ATACTGTACG TTTCCACAAA AAGTCACATC ACATATATCA 
CGCGTGATGT GACATTGCCC TCGCGACGAA TGTTCGCACC GTCGTCGGAG AGACGACACG
GCGACAACGT GTGAGTCCTT ACAGGTCGGT GCCTCGTTTC CGCGAGAAAG GTCCAAACGC
ATCTCTCGCG CAGTGGTCCT TCCTATTCGA GCGTCGTCCT TTTGGGGTTT CCCTCACACT
GTCCCTACGC GTTCATTCTC AGTTTGACTG GCTTTTGGCT ACTGCTACTA CTGCTGCTGC
TACAAAGGCC CGTTGGTCGT CAAAGACAAG CGGGTGTTAC GAGTCGACAG TCGTGTAGAC
CACTTCGGTG CTACAGTTAG TCAAGTGCGC AACGAATCCG GAAGCCAAAG CAAAGATACC
AGCGGATCCG TTGTTTGTTT CTTAGTGGCA CGTTTGGAAC GAGTTCTTGA TACGTATTGT
TTGTTTCGAG CCAACAGAGT TTTTGCAGAG AACTGTCGCA ATCAATCTGA AAAAGCAACC
AATTGTGTCT TTAGCTCGGT TTTGCTTGCC GACAGTTGCG ACAGTCTTTC TCCAACAAAC
AACTTGTCCA GAGAGTTTTC GAAATCACTG TTGGTATTCT GAGCTGTCGA CCGTCGATTG
ACACACTACA CCCAGTAAAC CTTTGGGCCG ACACTTTTCG CAAGTATCTA CCGCGAATTT
TGTTGTTTGT AACGTTTACA GCACCAGTTG CGAGGGCACG GACATTCACA CATCGCTCGG
CGCCTATGCA GTTTCAATAC AATTCAGATT ACCCCGGTGG TGGTCCGGAC GATAACCGCC
TCTTTTCTAC CGCCAAAGCC GCAGCAGACG CCGCAAATCC TCCCGATTTT CCTCCGGCAC
ATCCCATTCC GTGTGCCCGG ATCACTACCC GTGTGTTTCA TCCCAACCGA AATCAAGTTG
CTACCGTGCA AAACGTTCTG GTACGGACCG TCCTCCAAAA TCCATTACAT TCCTCACCAG
CGGTGGCACT ATACAATGCC GCCGAAAACC ACGAGGACTC TTCCATGAGC AGTAGTGACG
GGTTTTCCGA CAGCAACGAT GACGACAGTA TGCTGTTGGA CGACGGGTCT CGGGGTGCCT
CGGGGCCGAT TGGCTTTCCG GGCATCGCTC CGGCATCGGC TTCCACCAGA ACGGTTCAAC
AACAACCCAA TGGCGACGAC GATATGGACA AGGATGGTGA CGATCGCGCA TACTGGATAC
AACGGACCAT TCGTGATGCG ATTTACGGGC ACGTGTTCAT GGCGGTGGTG CTGCGGAGAC
GAGTACCCAG TCAAGCCGGC AACGACAATG CCGAATGGGA AGTTACCGCA CAACACTGTG
CCGTTAAGGA AATGAGCTGG CAGCATATTC GGAAAGAACG CGATCGCTTG GCCGAAGACC
CCATAAAGGA AGTCTCTGCT ATGCAATATC TGGTATCCTG GCATCGATCC GAACGGAAAG
AATGTGAGCA ACAAGTATTG TCCTCCGCAT CTACAGAACA TCCTCGAGAC GGCGTCTCTC
GATCCGTACG AGCCATGGTG GCAACCAACA TTATGATGCC ACTGGATTTG CTATCGGATG
ATCGGAATTT ATACAGCGTC ATGCCCTACT GTAATGGTGG CGAGCTTTTT GAGCGACTCG
ATATGAATGA ACGATTTAGT GAACCGGAAG CGCGGTATTG GATGAATCAA GTTTTGAATG
TACGTATCAA TACAAATGGA GTCATTCGGA AGTTGCCCAT GAAGCACAAT TCTGACCGTG
TTCTTGTTTT AAACTTAGGG TATTGAAACT CTACAGAATG CTGGAATATG CCATCGTGAC
ATGAGCTTAG AGAACCTTTT GGTACACGAA AATGGTGCGC TGATTATTGA TTTGGGTATG
TGTTTGCGGG TCCCTGTGCA GAAGGAGCAT GGAAGCGACA CTCCGGAGGA ACAGGCACAA
TTTCTGTCGC AGTCGTTTGA CACGATGAAT ATGAACGGAA ACAACTCAAC GGCGCTATTA
ACGCCCACAT CATCCTTGAC AACTACCACC ACAACTATTC GCGGAGGTGC AACCATATGC
CGGAAGCAGC CGCGCCGATT GATTACTCCG CAAGGGACCT GCGGTAAATG GATATATATG
TCACCGGAAA TATATAAGAA CGCTGCACCT TTTGATGGCT TTGCCGTGGA TATGTGGGCT
GCAGGAGTGA TTTTATTTCT CATGCTGACA GGATTTCCGC CATGGGAGCG CGCGTGCCAG
ACGGACGAAC GCTTCAAATA TATGACTGCT GGGTATCTGG TTCAGATGCT GACGGAGTGG
GACATTGGCC TTAGTCCGGA CGCGATGGAT TTACTGCAGC GAATGTTATT TCTAGATCCT
AAAGACCGCT TGAGCTTGGA GCAAGTGCGG GCACATCCGT GGATGGTCAA TGGACCGAGT
CAACCGCCAG CGCCACTAGC CGAGTTTTGA
 
Protein sequence
MKRVEIGYCP LVVKDKRVLR VDSRVDHFGA TVSQVRNESG SQSKDTSGSV VCFLVARLER 
VLDTYSPVAR ARTFTHRSAP MQFQYNSDYP GGGPDDNRLF STAKAAADAA NPPDFPPAHP
IPCARITTRV FHPNRNQVAT VQNVLVRTVL QNPLHSSPAV ALYNAAENHE DSSMSSSDGF
SDSNDDDSML LDDGSRGASG PIGFPGIAPA SASTRTVQQQ PNGDDDMDKD GDDRAYWIQR
TIRDAIYGHV FMAVVLRRRV PSQAGNDNAE WEVTAQHCAV KEMSWQHIRK ERDRLAEDPI
KEVSAMQYLV SWHRSERKEC EQQVLSSAST EHPRDGVSRS VRAMVATNIM MPLDLLSDDR
NLYSVMPYCN GGELFERLDM NERFSEPEAR YWMNQVLNGI ETLQNAGICH RDMSLENLLV
HENGALIIDL GMCLRVPVQK EHGSDTPEEQ AQFLSQSFDT MNMNGNNSTA LLTPTSSLTT
TTTTIRGGAT ICRKQPRRLI TPQGTCGKWI YMSPEIYKNA APFDGFAVDM WAAGVILFLM
LTGFPPWERA CQTDERFKYM TAGYLVQMLT EWDIGLSPDA MDLLQRMLFL DPKDRLSLEQ
VRAHPWMVNG PSQPPAPLAE F