Gene PHATRDRAFT_46440 details


Gene Information

Locus tag: PHATRDRAFT_46440
Symbol:
ID: 7201543
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011678
Strand:
Start bp: 345965
End bp: 349057
Gene Length: 3093 bp
Protein Length: 1030 aa
Translation table:
GC content: 53%
IMG OID:
Product: predicted protein
Protein accession: XP_002180990
Protein GI: 219120506
COG category:
COG ID:
TIGRFAM ID:
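
The start, end, and length fields above agree with each other, which a short sketch can confirm. It assumes 1-based inclusive coordinates and that the CDS length counts one terminal stop codon; the record states neither convention, and the variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 345965
end_bp = 349057

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon, which is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 3093
print(protein_length)  # 1030
```

Both results match the Gene Length (3093 bp) and Protein Length (1030 aa) listed in the record.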


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGCTTC CGCCGTTATC GGTGCGCAAA CGCGGCGTCA AGGATCCGCG GCCGTACGAG 
CCAACAACAA CCCAGGCGGC GGAGGATGCA GTCGCCCGGA TTATGGATTC GTTGGGGACT
GCCGTTTCCC CACACTTGGG GAAAGTGAAT TTACAGAAAG GACTGTACAA TGAGAGCAGT
GGCACATCAA CACGGTACCG GTACGAGCGT ACGGAAGAAA GTTGGTTGGA ACGCGCGTTA
ACAACGGACG ATGACGACGC CATGGAGACG GACGAACTCG TTACGGGCTC GGCAAATTCC
AAAAAGCCGC GGATTCTGGA ACGCGCCACG ACGCCGATCA TAAGTAACGC GAGTCACAGG
TCGCAGGACC GCGACGAAAT CATGTCTCCC GTCCCTCTCC GTTCACCCTC TCCCACGGAT
GATGCTGGGA TCACCGGGAA CGACACTTAT CTGCGGCGTA CCCCTACCAC GCTGCGTACG
GGACACGTGG CTGGCCCATC GTCTCGGCAC GATGCTCTCG AGTATGGGCG CTATCACGAT
GCCGTATTGC GCTTCGTACA AGTCAAACGT CGTGTCACGG AACGCGTCGA ACTCAAACAA
CGTTCGTTGG CACTCCAGGA GGAGAGAACC TCACCGCGCG TACCGTTCGA CGACGCCATG
GAGCTTGTTC CCGACGACCT TTTCATCGCG AACCGGGATG GAGATGCGTT AATCTTGGAC
GAAACACGAG CGGACGTTGC GTTCTTGCGT TCCTTACACA ACCTCTCCTC GGACCATGTC
AATGCAAGCG ATGCTCGTTT GGCACGCAAA GAGGGCAACT TCTGGCTCCT CCTGTCGCAT
TTACGAGAGT TGAGTTTAGA TACCTTGATC TGGGCTGACG ATGCTGCCTC TCTACACCAG
CACGAGAGTT CACTGTTGGC CTATGTTGAT TCCCTCGCTG CCAAGGTCAA CGCCGCCCCA
CTGGAACTCA CGCAGGCATT GCAACTGAAT ACGTCTGAGT GTCCATCATT GCTTCGTCGT
CGTCAGTGCA TCATGCGTTG GCTGGAAGCC TGTTTTCATC AGCAGTTGCC GACAGGATCC
ACCCGCGCTC GCCATGCCAC CATAATTTGT GATTCCAAGC TCCTGCGGCA AGAAGGTTTG
CCCGAGACGG ACAAGGATGC GGAAATACTG AAAAATGCGT TGGCCTTGAT TTTGGCCGGA
CGAGAGGAGG ACGCACAAAT ATTGGCTCGT GATAGTGGCG CTCCTTGGCG AGCGGCGCTT
TGGAGTGGAG GGAAACCACA AGGTGTCGTG CACAAGCCGA ACCTCGCAAC CGAGACGATG
GATCGGATCC CCACGGGAAA TCCCCGCCGT GCTTTGTGGA AGCGCATGAT GTGGAAAAAT
GCCGAAGCGC TGCATCAAAA GGGAAAGGCG GCGGCAGACG AAGCTGCGAT TGCCGCCATT
CTTTCCAATA ATCTTAAGAT TGCCCTCTTG AACCCATCCC TCAGGACGTG GGAGAAGTGT
CTGTATGTTG CCTTCCGATG CATGATCGGT CGCACCGAAG ATGACCTGCT ACACAAACAC
AACAATCTTC GTCGGCAATA CCGCCCTCCC TTCCCTGGAA CACAGTTCGC ACAACACGAA
CTCCAGCAGC TTCGTGACAC TGCTGACATT GCTGCTGAAG ATGAGGCCTC TGTTATTCAT
AATATCTTAC CAAGCTCTGT TTTTGACGAA GTCAAAGATG ACGATGTAGT GACGGATGCC
ACTTCGAGTT TTCTGGTTGG CAAAAGTTCG ATTGCGTCTT ACCTACAGGA CTCAATGGCG
GACTTGGACG ATGCAGGAGA AATGCAGCTA CGGTTCCTCA CCCACCTGGG TTTGTACTTG
GATTCACTAG CGGTAGGCAC AACGCCAATC TTCATTCAAG GAGTTTCGGA CTGGAAGAAC
AGAATGTTGT TGAAGTACTT GCAGTATTTG TCCACCCGCG AAGAACTATG GCATTTGCTT
GTACTTTACG CGTCGTTGCT ACCGGAGTCG GTCTTGACGT CTCAACTTCC CAATATGCTG
CAAAGCCTTG ACAGTCAAGA AGGCCGCAGA ACGATTGTTG AACAGATGCG TGAACTTTTA
CCGCGTGCAG GTTTAGATTT GGTAGTTCTT CAGAATGTGG TACAAGCTAC TTTGCAAACG
ACGGATACAG AGAATTCCTG TGTCGACACT CCCACTCGTT TGGATGTCCA AAAGATGCGC
TCGATTGCGT GGCTTTCTTA CAGTCAAGAC CATACGGTGG ATGCTCTCGT ATTTTCTAAC
TCACTGCTAC GGCACTTCCT TCTTGGGGGT CGTCGAGCGA GTGCCATACT TTTCGTCGAA
GACTTTCGGC TGGAAAGTGT CTTGGAGTTA GCGGAAGGCA ATGGAGAAGA TGAAGCAGCG
GATGTAGACA CGTGGCGACG TGAACACATG GCACTGCAAT ACTACTTGGA TGCGACTCAA
GCTATTGACC ACTGGCGTGA GATTATTTCT ACTGTTGAAA GCACCACCAA ACTTATTGAT
GATCGGATCG ACATTGGTCG GTTGGATGAA ACAGACGTCT CGGTGGCATT GAAGATCGAA
CGCCGTGCTT TGCTCGAAGA GAAGCGCAAG GCTAGCTTCT CTGTCGTGAG GGCGTCTAAT
AGCGCACTCA AAGCTCTTTC CGCCGTCCTA AAGTATAGAG GCGGGTGGTT GTTACTGGAC
TACGATGCCG CATCAAGGCA CTGCGACCAA AACGCTCGTT CGGCCGAACT CGATGCCTTG
CGCCGCAAAA TTTTACCGCA TTGTATTTCT AGCTATTGCG AAGTCTGCAT GGAGACGGCA
GTGTGGATGT CATCCTCCAT GGACGATGCT GTTGCTCAAC TGAGTGAAAG CCCTTCCGCT
GTGTTGGATT CGCTCGACAG TCCCCAGGAC GGTGAAATAT CTCCAATCGC CCCTTTCTAT
TGGCCACAGC AAGCATTGGA AATTGCCAAT GTTGTTGCAT CAGAAACCTA TGGTATCCTC
TCTGCTTTCG GGACGGCAGA AAAGAAACAG CTCGTTTCCG ATTTAGCGGA GGCGTCTGTA
GCCAACCTCT TTTATACTAC GAAAAGAGAC TAG
 
Protein sequence
MSLPPLSVRK RGVKDPRPYE PTTTQAAEDA VARIMDSLGT AVSPHLGKVN LQKGLYNESS 
GTSTRYRYER TEESWLERAL TTDDDDAMET DELVTGSANS KKPRILERAT TPIISNASHR
SQDRDEIMSP VPLRSPSPTD DAGITGNDTY LRRTPTTLRT GHVAGPSSRH DALEYGRYHD
AVLRFVQVKR RVTERVELKQ RSLALQEERT SPRVPFDDAM ELVPDDLFIA NRDGDALILD
ETRADVAFLR SLHNLSSDHV NASDARLARK EGNFWLLLSH LRELSLDTLI WADDAASLHQ
HESSLLAYVD SLAAKVNAAP LELTQALQLN TSECPSLLRR RQCIMRWLEA CFHQQLPTGS
TRARHATIIC DSKLLRQEGL PETDKDAEIL KNALALILAG REEDAQILAR DSGAPWRAAL
WSGGKPQGVV HKPNLATETM DRIPTGNPRR ALWKRMMWKN AEALHQKGKA AADEAAIAAI
LSNNLKIALL NPSLRTWEKC LYVAFRCMIG RTEDDLLHKH NNLRRQYRPP FPGTQFAQHE
LQQLRDTADI AAEDEASVIH NILPSSVFDE VKDDDVVTDA TSSFLVGKSS IASYLQDSMA
DLDDAGEMQL RFLTHLGLYL DSLAVGTTPI FIQGVSDWKN RMLLKYLQYL STREELWHLL
VLYASLLPES VLTSQLPNML QSLDSQEGRR TIVEQMRELL PRAGLDLVVL QNVVQATLQT
TDTENSCVDT PTRLDVQKMR SIAWLSYSQD HTVDALVFSN SLLRHFLLGG RRASAILFVE
DFRLESVLEL AEGNGEDEAA DVDTWRREHM ALQYYLDATQ AIDHWREIIS TVESTTKLID
DRIDIGRLDE TDVSVALKIE RRALLEEKRK ASFSVVRASN SALKALSAVL KYRGGWLLLD
YDAASRHCDQ NARSAELDAL RRKILPHCIS SYCEVCMETA VWMSSSMDDA VAQLSESPSA
VLDSLDSPQD GEISPIAPFY WPQQALEIAN VVASETYGIL SAFGTAEKKQ LVSDLAEASV
ANLFYTTKRD
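
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup, with a simple GC-fraction helper; the function names `clean_sequence` and `gc_fraction` are hypothetical and not part of the source database. Only the first and last lines of the gene sequence are used here for illustration.

```python
def clean_sequence(blocked: str) -> str:
    """Join whitespace-separated blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in seq (0.0 if seq is empty)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks and the final line of the gene sequence listed above.
first_blocks = clean_sequence("ATGTCGCTTC CGCCGTTATC")
last_blocks = clean_sequence("GAAAAGAGAC TAG")

print(first_blocks.startswith("ATG"))  # True: begins with a start codon
print(last_blocks.endswith("TAG"))     # True: ends with a stop codon
print(gc_fraction("ATGC"))             # 0.5 on a toy input
```

Passing the full gene sequence through `clean_sequence` and then `gc_fraction` would give a value to compare against the reported GC content of 53%.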