Gene PHATRDRAFT_45154 details


Gene Information

Locus tag: PHATRDRAFT_45154
Symbol:
ID: 7200335
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011674
Strand:
Start bp: 336242
End bp: 339409
Gene Length: 3168 bp
Protein Length: 1055 aa
Translation table:
GC content: 54%
IMG OID:
Product: predicted protein
Protein accession: XP_002179408
Protein GI: 219117227
COG category:
COG ID:
TIGRFAM ID:
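
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the CDS length includes a terminal stop codon (the gene sequence in this record does end in TAG).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(336242, 339409)
print(length)                  # 3168
print(protein_length(length))  # 1055
```

Under those assumptions, both results agree with the Gene Length and Protein Length values listed in the record.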


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAACGGT TGGGCACGGA CTTCAACGCC ACTGGTTCCG TCATGAGCAA ACGGGAACTG 
GAAGAACTCC GGCGGAAAAA GGGAGAATGC GTCCGTTGCG GACAAAAATG TTTCCAGAAG
AAGCTCTTCA AAATGATTCC CATTACGGAT CACGGCAAGG TGCTCAACGG GCGCTGTTTG
GGTTGCAATC CTCTGCCCGG GGACGGCGAC GCCTCGGGCG TACTGCCGGC CGTATCGCGA
CCCGCCACGA TGCAAGATCT CCAACGCTTC AATCGGTCAC AGAACAATCT CGTTTTGCCG
CCGAATACCG CTAACACCGC TACCGGGAGT GTTGCCGGAG GAAGCGTTTC GGGGAGTGCA
AACGGAGGGG GGTCCCGGTC CAATTCGTCA CGCACGTTTA CCCCCATGAC GAACACACCG
ACGGGGAGCC ACAACGGTAA CGTTGGTACG GGTAATTCTT CCACTACACC CCGTCGCAGC
GCCCCTCGGG CATCCTCTTC GCGGGGCTTG ATCAGGGGAC AGTCGGAACG TGCCAGGGCA
TCGTCGGGTA CGGCCGCTTC GATTACCACC GCCACGGGAC GGTCGATGGA GAGTCTCTTG
CAACAGCAAT CGGAAGAAGT CCGATTGGAA TCGTCACAGC TAACGCCGTC ATCGTCGCAC
CCCGACCAAG ATCACATTCC CGAACAGTGT CGTCAACAGT TGCATTACGG GTCCACGGCA
TCGCCGTCAA CTGGTTTCTT GTCCCAACGT AGGAACGGCT TGCAACGTGG GGGTAGTTTT
GCGCGGACCG GACGGGGTTC GCCGGAAGTT TACGGAGAGA CTCTGTCCAT GTCGCGGAGT
TGCTCCACGT CTCCGGCCAA TACACCAACC GGATCATCGC CGCGGTACGA CATGGAACAC
CGACAAAGCA GTGTCCGCAG TGGATTGTCC CACATTAGTT TGGATTCGGC CTTGTCCGGC
GCCTCCGCGA TATCGTCGCC GCATCCACAC TGGCAACAAA TCGAGGAAAG TCCCGTCAAT
AGGAGGAACA GTGTCAACAC CAGTATCGAC GGCAGTATCA GCAGTAGTGC ACACGTGCGT
CAATCACCAC ACTGTAGCAA TAGTGGTGTC GATACGAGTG TGAATGGTTC CGGTGACTTG
GTCCACACGG GGCCGACCAC ACACCCTTAC TACGAGCACG ACACCTTCGC GGTCCCGTCA
TCGCGCGCTA CGGATACCTC CTACGAATCG CTTCCATCTC ACCACATCTC GACGGCGTCC
ACGGACGACG TATCGGGAAA TGATGAACTA CACGATGCAC CATCTCCTGT CACGGAACAG
AGACCGGATG TGGATTTGCC ACAGCAACAA CCAGACACCA ATTTAACGGC TCATCCAACT
GCCGCAGAGC TTTTGGATTA CTATCGTCGA ACCTTTCGAG AAGCATCCCA ACGCTCGTTA
CAGCACCACG GTAGCAACGG TACTGGATCG GTCGCTGGGG GCCGGATGAC TAGCGGGTCG
CTGAGTGGCA GCGGTGGCCC CGAAACGACA GGAATGCCAG CAGGGGAGAG CGATCATCTT
CCTTCCCACG GTGTGTTGAA CCGCGGCGGT GTCGTGTCTT TTCAGCACTT GCAACATCAA
GGGAATCAAG CACAAGGTCC GCAGAGTCTC ACTGGTACCG TCAATAGTAG TTCGCATCAT
CATCGACGTG GAAGTTCACG GCCCAGCAGC TCGCGATCTT TGGATTCGAT GAGTAGTTTT
GGGGAAGAAG TGAACGCCAG CATTTTGAAT AGCAATGATG CGGAGAGCGC CCCGAGCGAT
ACGAGCTTTC GAGAAAGCTC TGCGGTATCA CAAGATCCTG GCGTCGCCCG CATTCAACAA
GCCGGTGTGG ATTTCGTTGA AGTTCTCAAT ACTTTACGGG ATCTCCCGGA CTCTTTGCGT
ACACAAACAG CTGGTTTGCA CGTATTGTCT GAATTGACCC TGAGTGAAGA AGATTCGGAG
ACGTTGCTCA ATATTGGAGT GGTGCAGGTA ATCTTAGACG CAATGCGTCG TTACGCCCAC
GATACGTCTC AGGTCGAATT GCAAACGGCC GCCTGTCGAG CTATTCTAAA CGTTACGGGA
ACGTCGGAAG CGCAAATAAA TTTTGTGCAA AACCAAACGG TTGAACATGT GTCCACCCTG
ATGCAAAATC TTTTGGAGAA TGCGACCGTG CAGGAATATG CAATGGCAAC CATCGCCAAT
TTGAGTGTCC TTGAAGCGAA CTTGCCGATT CTGATAGAAG AACACTCGGT CACACGCATT
GTTGAAGCTA TGAACAAGCA TTCCGAAAAT CGTCAAGTCC AAATAAAGGG TTGTTCCGCC
ATTACCAACA TGGCTTCACA CACGACGCCT TTGAAAAAGA CCATCATGGA CCAAGGAGGA
GGCGGAGCTG TGGTCGTTTC CATGGTGATG CATCCTGGCG ATGTCGAATT GCAGGAAAAA
GCACTGCAGG CCTTGCGCAA TTTGTCGGCA AACTCAGACG AGAACAAAAT GGAGCTAGCT
CGCATTGGCG GGATCGAATC AGTGATTGGT GCCATGCAAG TCCACCGCGA CGAAGCTGGT
ATTCAAAAAA CTGGATCCTG GAGCTTGTCC AACTTAGCCG GTTTCGTTGA TAACAAAAGG
ATAATTGGCG AGTGCGGGGG AGTGGACGTG ATTGTGCGAG CTATGTGGGT ACATTCGGAT
GAAGTATCGG TTCAAGAATG GTGCTGTCGA GCCTTGTTTA CCTTGGCACT GGAGCCCCAG
AACCGATTGG TAGTTTTGGA CGTGGGTGGG ATCTCGGCTG TAGTTAATGC TATGCAAGCA
CATGTAGATT CATCGACGGT TCAGGAAATG GGATGTGCTG TCTTGTGCAA TCTAGCAACA
GATCAAGCAA CCAAGCTTCG CATTGTAGAC GAGGAAGCCT TGGATGCCAT CGTGTTGGCC
ATGGTCCTAT TTGGCGACGA AATCAAAGTA CAACAGCAAG GATGTCAAAT TTTATCACAG
CTTTGTGTTG CCGAAAACCT TAAATCATTA CAAGCGTCAA ACGCGGGAGA GCTAGCGCTG
GCAGCGGCGC ACAAATTCCC GGAATGCGAC GCGCCAGCAC AGTGGTTGTT GAATTCGCTC
GAAGAATTTG CTGCTGCGTA TATTGAGACT ACGGAAGCCC ACCATTAG
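
The gene sequence is displayed in space-separated blocks of 10 bases, so the spacing has to be stripped before any downstream use. The following is a minimal sketch, with hypothetical helper names, of how the block could be cleaned and its GC fraction computed. How the page's own GC content figure was derived is not stated in this record, so the calculation here is an assumption (G plus C over total length).

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First displayed line of the gene sequence, copied from this record.
first_line = "ATGGAACGGT TGGGCACGGA CTTCAACGCC ACTGGTTCCG TCATGAGCAA ACGGGAACTG"
seq = clean_sequence(first_line)
print(len(seq))          # 60
print(gc_fraction(seq))  # GC fraction of this first line only
```

Passing the full sequence block through `clean_sequence` should yield a string whose length matches the Gene Length field, which is a quick way to confirm that nothing was lost when copying.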
 
Protein sequence
MERLGTDFNA TGSVMSKREL EELRRKKGEC VRCGQKCFQK KLFKMIPITD HGKVLNGRCL 
GCNPLPGDGD ASGVLPAVSR PATMQDLQRF NRSQNNLVLP PNTANTATGS VAGGSVSGSA
NGGGSRSNSS RTFTPMTNTP TGSHNGNVGT GNSSTTPRRS APRASSSRGL IRGQSERARA
SSGTAASITT ATGRSMESLL QQQSEEVRLE SSQLTPSSSH PDQDHIPEQC RQQLHYGSTA
SPSTGFLSQR RNGLQRGGSF ARTGRGSPEV YGETLSMSRS CSTSPANTPT GSSPRYDMEH
RQSSVRSGLS HISLDSALSG ASAISSPHPH WQQIEESPVN RRNSVNTSID GSISSSAHVR
QSPHCSNSGV DTSVNGSGDL VHTGPTTHPY YEHDTFAVPS SRATDTSYES LPSHHISTAS
TDDVSGNDEL HDAPSPVTEQ RPDVDLPQQQ PDTNLTAHPT AAELLDYYRR TFREASQRSL
QHHGSNGTGS VAGGRMTSGS LSGSGGPETT GMPAGESDHL PSHGVLNRGG VVSFQHLQHQ
GNQAQGPQSL TGTVNSSSHH HRRGSSRPSS SRSLDSMSSF GEEVNASILN SNDAESAPSD
TSFRESSAVS QDPGVARIQQ AGVDFVEVLN TLRDLPDSLR TQTAGLHVLS ELTLSEEDSE
TLLNIGVVQV ILDAMRRYAH DTSQVELQTA ACRAILNVTG TSEAQINFVQ NQTVEHVSTL
MQNLLENATV QEYAMATIAN LSVLEANLPI LIEEHSVTRI VEAMNKHSEN RQVQIKGCSA
ITNMASHTTP LKKTIMDQGG GGAVVVSMVM HPGDVELQEK ALQALRNLSA NSDENKMELA
RIGGIESVIG AMQVHRDEAG IQKTGSWSLS NLAGFVDNKR IIGECGGVDV IVRAMWVHSD
EVSVQEWCCR ALFTLALEPQ NRLVVLDVGG ISAVVNAMQA HVDSSTVQEM GCAVLCNLAT
DQATKLRIVD EEALDAIVLA MVLFGDEIKV QQQGCQILSQ LCVAENLKSL QASNAGELAL
AAAHKFPECD APAQWLLNSL EEFAAAYIET TEAHH