Gene PHATRDRAFT_47861 details


Gene Information

Locus tag: PHATRDRAFT_47861
Symbol:
ID: 7203084
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011683
Strand:
Start bp: 265564
End bp: 270456
Gene Length: 4893 bp
Protein Length: 1234 aa
Translation table:
GC content: 50%
IMG OID:
Product: predicted protein
Protein accession: XP_002182190
Protein GI: 219123768
COG category:
COG ID:
TIGRFAM ID:
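
The reported gene length agrees with the start and end positions if both are treated as 1-based, inclusive coordinates. The sketch below reproduces that arithmetic; the inclusive-coordinate convention and the function name `span_length` are assumptions made for illustration and are not stated in the record.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
gene_length = span_length(265564, 270456)
print(gene_length)  # 4893, matching the reported Gene Length

# Bases needed to encode the 1234 aa protein, assuming one stop codon.
coding_bases = 3 * 1234 + 3
print(gene_length - coding_bases)  # bases in the gene span that are not coding
```

The coding requirement being shorter than the gene span is consistent with the record marking the gene as spliced.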


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.090651
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGTTGC GTCGTCGTCT GACCAAGCCC GGTCGTGAAA GCCTGAAGGG GTGGGAAGGT 
TGGGGTTCCG CTCCGCACAG CGACCTCCTC GATACAGCAC GAGAGCCTCT TTCCCAAAAA
GGGCCTGTTT ACTATCGTAC CTCGCTACCG TGTCGACTAG TCGCTCTACT TGGCGCTGGT
GGGTCTCCCT TCTTACGTTT ACACGTCCCG GCACGGTAAA AATAGTAGGT ATACGCCCCC
ATCGCTGCCT GCAGAGTTGG TGGTACACAA GTTGTGCGTC TATTGATCTA TCTATCTATA
TCTCTATCCA CCCCAGGGTC CTTTGTGGGC GTGTTGGTAG AAGAGAATTA CGAAACTTCT
TTGGGATAGC TTGGTGTCCC TTTTGAGGAC TCAAAAGACC GTTCAGGAAC CTGCCTACCA
ATCCCCATTC GCGTCACTCT GTGTACGTTG ATTACCATCT TCGCGATTGT GCAGACAGTG
TATCGCACTT GCAGTCTTGT TGGGGTTTCT GTTCTTGATT GATTCGGTCC TTACGGTGTT
ATCGTAGAGT CCACCGACGA AGACAGGGAC ACACACGTTG GTTGTTAGAG ACTGGACGTC
CGACAATACT CGGTACGGCG ACGACTTGTG GTCTACTTAT TCGTGCCGAG ACTGGAAGAT
CCCGCCCGTA CACTTGGAAA GACACGCACA CTCGCTCAAA AAAGCACAGA CATATACATA
CACAGTATGC CTGTTCGACC CAAACGCTCG TTGGCCAAGT ACTTTCACGG TGGGGATCCT
TCGGATGTCG TCGAATGCCC AGGCTACCGT GCCGCCAACA ACGGCAATCA ACACCGTAGC
GTACACAACG CAACCATGGA GGCACTCGTA CGTGGTTCCC CCGTCGCCGA CACTGCCAAT
GGCGGGGGCG TGCTCGTACC GTCGGTGCAA CCGCAGCTCT CGGCTGTACG GCATAGCAAC
GGTTGTTTGC ACTGCCACAC GGAAGACACC TCCGAAAACA ACGTGCAAGT CCGCTGCGAC
GAATGCGGCG CCTATATTTG CGACGAATGT CACTGGTGTC ACGAATATCA GGCCAATCAC
GAGATTCGAG TTTGCGATCG ATGTGACGGC TTTTACTGCA AGGGCTGCGA CGAAATGGAT
CAGTGCGAAG ATTGCGGCGA AGTTGTATGT GCAAGCTGTA GTACTCTCCT CAGTTGCAAA
TTTTGTGGTG GAGGGCTTTG TGAAGAATGC GCGACGGCCT GTGGACGGTA CGTAGGCTCT
GGAGTATTGT GCAACCAAAC ATGGACCCTT TGAACACTAT ACGAGAGGAT ACGTTTGTAT
TGCATACGAA TTGTTCTTCT CACACGATTT TTCTCTCACT TTTGGCAGTT GCGGTATTGT
GTTATGCAGC CGGGACGCCA AGTTTGCGGT AGAGTGCGAT ACTTGTCGAC TGAGTTACTG
CCTGGTATGC TTGGCTAGTG GGGTCAAGGA TCCTTGCGTC CGTTGTGGAC ATCGCCCTTC
GAAACGCATG GAGCAACTCG TACACCTGCG ACTGAAATCC ATTTACAAAG CATTCAAACA
AAACAGCGGT TCCAACGGTC TGGAAGTCCG TGCTCCGCAC ACGGCTACCA AAACGTATGC
GCGTTTACGC GAAAACGACG AACCTTGCGA CAATCCGGAT TCACTATTAC AGGCTGCCGC
GTCAGTGGTA GCGGCCAAAC ATCCGGAACT ATTGACTGCA CCGGAAGATG TCGAATCCCA
GCTAGTCCGA CTCGACTTGG AACAAGAAAA AGCCGATGCA GCCGCGGCGG CGTTATTGGC
GGAACTTGAA GAGGAAGAGC ACGCCGAACA AGTCAAGAAA AATAAGAAAA AGAAACGCAA
AGGTCGCAAC GGCAATAAGA AAAGCGAGGA AGAGGTTGAC AAAAAGCTTC CAGCAAAAGA
AGATCTACTT CCGCAGCCGT CCAGCCCGGA ACCAGTATCA GTAGCTACGG ATTTGCATGT
AGTAGCTACC TCACCGTCAA TACCAAACTC GCCTGATCCT CCCAAAGCTT CGGCCGTCGA
TTCGATGCAA CAAAAGCTTT GCGACCTTGT GATGAATGAG GATATGAAAG GGTTAGAAGA
TTTGATGGCC TCTTTAAAGG GTGTTCCGGG ACAAGCAGCC TTACGCAAAA ATGCAAAAAA
AGCGCTGAAA CGACTACAAA CACCAGAGGT AGACGCCCAT ATTATTGAAG CCAGAGAAAT
TACCACTGCT ACTCTGACGA CACCTTTAGA CGAAGCAACC GTCCCTGATG GCGCTGCCAC
GCCTCCGCCT CCCACCAGCG ACTTGCTTCA CGTTATTTCT TATACACACA ACAAATTGCC
GACTCAACCT TCCAACGTCT CACATCGTGC ACGCAATGCC TCCGCTACAC CCAAGACGGA
ATGTGTCCTT CACATGGCAT CCTCGATTGT CGGTTGGGTA ATTGGGAAAG GCGGTCAACG
TATTCGCGAT CTTATGGAGG AATCCGGAGC CCGAGTTTGG ATCGACCAAG AGAATCTAGG
CAAAATGGAT CCCCGTATTG TTTACGTTAG CGGTCATCGG AAAAACGTGG ACTCGGCGGT
ATATTTGCTG CAAAAACTCG TTGCGCAAAC ACCTACCGAT CCTTCAGCGT CGAATCAAAA
CACACTCTTG GGGCTTAAAA GCGACAGCTC GTTAGATCCT GGGATTGTGC CATCGCGGCC
GTCGGATTCG CACGAAGGCA CGACTAGTGT GCGTGTGCAA ACTTACGGTG TGCATGCTGA
TCGAGCCAGC GATACTACGG GCAAAGGCAA TCATATTCTC ACCTGCGATA AACGTTTCGT
TCCTCTTTTG ATTGGGAGGC GAGGATGGAC AATTAAAAAC ATCCAGGATT CATCCGGGGC
GCGTGTTGAT ATAGATCAGA ACGTAGCACC TCCCCGTATT ACAATCTCAG GTGCGGAAGA
ACAGGTTTCG ATCGCGGTAG AAATGGTGCG GGATGTGTTG AGCTATCCAC ATTCGCAACT
TCAAGGACGC GCTGGCAGGG ATGAATGCGA TCACGCGGCA GATCATGAAA GGAACACTCC
CGGCGTTGAA TTGCAGATGA GACTCTACTT CCACCGAATG TATCACCAGT GGCCGACAGG
AACAGGAATT CTCCGCCGTC TTCGTTGATT ATGCCAGACG ATGTTCAGAG CACGATTTCG
GCTTCTTCTT CTTTGTCTTT GACTCCAGAG CCATCGACAG CGTCTTCGAA CAGAACCAAT
TTGCACGTTC CTTCTGGACC TATGTTACCG CCCGCGTACA ACGCCGGACT CTATACTTCC
GGTGTCAATG CTAGATCTAC TTCGTTTCAG CCAGGTTTTA ATGCGTCAAA CGGGCCACTT
TTCACTGGCC ACAGTCCGAG TATGATTCTA CCGGCGGAGC AATTGTTTCG TGTGCAATCG
GGTCAGTACG CAGAATTTCA GCAATCGACA AGGGATATAG GTGGCGCATC TTTATTTCCA
GATCAGAATA TAGGCTTTGC TGCTCCTGCT AACGTTCAGC CTCCCAACTA CCTTCAGAGT
CAATCTTCCT CTCCCTTTGA GTCAAATCCA CTGGGATCGT ACGGAAACAT TCCCTCCAAT
CAGCAGCAAG CAGGGCTGTT TCCTCTTTCA CAACCCGTAT TTTCTTCCCG GAATGAACGA
ATTATAGGCC AGTCAACAGC TGTCGATGCG CTAAGGTCGA ACTCATTGGA TCCGAAAGAA
AGCGCGAGTA TGTGGGAACA ACTGGGGGGC GCAGCGGTTT CTCAACCGGC TGTTGCTAGC
GGAGGCAGTG CCGGGTTCCA CTTGGATGCC GCCGTCGAAT TTCTGCAGAA CAGCAATTTG
GGGCCACACT ATTCGCCAAT TTCCAGTGAT GCCGATCAAA ACATTGGGCA GTCGACTGGT
GGATCTGTGA ACCCTCAAAG ATTTGGACCG TCGGCTAGGG GAAAGCGTAC CCCAGCACAC
GGGAAGGCTG AGTCTCAGAT GGTCGACAGC TTTTTCGGAC CGAACAAGCA AGACATCAGG
GACAACAGGG TTTTGAATGG CTTGTCAGGT CTCTCCGTTG CAGACAAAGA TGTATCTACT
GGTCTTTGGG GTGTCCCTAT ACAAGCACTG CCATCACTTG GCGAGGTGGA AGGAGCGACT
ATAAACAAAT CCTCTGCTTT GTATGCAGCA ATTCAACCGA ATCTTGCGAA TAAAAAAGAA
CAACGCCCCG AGCATTCACG CTTTAATTGG GGGCCATGAA GAATTGCAAA ATGTTTTGCT
TGATTTTACT TTCTTGCTGT TTCTCTACCC TTTAGACGTA CTGCCATTGT TTTGGTGCGG
GTTGGATTGC AAGTGGAATA TTATAGACTT CTGGATTGGC AAGTAAGCGA GGCAAAAATA
CACACGAATA GCATGGAGAA AGCTGGCTAA AACAGGGAGA GATGCGGGTC GACATACTGC
TTCCGTGTCG ATCTAGACTT GGTTTCTACA TACTGCTTCT TTGTACAAAC CTTGATCATC
TCTCACCAAT AATTAGCTCC TGCTTACTAG CTCAAAGTTT GTCTTTTACA CCACATCAGA
CACATCTTGG GAGTATTCCT GCACTGTCGT AGATGGCTCG TCCGTGGCGA CATGAACTTC
GGAGAGCATC ACTTCTTCCA AAAGTGGGAA GACTTGAATT CCGTTCTTGT TCCATTCTAA
TGATCCTGAT GTGATAACAG AAACAACTAC TTCCCACACG TACGGAACGA AAAAGATTTC
AGGATCCGCT TCCTCTTCGT ACGTAAAGGT GAAGTCTTCG TTTTCGCTTC TTTTTCGACG
TTCGGTCACA ACGGCTTTGA TGCGGTCGAT GTTCTCCCCT AGCACTTTCA AAGCCTTAGT
TGCCGTCCGG GCGTCACCAT TTTCTTCAAT AAT
 
Protein sequence
MSLRRRLTKP GRESLKGWEG WGSAPHSDLL DTAREPLSQK GPVYYRTSLP CRLVALLGAG 
GSPFLRLHVP ARLVSLLRTQ KTVQEPAYQS PFASLCSPPT KTGTHTLVVR DWTSDNTRMP
VRPKRSLAKY FHGGDPSDVV ECPGYRAANN GNQHRSVHNA TMEALVRGSP VADTANGGGV
LVPSVQPQLS AVRHSNGCLH CHTEDTSENN VQVRCDECGA YICDECHWCH EYQANHEIRV
CDRCDGFYCK GCDEMDQCED CGEVVCASCS TLLSCKFCGG GLCEECATAC GRRDAKFAVE
CDTCRLSYCL VCLASGVKDP CVRCGHRPSK RMEQLVHLRL KSIYKAFKQN SGSNGLEVRA
PHTATKTYAR LRENDEPCDN PDSLLQAAAS VVAAKHPELL TAPEDVESQL VRLDLEQEKA
DAAAAALLAE LEEEEHAEQV KKNKKKKRKG RNGNKKSEEE VDKKLPAKED LLPQPSSPEP
VSVATDLHVV ATSPSIPNSP DPPKASAVDS MQQKLCDLVM NEDMKGLEDL MASLKGVPGQ
AALRKNAKKA LKRLQTPEVD AHIIEAREIT TATLTTPLDE ATVPDGAATP PPPTSDLLHV
ISYTHNKLPT QPSNVSHRAR NASATPKTEC VLHMASSIVG WVIGKGGQRI RDLMEESGAR
VWIDQENLGK MDPRIVYVSG HRKNVDSAVY LLQKLVAQTP TDPSASNQNT LLGLKSDSSL
DPGIVPSRPS DSHEGTTSVR VQTYGVHADR ASDTTGKGNH ILTCDKRFVP LLIGRRGWTI
KNIQDSSGAR VDIDQNVAPP RITISGAEEQ VSIAVEMVRD VLSYPHSQLQ GRAGRDECDH
AADHERNTPG VELQMRLYFH RMNRNSPPSS LIMPDDVQST ISASSSLSLT PEPSTASSNR
TNLHVPSGPM LPPAYNAGLY TSGVNARSTS FQPGFNASNG PLFTGHSPSM ILPAEQLFRV
QSGQYAEFQQ STRDIGGASL FPDQNIGFAA PANVQPPNYL QSQSSSPFES NPLGSYGNIP
SNQQQAGLFP LSQPVFSSRN ERIIGQSTAV DALRSNSLDP KESASMWEQL GGAAVSQPAV
ASGGSAGFHL DAAVEFLQNS NLGPHYSPIS SDADQNIGQS TGGSVNPQRF GPSARGKRTP
AHGKAESQMV DSFFGPNKQD IRDNRVLNGL SGLSVADKDV STGLWGVPIQ ALPSLGEVEG
ATINKSSALY AAIQPNLANK KEQRPEHSRF NWGP
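
The sequences above are laid out in space-separated blocks of 10 residues, 6 blocks per line. To work with them programmatically, the whitespace has to be removed first. The following is a minimal sketch, assuming the listing is pasted in as plain text; `join_blocks` and `gc_fraction` are illustrative helper names, not part of any database tool.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace (spaces and line breaks) from a block-formatted sequence."""
    return "".join(text.split())


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for an empty one)."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence listing, copied from the record.
first_line = "ATGTCGTTGC GTCGTCGTCT GACCAAGCCC GGTCGTGAAA GCCTGAAGGG GTGGGAAGGT"
seq = join_blocks(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 2))
```

Applying `join_blocks` to the full gene and protein listings and taking `len` of each result gives a quick check against the Gene Length and Protein Length fields.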