Gene PHATRDRAFT_48109 details

Gene Information

Locus tag: PHATRDRAFT_48109
Symbol:
ID: 7203273
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011684
Strand:
Start bp: 228276
End bp: 231225
Gene Length: 2950 bp
Protein Length: 857 aa
Translation table:
GC content: 52%
IMG OID:
Product: predicted protein
Protein accession: XP_002182492
Protein GI: 219124400
COG category:
COG ID:
TIGRFAM ID:
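
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and that the stop codon lies inside the annotated span; neither is stated in this record, and the function names are illustrative only.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_bases(protein_length_aa: int, include_stop: bool = True) -> int:
    """Bases needed to encode a protein, optionally counting one stop codon."""
    codons = protein_length_aa + (1 if include_stop else 0)
    return 3 * codons


if __name__ == "__main__":
    gene_len = span_length(228276, 231225)   # Start bp, End bp from the record
    cds_len = coding_bases(857)              # Protein Length from the record
    print(gene_len)            # 2950
    print(cds_len)             # 2574
    print(gene_len - cds_len)  # 376
```

Under these assumptions the span matches the listed Gene Length of 2950 bp. The 376 bp not accounted for by 857 codons plus a stop codon is consistent with the record flagging the gene as spliced.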


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGCCGCG ACACGGATTT CCATCCGAGC GAACCTCCGT GAGTCTCGAA CACGCGGTCC 
GAATCGGCCT GTGCACGGTA CTTACCAACC GTACGGTTAA TTCTCCCTTA CTGCTAGCTG
ACACAACGAA CCCGACGCAG CCAAAGCAGA GAGAGAAGAC AGCCCCAAAC CGCGGCCTAT
CGTAAAGTAG AGTACTCGTT CTATTGGACA GTGTCGGTGC GCAGCTCTCC GTGGGACGTC
GGATACAACC GCAATCGCGC AATCATCCGT CCGGCCTACC CGTAAGTCGT CGAAGCACTC
GACTGTTTGC CGACCTCACG AATCGTCTCG GAGAAATCAT CACCACAGAA CGAGCATTGA
TGGAATCTGC AGTGTCCACG AGTTGTACAG CCTCAGTGTC GGACGCGGCT ATTTCCGGCG
CTCGTGTTGT CCCGTCGAAT GGAGCACCGA TTCCGGCCGA AGCAACCCAC GGCTGGCACA
ACCAGACGAG CCCTCTACTG ACCGTTTCGT CGCAAACGAG TAGTCCGACG ACGACGTCCA
CACAAAGTAC GGGGACGTGT TCGTCTCCGT TGGATCTCTT GGCCGGCGTT TCGTCCACGG
AACACGCCAA GCACCAAACA CCATCAACGA CCAAGGTCCG TCCACAGTCC ATCGGTGCGG
CGATCAAGAA CACCAATCCG TCCGCAGACG CCGTTTCCAA GACAGAAGGC GTGAGCAATA
GACACGTTGC TGCCGAGAAT CAAGCCACTG GAACGAGCGT GGATGTCCAC GAGTCGCAGC
TACAAGGAGC CGAAACATTG GAACAATTCG AAACGAAAAG CGACGACGTT CCATCGCCTC
ACCTTTCGTC CTTCCGAAAA GCAACAACAA CAACAACAAC AATAATTCGT CACAATACGC
GCCACAAAGA AAAACGTCGT CCGGTGGGAT ACGTGGCTCC CAAAACGGAA CGCACCGTCA
AACCAAAGGC TACGCCCAAA CCCCGCCGGA ATCGTACCCT TCAGCGAGCA AGCGGAAGTT
TCCCACGCGC CTGTTTCACG CATCGACCCA GCTTGGCTTG CCTTCGCTCA CAGTCCTTGA
CCGACTACGT TAGAGATGTG GTGCTTCCGA CTGCCAGCGC GTACGAACCG AGTACGGATC
CCGACGACGA CGATGATTAC GACGATTTCA ATCGAATTCA CGAGTACCGT CCGCTGGAAT
GGACTGAAGG TATGGCCAAA ATCACCCTAC CAGAAGGTTT TTGTACGCTT GATGGAATCG
CACGCGACCG GACGGGAAGA GGACTGGATT GGCAAGCGGG TACACCCTTG GGCGACTACG
TCATCCAAAC CCCGATTGAA CAAAACATAC GAGGCCTCGC TGGCGTATAC GAGTACACCT
TTGCCGACAA GCCGCAGGTC ACAATTGCAA GTTTTCGTGA ACAGGCGGAC GCCTACCGTA
AAGTACAAGT TGGCAGCGCT GTCGATGATG GCGAAAATGC GGACTCGGAC GAAGCCATGG
ATAAGTTGGC CCGAAAATTC TGGCAGCGTC TTGGTCCGAC CATGCCCCCT GCCTGGTATG
GAGCCGATCA AGAAGGAACA CTCTTTGGCG ACGATCCTGC ATCCGGCTGG TCGATTGCAA
AACTCGACTC GTGTCTGCAC GTGCTTTCGA ATGTTCCCGG CGTCACTACC CCTTACCTAT
ACGCTGGAAT GTGGGCGTCG GTCTTTTGCG CGCATACCGA AGACATGAAT TTACTAAGTA
TTAATTACCT GCACGCCGGT GCACCCAAAA TTTGGTACGC CGTTGCGCCC GGAAAGGACG
CAGATCGGTT TGCCGAGCTT TGTGCCTTTC AGTACAGTAT GGAGGCCCGC AAATGTAAGG
AATTCATGCG CCACAAACGA TGCCTACTCA GTCCGAAAGT ACTACAAAAG GCAGGAATTC
GCTATACAAC GGCGGTACAG CGACCAGGTG ATGCCATGAT TACTTTCCCG GGTGGTTATC
ATTTCGGCTT CAACGTGGGG TTCAATCTGG CGGAAGCAAG TACGTATGGC ATTGTGACAA
ACGTTTTTGG TTTTTTGATT TGGATCAGCT TACACAAAAT TGTACGCTAT TGATTTTTAG
CAAATTTTGG GGTACCAGAG TGGATTCCCC TGGGTTTGCA AGCTCATGTA TGCTTATGTC
GACCAGATTC GGTTCGAATC GACGTGGAAC GCTTAATTGC GCTCCTGAAA TTGTACCAAC
AGGCTGAGAA GCGGGAGGTG GGTTTGTCGT GGAAAACTTG GAGTCAGCGG AGGGAGGAAA
AATTGGCTCG ACGAGCACTG TCGGAGCGTC GACGCATGTC ATCGCCGCCT TCTAAGAAGA
AAAAACGTTC GAAAGCGCCA CGGACAACTG AATTTTGGGT CGAAGTTAGG AGACCTATAT
CCAAAGAGGA AACTGCAAAA AAGAAAGGCA AAAGGCCACT GAAAAAAGCT AAGCGCACTG
ACGAAGAAAT ATGGCATCTC GCCAAAGCAA CAACACGAAA AGGTCTCGTT CCCGATGCTC
GTGTTCTTTG TGTTTTGCCG GCGAAAGTTG TCTTGGATCG TGTCAAATTT CATTACAGAA
CTACTGGGGA CCCCGATAAT CAAGATGAGC AATGCTTTGC CGGTCAAGTA GTCGAGCTAA
TTGATGATCA TGTTCGAGTC AGATTGGATG GGCTTCCAAA GTCCAGCGAC GAATGGATGC
ATGTATGGAG TCCAAAGCTG TTTCTGGACG GTGGTCGATG GGGCGAGGAT CACGACGTTA
CGGTAGAGGA CGAAATCGGG AAGACGTTAT ACTGGGAAGA AGTAGACTCC AAGAGCCTAT
GTCTATGAGT TGTTGTAACA ACAGATTTGT TTTTGGAAGC AGTCAGTTTT CTTCGCACTG
GGACAGCGCT TTGAAACCGA GATCTTCTTA ATCGGCTCGT ATGAAGTAGA GAAATTCGCT
CAAGTTGTCG
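
The gene sequence above is laid out in blocks of 10 bases, 60 per line. A minimal sketch of stripping that layout and computing length and GC fraction is shown below; the helper names are hypothetical, and the demo uses only the first line of the sequence rather than the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence as displayed in the record.
    first_line = (
        "ATGGGCCGCG ACACGGATTT CCATCCGAGC GAACCTCCGT GAGTCTCGAA CACGCGGTCC"
    )
    seq = clean_sequence(first_line)
    print(len(seq))  # 60
    print(f"{100 * gc_fraction(seq):.1f}% GC")
```

Running the same two functions over all of the sequence lines should give a length comparable to the Gene Length field and a GC percentage comparable to the GC content field, assuming that field was computed over this same sequence.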
 
Protein sequence
MGRDTDFHPS EPPVGAQLSV GRRIQPQSRN HPSGLPVSRR STRLFADLTN RLGEIITTER 
ALMESAVSTS CTASVSDAAI SGARVVPSNG APIPAEATHG WHNQTSPLLT VSSQTSSPTT
TSTQSTGTCS SPLDLLAGVS STEHAKHQTP STTKVRPQSI GAAIKNTNPS ADAVSKTEGV
SNRHVAAENQ ATGTSVDVHE SQLQGAETLE QFETKSDDVP SPHLSSFRKA TTTTTTIIRH
NTRHKEKRRP VGYVAPKTER TVKPKATPKP RRNRTLQRAS GSFPRACFTH RPSLACLRSQ
SLTDYVRDVV LPTASAYEPS TDPDDDDDYD DFNRIHEYRP LEWTEGMAKI TLPEGFCTLD
GIARDRTGRG LDWQAGTPLG DYVIQTPIEQ NIRGLAGVYE YTFADKPQVT IASFREQADA
YRKVQVGSAV DDGENADSDE AMDKLARKFW QRLGPTMPPA WYGADQEGTL FGDDPASGWS
IAKLDSCLHV LSNVPGVTTP YLYAGMWASV FCAHTEDMNL LSINYLHAGA PKIWYAVAPG
KDADRFAELC AFQYSMEARK CKEFMRHKRC LLSPKVLQKA GIRYTTAVQR PGDAMITFPG
GYHFGFNVGF NLAEATNFGV PEWIPLGLQA HVCLCRPDSV RIDVERLIAL LKLYQQAEKR
EVGLSWKTWS QRREEKLARR ALSERRRMSS PPSKKKKRSK APRTTEFWVE VRRPISKEET
AKKKGKRPLK KAKRTDEEIW HLAKATTRKG LVPDARVLCV LPAKVVLDRV KFHYRTTGDP
DNQDEQCFAG QVVELIDDHV RVRLDGLPKS SDEWMHVWSP KLFLDGGRWG EDHDVTVEDE
IGKTLYWEEV DSKSLCL
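
As a rough consistency check between the two sequences above, the start of the gene sequence can be translated in frame. The sketch assumes the standard genetic code (the Translation table field in this record is empty) and uses a hypothetical helper named translate; it does not handle ambiguous bases.

```python
# Standard genetic code, codons ordered with bases T, C, A, G (assumed table).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1; '*' marks a stop codon. Whitespace is ignored."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    # First 42 bases of the gene sequence as displayed in the record.
    start_of_gene = "ATGGGCCGCG ACACGGATTT CCATCCGAGC GAACCTCCGT GA"
    print(translate(start_of_gene))  # MGRDTDFHPSEPP*
```

The first 13 codons give MGRDTDFHPSEPP, matching the start of the protein sequence. The 14th in-frame codon of the gene sequence is TGA, while the protein continues with V, which is consistent with the record marking the gene as spliced. The exon boundaries themselves are not given in this record.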