Gene PHATRDRAFT_48981 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_48981 
Symbol 
ID7195253 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011688 
Strand
Start bp216850 
End bp218936 
Gene Length2087 bp 
Protein Length639 aa 
Translation table 
GC content53% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183699 
Protein GI219126929 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TGAAAACTCC CAGAAAGAAA CGCATTCTCA AAAGTCCATG CTAGAGTATG GACATCACTC 
TCAGATCGGT TTTGAAATGG GCAGTGACAT CGAGTCAAGG TCCGATGTTT CGTCCGCGAA
AGAGTTGCAG AAGGCGACGG ACCGACCGAC TGCGATGCCA CGGAATGCCA GCACACCTGC
AAACACCATC TCTGAAGTGA CAGAGCAAGC CGCAGTCGAC CACGTCCCGT CAGAAGATGA
ACAGGGTCCA CTCGTTGATG CTGGTACGCA CAAGCGACCG TCAGACTCGG AGATACTTTC
CACGCCACGA CGTCCGGTAC TCGGCCGCTT GGATTCTACG ATGGCAGTGG CAGATCAAGG
CTCTCAAAGT AGTTTGAGCA TTGAATCAGC GACGGCCCTT GCGGCTACTG AGCCCGATCG
CCTTGTAGTC GATCGAACAC CAGAAAGTAC CTCTGCAGGG ACACCGGTGC GGGAGCGCTC
GTCCCAAACT CCTGCAAGAC GCACACCGCT TTCATCGGGA AAGAAATCTG GACGAATTGT
TATTTATGAC CCGAATGAGG ACTCATCGAC GGAACCAGGT ATGTATGCGC CGGATCGAGG
CGAAATCGAG CAAGAAGAAG AAGTCGCGGA TAACTTATCC CGTTCTGATA GAGCGGAAGA
GATTGTCGAC AGCAGGAAAG CCCGACGATC CGCTGGCAGC TTGCCAGCGA CACCAGACGA
CAAAGAAGCT GGCTTTCTTT TGCACGATAC ACACCAAACG GGTACCCCGG ACATTGACAC
AGTTCGGCAG TCTCAATCTT TGCTACGGAG TGGACGGTTA CAGAGAAACA CGCTGTTAGA
GTCTGATCGC CAGGAGAGTG ATAGCGAATC AGTGGCTTCC GTCGTTGCAA AGGTGCTTGA
CGATCAGGTG GACTGGTCGT CCAAGACTGT CATTCAGCTG AAACAAGAAC TAGAGAGTCG
TGGATTGTCA ACACGGGGCA AAAAAGCGGA GCTCATTGAT CGCTTGACCG AAACAACTAG
TGTTGGAAAG AAGGACGAAG GAGTACAGGA AGAGACGATC AAAGCCGAAA GTGATACGAC
TTCAAACCAC GAACGTTTGG ACGAAGAGAC ATCGAATGAT GAGTCATCAG AAGGGGATAC
CAAGGTTCCA ACGGAACAAG TGAACTGGTC GTCCAGGACA GTCATCCAAC TAAAACAGGA
ATTGGAGAGT CTTGGACTAT CCACAAGGGG TAAAAAGGTG GAGCTCGTAG ATCGAATATC
CGCCGCAAAG AGTGCTGAGG ACGAAGACGA AGAAGCAGTT GACGGAACAC CCGAAGAAGC
CGTACCGTCA GAACTCGAAG ACACAGACGG AGAGGCAGTC ATCGACTGGT CCCAAAAAAC
GGTCGCTGAT CTTCGACGAG AATTATCTCG TCGCAACCTG TCAACCAATG GTCGAAAAGT
AGAGCTAGTA GCTCGACTGA CACACGGCGA TGAGGAAGTT GATGGAGTCT CGGTTGCATC
CAGCGAGTCC GCCGGTCAGG ATGAAGGGAT CGATGAAGGC AATGGGGACG TTCGTATTGA
CTGGTCCAAG AAGACTGTTG CTGAATTGAG GGCGGAGCTT GGATATCGTG GGTTGTCGAA
GGATGGTTTG AAGGCCGACC TAGTCGCCCG AATGGAACAT ACCACCGCTG GCGAAAAGAC
GGTGACCCCA ACGAAAAAAC CGGCAACTAG AACGAACGAA GAAACTTCCG TGGCATCATC
GACTCGGCCC GGTAGTGGGC GCCGCACGCG AGCTCGCGAC CACGAGAAGG AGAAGCAGGA
CGATCCTGTC GGTATCGACT ACGCCTCGGT AGCCTCTTCG ACAGCACGTC CATCTCGGAA
TCGTACCAGA GCCCAAGCCA ATGATATAGG TTCCACTACC GGTACTCGTG CTTCGCGACG
GGCCACAAGC CGGCGCTCGA AGCGAACCAC TCGCTAGTTG CATTTCTTCG CTTTGCATGT
CGCAAGAATA AGAGTAGTCT ATTTTGCGTC TATCAGTAAG TATGACTCAT GTCAGCTAAA
GCACTGTGGC GGCAGAATCT GGAGATCTAG CTAGCATTTT TTGTCTC
 
Protein sequence
MLEYGHHSQI GFEMGSDIES RSDVSSAKEL QKATDRPTAM PRNASTPANT ISEVTEQAAV 
DHVPSEDEQG PLVDAGTHKR PSDSEILSTP RRPVLGRLDS TMAVADQGSQ SSLSIESATA
LAATEPDRLV VDRTPESTSA GTPVRERSSQ TPARRTPLSS GKKSGRIVIY DPNEDSSTEP
GMYAPDRGEI EQEEEVADNL SRSDRAEEIV DSRKARRSAG SLPATPDDKE AGFLLHDTHQ
TGTPDIDTVR QSQSLLRSGR LQRNTLLESD RQESDSESVA SVVAKVLDDQ VDWSSKTVIQ
LKQELESRGL STRGKKAELI DRLTETTSVG KKDEGVQEET IKAESDTTSN HERLDEETSN
DESSEGDTKV PTEQVNWSSR TVIQLKQELE SLGLSTRGKK VELVDRISAA KSAEDEDEEA
VDGTPEEAVP SELEDTDGEA VIDWSQKTVA DLRRELSRRN LSTNGRKVEL VARLTHGDEE
VDGVSVASSE SAGQDEGIDE GNGDVRIDWS KKTVAELRAE LGYRGLSKDG LKADLVARME
HTTAGEKTVT PTKKPATRTN EETSVASSTR PGSGRRTRAR DHEKEKQDDP VGIDYASVAS
STARPSRNRT RAQANDIGST TGTRASRRAT SRRSKRTTR