Gene PHATRDRAFT_48471 details


Gene Information

Locus tag: PHATRDRAFT_48471
Symbol:
ID: 7203699
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011685
Strand:
Start bp: 592640
End bp: 594609
Gene Length: 1970 bp
Protein Length: 495 aa
Translation table:
GC content: 46%
IMG OID:
Product: predicted protein
Protein accession: XP_002182868
Protein GI: 219125187
COG category:
COG ID:
TIGRFAM ID:
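
The coordinate and length fields in this table can be cross-checked with a short calculation. The sketch below assumes that Start bp and End bp are 1-based and inclusive, which agrees with the stated gene length but is not confirmed by the record itself. The function name span_length is hypothetical, and the coding-length estimate assumes one terminal stop codon is counted.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (an assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the Gene Information table.
gene_length = span_length(592640, 594609)
print(gene_length)  # 1970, matching the "Gene Length" field

# Bases needed to encode the 495 aa protein, assuming one stop codon.
protein_length_aa = 495
coding_bp = protein_length_aa * 3 + 3
print(coding_bp)  # 1488

# The gene span is longer than the estimated coding length, so the listed
# sequence appears to include flanking non-coding bases.
print(gene_length - coding_bp)  # 482
```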


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00174392
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CGTTTTCCTT GTCTCATCAA GCAGTGTTGT AATTTGTCAT TTGAACGTTG ATCCTCTCAC 
TGTCAGTGCC GGCCGAAACA ATCGTAAGCT TTCCGGTGGA GGGTTTAACG TTTGATAGTA
TTGAAAAGTC CGTTCGCGGT CCTTAAGGAG ACGCGGAGAT ACGGAGATTG AAGATCATCA
TCATGTACGC AATTTCTGAT ACCAGTTCAC AGCTGAACAG CGATCCAGCC ATGGGGACCA
AAAGTCTCAA GCTCTTTCAA CCAAGAGCTC CTCTTTCTAG GCCTCCGCTG GCTCCGCCTG
TTTCCGCTTC ATGTCAACCT CCGAACTTGG AAGACCCGGC TCCGGAACAT TTAAACGTGG
TTGAGGAGAA CGATCTTTCC CATCACTTCA TCCCAGATTT TAACCTAGTC TCGGCGACAA
ATTTTTCGCA GGAGAGTTCG AGCTGCTGTG TCTCCTCACT GGGAGGATTT CTTGAAGAAG
AGCAGAAATC ATACGACGAG AGAGAATGCA TTCCTGTGGG ATTCTTTTGG CGTAACAAGC
AAAAGGTTTC ATCCGCAGAT GTGGCGTCGG TAGGTAACGC CAGTGACACC GTGGTGTCGG
TGGACGACCA TCAACTCCAT CTGTCCGGTC GCGATTTACA CGAATCCGCG AAGATGGCCT
TGAATGCTGG AGACTTCACC AAAAGTCTGT CCATGTTCGA AGCGATTCTA ATGGCGCAAG
CCCAGCGGTT TGGTCCCTGT CACCCATCCG TTGCCGCTGC TATGCACAAT GTCGGAGGTA
TGCATATCAT GAATCGTTTG AACATGTTAT TTCAAGCTGT AGATAGCTCA TCCTTCCTGT
GCTCACTGCT TTTAGTCTGT CGACAGCGGA TGGGGCAACA TGATACAGCG GAGAATCTTT
TTGCTGAAGC TGTTCAAGTG CGTCGGCAAA CTCTCGGCAG CGATCACCTG GAAGTTGCTG
CTTCGCTTTC TAAGCTAGGA TCAACAAGAG TGGCGCTGCA AAAGTTCGAT TTGGCCTTTG
GCGATCTCCG AAACGCCTCC AAGATTGCTA CCAAAAATCT TGGCCATGAA CACAAGACAG
TTGCTCAAAT ACAGTCCCAC CTCGCATGTT TGTATTTCGA AGGCGGCGAG CTTTTTGCTG
CTCAAGCAAC TTTTGAGGAC GCTCTAGAGA TCTACCGCGC TGTTTGGTCT AGTCAAGAGT
CGAATCGCGA TACCACTATG ATGCAACTTA CAGATACACT TTGTAACATT GGATCTATTT
TGAACCGACG CAAACGTTTT GGAGATGCTA TTCACTCCTT TTCGGAAGCT TTGGATCTCC
AACGTGGTAT TTTTTCCCAC AACCACCCGC GCATTGTGCA GACTCTGGAC AATTTAGGAT
ATTCATACTC AAAAAACAAG GAATATGGGA GGGCCTTGAC CTGTTACAAG ACTATGCTTC
GCATGCAATT CTCTCATTAC GGGACCTTCA ACAACTTTTG CCTCGAAACA TTCCGCAAAG
AAATTATAAT GTATGAAAAG CTCAAACGTC TTCCCGAAGC AGTAAACGAA ACGAAGGAAA
CACTCAAACT CGAAATTTCG GTCTTGCCTA GAGATCACAC AATCGTGGTC CAAACAAAAC
AGCTATTGGA AGACTTGGTA AAGCGTTGCA AACGAAAGTC TTCGCCTTGA CTTACACTTG
CGTCAGGTTA ACCTTCATAG ATATCATTGC TACTTTGAAA GCTGTTCAAT GCTATTGATC
CAATGAGAGA CATCTCAGCG AGATGATCTT TGGATTTACT GTCGAGTAAG TGGTCTAGTA
TAGGGGCAAA TCTGAATGTC TTGACATTCC CTATGGATCG TCTCGAGAAT TATTTTGGAA
CGCTGACTAA CAGTGACTTT GTTTTTCAAC GGAACGCAGC TGAAGGGCAT ATTTTGGTGG
TTTCGTGAAA GTAATGGCAA TTAATAAATT TAATGGAATG CGACATAGTC
 
Protein sequence
MYAISDTSSQ LNSDPAMGTK SLKLFQPRAP LSRPPLAPPV SASCQPPNLE DPAPEHLNVV 
EENDLSHHFI PDFNLVSATN FSQESSSCCV SSLGGFLEEE QKSYDERECI PVGFFWRNKQ
KVSSADVASV GNASDTVVSV DDHQLHLSGR DLHESAKMAL NAGDFTKSLS MFEAILMAQA
QRFGPCHPSV AAAMHNVGGM HIMNRLNMLF QAVDSSSFLC SLLLVCRQRM GQHDTAENLF
AEAVQVRRQT LGSDHLEVAA SLSKLGSTRV ALQKFDLAFG DLRNASKIAT KNLGHEHKTV
AQIQSHLACL YFEGGELFAA QATFEDALEI YRAVWSSQES NRDTTMMQLT DTLCNIGSIL
NRRKRFGDAI HSFSEALDLQ RGIFSHNHPR IVQTLDNLGY SYSKNKEYGR ALTCYKTMLR
MQFSHYGTFN NFCLETFRKE IIMYEKLKRL PEAVNETKET LKLEISVLPR DHTIVVQTKQ
LLEDLVKRCK RKSSP
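
The sequences above are printed in space-separated groups of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup plus a GC-fraction calculation. The helper names clean_sequence and gc_fraction are hypothetical, and the record does not say how its own GC content figure was computed.

```python
def clean_sequence(text: str) -> str:
    """Join a grouped, multi-line sequence block into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned nucleotide string."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence block, as displayed in the record.
first_line = "CGTTTTCCTT GTCTCATCAA GCAGTGTTGT AATTTGTCAT TTGAACGTTG ATCCTCTCAC"
seq = clean_sequence(first_line)
print(len(seq))  # 60: six groups of ten bases

# Applied to the whole block, len() can be compared with the "Gene Length"
# field and gc_fraction() with the "GC content" field.
print(round(gc_fraction(seq) * 100))
```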