Gene PHATRDRAFT_49652 details


Gene Information

Locus tag: PHATRDRAFT_49652
Symbol:
ID: 7198299
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011691
Strand:
Start bp: 311924
End bp: 315054
Gene Length: 3131 bp
Protein Length: 842 aa
Translation table:
GC content: 54%
IMG OID:
Product: predicted protein
Protein accession: XP_002184343
Protein GI: 219128277
COG category:
COG ID:
TIGRFAM ID:
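
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, which is consistent with the reported gene length of 3131 bp, and that the protein is followed by a single stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def min_cds_length(protein_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon."""
    return 3 * (protein_aa + 1)


# Values copied from the Gene Information fields of this record.
length = gene_length(311924, 315054)
coding = min_cds_length(842)

print(length)           # span of the gene on the replicon, in bp
print(coding)           # bases required for 842 codons plus a stop codon
print(length - coding)  # bases in the span that are not coding sequence
```

The difference between the two numbers is the portion of the gene span not accounted for by coding sequence, which fits the record's "Is gene spliced: Yes" flag (introns and possibly untranslated regions).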


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.390924
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CCTTCCTGGT CGAGTTCACT ACTCACTGTC AATTTAAGTA GTAGTGAGCT ATACGGTGTG 
AACAATCCTT CCTTCGTGCA GAAGAGTTTG GGTGCACGTC AACAAAACAC GTCAACGTCA
ACGTATACCT CGCGACCGAC CGATCACTCT TCATGTCTCC GTGCAGCGTC GTCCTGTTTT
GCTGCGTCGT CGTGGTCGTC GTGGGTTCCA ATGGGATTGT CCACGGCTTT ACCAGTGTAC
CCCCATCATC ATCACCACCA TTGCTGCGGG TAGTAAAGGG TGGGATTCGG ACGCACCGTC
CATCACTATC ACCATTATCT CCGCACTCGT ACAACCATGG TAGTCGGCGA CCACTCCGTA
TGCTTTTGCC TGCGAACGAT TCCGCGTCTC ATACTCGTCG AGTCGCGACA CGCCCGACAC
GGTACGTGAC AATCTCACGA CGCTACTTGC TCCTGTTCGT GGACTCGATC GTTGCCATTG
ACTCGCAAAC TACTTCTGTT GCTGATACTG GAACGATTGC GTGCCTGCAT GTTCGTGTGT
TGGGATTCTC GTGCTCTCGT CTTTCGTCAC TCACGCGCTC ACTCGTTCGT ACTACTATTA
CTACTACCGT GACAGACTTC TCCGTCCGAC GTCGGGGTCG TCCAAGGGGT ATCCGTCCCG
TAAATTTCGG GTGTTGTCCC TCGCCGGACG TCCCGACAAC AACAACGACA ACGACTCGAA
AGAGCGTCGC CAGTTTATGC TCGATCCCGT CACCCAAACC TTCGACCACG TGACCAAGTC
CGTGCAAGAA TCGGTCGAGA ACGCCCGGAC GCGTCTCGAA CAGCTACTCC GCTTCCTACC
CGCACCCCTC CGCAATTTCT ACCGCGCCTT CTTCGAAAAA CTCCACGCGT GGAAAGGATT
CTTGGTTAGT TTCACTGCCG GAGCCTTTCT CGCCACCGCC GCCATTATTT ACCCCATTTA
CGCTTCCGTA GAATCACTAT CCCAACCCGT CACACTCTTC GAAACCATAC TCGGCGATCT
GGAACAAGCC TACGTGGAAG AAGTCGACAC CAACAAACTC TTCGAAACCG GAATTGCCGC
CATGCTCCGG TCACTCGATC CCTACACGGA ATTCGAAGCT GCCCAAGAAG CCGTGGCACT
CACGGAATCC ATCGAAGGCC GCTACGGTGG TGTGGGTCTC GTCATTGCCG GAACACCGCG
GGCCGCGGCC GAACCGAAAT CCAACGCCAA CCAACTACTG CCCGCCGCCG CACAATCCGA
CACTGCCAGT CAAGAAGATA CCGAACGCAA TCGGAATACC ATGAGTAACG TCATGACCGA
AGAGGAAGAA GACGAATACA TGGATCGCAA GGAACAACGC AAGGCTCTGG AGAAGGCACG
CAAACAAGGA ATCCGGGTAG TCACCGCCTT TGAGGGGTAC GCCTTTGATT ACGGTACGTT
CGTTGGAACG CCCGCCCCGA CATTTTTACT AGTACAGCCA GTCGGGTCGG GGCTCTACAC
GAACCTCCGC ACGCTCAACC GGCACTTTTT TCTTCTGTTT CCCAGGCTTA CGCGTCGGCG
ACAAGCTCTT GGCGATTGAC GATAAGCCTC TCACAGCGGA TACGACGGTC GAAGACGTCC
GCAATATGCT CCGTGGACAA CCCGGAACCT TGGTAAGTAT TGAGTTCAAC CGAGATGGCG
TCGATGACGT ACAAACCGTT ACCATGCCCC GTGCCGTTGT TCGCCTCCGG GACGTCAAAC
TCGCCACCCT CGTGGGAAGT CCCCGGGACG GGATCGGCTA CATCCAATTG AGTGGCTTTA
CCTCCAACGC CGGTGCCGAA ATGCGTCAGG CCATTACGTA CTTACAGCAA CGGACACTGG
ACGCGACCAA CGGAGACAAG AGTTTACAGG GACTCGTTCT CGATTTGCGG GGCAACCCCG
GTGGCCTTTT GACGTCGGCG GTAGACGTAG CGTCCCTGCT CGTCCCGAAC GGCAGTGACA
TTGTGTCGGC CCGCGGACGG GGCTTTCCCG GAATGCTCTA CCGGAGTCGG GTGGATCCCA
TTCTGAATCC CAACACCAAA CTGGCGGTGC TCGTCAACGG ACAAACGGCG TCAGCGGCCG
AGATTGTGGC CGGGGCCGTC CAAGATTTGG ATGTGGGCGT CATTGTGGGT GCGGACCGCA
GTTTTGGCAA AGGGTTGGTG CAAAACGTGG AAGAGTTACC TTTTAATACG GCACTCAAGT
TCACCGTAGC CAAGTATTAC ACACCCAGTG GCCGGTGTAT TCAAGGCGTC AACTATAAAG
GAGGGGGTGG CCTCAAGGAA GAAAATGGAG GATACATTGC CAGTAAGGTG GCCGACGCTG
ATCGCAAGGT GTACTATACC AAAGCGGGCC GCATGGTGAG AGACGGCGGC GGTGTGGAAG
CGGATTACAA AATTGAAGCT CCCAAGGCTT CGGCCCTGGA AGTGACGTTG CTGCGATCGG
GAATGTTCAA CGAGTATGCC GCGGAATGGA GTAAAACACA CATGCTGACC AACAATTTTG
CCGTGGACGA AGATATTTAC CGAAACTTTA TTGCCTTTGT CGATCAAAAG CAGAAAACTG
GTGACATTGA GCTGGATGCG CTGTACAGTC GACCGCTATC CGATTTGAAA AAGGCTCTTA
AACGGAGTGG ATATAAGGGT GCCGAAAAGG AGGTGGAAGT GCTACAGGCC AACATTGTTC
GGGAAGTCCA AAAGGATTTC GACAAGTATC GAAAAGATAT TAAAGAAGAT ATTTCCCAAG
GCATTCTGGC CCGATATCTT CCGGAGAGTA TGTTAATTGA ACGAGGTATG AAAAACGACG
CACAGGTGGA GGCAGCGATC AAGCTGGTGG CCAACAAGAA TACATTCGAT AAGATTCTCG
CGCAAGGAAA CACGGCCGAG CGCATGGGGG GCGCCAATAG TTTGAATATG GCATCCGGCG
CCTCTGCACA AAGCACTAGC GGTGTACGAG CTACTATCCA ATGGTAGAAT TGGATCCTGC
AAACGTCTTG TACAAACAGT AACAGGATGA AGCCAACCGA GGAGAGAACT GGCCAGCAAA
CTATTCACCG GTCCCCTCGG GTTCGCTTGG GCAGCAACAT TAAGCTAACT ACGCTTCGGC
ATTTACCCAT T
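
The gene sequence is printed in space-separated groups of ten bases, so it has to be cleaned before its length or GC content can be compared with the "Gene Length" and "GC content" fields. The following is a minimal sketch of that cleanup, using hypothetical helper names and only the first printed line of the sequence as demo input; running it on the full block is left to the reader.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a grouped sequence block and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, used only as a small demo input.
first_line = "CCTTCCTGGT CGAGTTCACT ACTCACTGTC AATTTAAGTA GTAGTGAGCT ATACGGTGTG "
seq = clean_sequence(first_line)

print(len(seq))                    # number of bases on that line
print(round(gc_fraction(seq), 3))  # GC fraction of that line only
```

Applied to the whole sequence block, `len(clean_sequence(...))` should match the reported gene length, and `gc_fraction` should be close to the reported GC content, if the page rounds to a whole percent.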
 
Protein sequence
MSPCSVVLFC CVVVVVVGSN GIVHGFTSVP PSSSPPLLRV VKGGIRTHRP SLSPLSPHSY 
NHGSRRPLRM LLPANDSASH TRRVATRPTR LLRPTSGSSK GYPSRKFRVL SLAGRPDNNN
DNDSKERRQF MLDPVTQTFD HVTKSVQESV ENARTRLEQL LRFLPAPLRN FYRAFFEKLH
AWKGFLVSFT AGAFLATAAI IYPIYASVES LSQPVTLFET ILGDLEQAYV EEVDTNKLFE
TGIAAMLRSL DPYTEFEAAQ EAVALTESIE GRYGGVGLVI AGTPRAAAEP KSNANQLLPA
AAQSDTASQE DTERNRNTMS NVMTEEEEDE YMDRKEQRKA LEKARKQGIR VVTAFEGYAF
DYGLRVGDKL LAIDDKPLTA DTTVEDVRNM LRGQPGTLVS IEFNRDGVDD VQTVTMPRAV
VRLRDVKLAT LVGSPRDGIG YIQLSGFTSN AGAEMRQAIT YLQQRTLDAT NGDKSLQGLV
LDLRGNPGGL LTSAVDVASL LVPNGSDIVS ARGRGFPGML YRSRVDPILN PNTKLAVLVN
GQTASAAEIV AGAVQDLDVG VIVGADRSFG KGLVQNVEEL PFNTALKFTV AKYYTPSGRC
IQGVNYKGGG GLKEENGGYI ASKVADADRK VYYTKAGRMV RDGGGVEADY KIEAPKASAL
EVTLLRSGMF NEYAAEWSKT HMLTNNFAVD EDIYRNFIAF VDQKQKTGDI ELDALYSRPL
SDLKKALKRS GYKGAEKEVE VLQANIVREV QKDFDKYRKD IKEDISQGIL ARYLPESMLI
ERGMKNDAQV EAAIKLVANK NTFDKILAQG NTAERMGGAN SLNMASGASA QSTSGVRATI
QW