Gene PHATRDRAFT_42978 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_42978 
Symbol 
ID7196211 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp1679846 
End bp1682084 
Gene Length2239 bp 
Protein Length632 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177347 
Protein GI219111191 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CCTATCCCCA TTTTGTCAGT CAACGAGAAA GATACGATAG TCAAATACAG TGCACTCGAG 
CTAGTGACTG ACTGACGAAC TGCGAAGGTT CGTCGAACGA AGCGCCTAAC TTTGGAATTG
TTTGACTGTG AGATTCTGCT GGTTTGCGGA TTTGGTTCCA TTTCGGCCCC CCGCAAGACT
CTGAGTCTCT CAATCCCACC AGCCGTTTAA TCAGTTGGGT CCTTTGGCTG CGATAACATT
ACGTAACTAG AGCCGGAAAG CAGTATCATC ATGGATAGGA TGCGCTGGAC CGAACAGTCC
GCCAGTCGAG TGGGCTGTCC AACTATAAAA AACCATTCGG TCACCCACTT TGGCGATTAT
CTGTTTTGCT TTGGTGGCTA CGACGGCCGT CGCAACCACA TGACCTTACT GATTTATAGC
ATTCTAGAGC AACGCTGGTT TCGTCCTCAC CACGCGATGG GAACGGAAGG ACAGGGCTCT
AATTTTCTCG GCGATCCCAG TTTTCTAGTC CAGGGCACCC CACCTCCGGG GCGGAACGGA
CACTCCGCCA CGCTGGCCGC GGATCCCGAC GACGAAGAGA ATGGGCGAGT TATCATCATT
GGTGGATGGC TAGGGACAGG GCCGCTAGCC GCTTCCGACA TGCATGTTCT AGATATTAGT
GGCGCTGGCA GACAATTGCG ATGGTACCAG CCTCCCATCA AAGGAACGCC GCCCGGACCT
TGCAATATGC ATTCCGCAGA TTACGTGTCT GCGTTAAAAG AAGTTTTTGT CTTCAGAGGC
GGGAACGGAC GGGAATATTT GAATGACTTG CATGCCTTGC ATGTGGAAAC CTTGACCTGG
AGACGAGTTG AAACGACGGG GGCGATTCCT CAGCAACGCG CTAACCATTC GTCGGCCTTT
CTGGAAGAAA CGCAGGAGTT GTTCGTTTTC GGTGGCTGGA ACGGTACAGA GCGTTTGAAT
GACATTCACA TACTTGACAC GGCCACGAAT ACATGGACCT GTCCGCGAGT TGGTGGTGTA
CTACCTCATC CTCGTGCAGG TATGACTTTG ACGGCACTGC GTGGACGGCT GTACTTGTTT
GGTGGTAGCG GTACGTCCGC CAAGTGCTTC CAGGATTTGC AAATTCTAGA CCGTCAGACA
ATGGCTTGGC TAGATGTCAC CCAGTACGAG ACTGGCCGCA ACGGCCACCA CGATTCAGAA
CACGAGACTA CGGGAACATT TCGCTTCGGT GGAAGTCACA CACTAGACGA TGGTCAATCT
CATATGTACG GTCATCTCGA AGAGACTCCC CGTTTCACCT TCGGAGGTGC TCAGCAAGAT
AACAGCCTAG ATGCCAATCA AGAGGCCGTC GCCCGCCCAC CTCAGCAAGG AGGGGGACAG
GACGGTGCAT CGCTTTCGTT AGGTTCCTTT ACGAGTAGCC GCGCTGATTG GCACGCTCGT
GATATGGCGG CTCGACATCG TCAACCGGCA TTGCCCAACG TTTCTCCGAA TCCCAATGAC
GAGGACTCGG TACCCGCGGT ATTGATCGAC GGTACGGGAC CTGGGAGGCG TGCTGGTCAC
ACGGCGACGG CGGTGCATCG CAAAATATTT GTTTTTGGTG GCTCGTGCGG ATCTGATTAT
TTGAACGATT TCTTTGTTCT CGATACAGAC CCTCCCCCGC ATGCGCTGGT ATCAGAGCCT
ACCAGCGTTC AACTCTTCGA ACGTCGTTTG CGGCACTTTT TTAACGATGA AGAATTTTCG
GACGTCACTT TCGTCGTTCA GGGGGAGAAG GTTTACGGTC ATAAAATGGT ATTGTCCATT
GTTTCGGATT GTTTCCGTGC CATGTTCACA ACCGGGTTTC GCGAGTCGGA AGCGATGGAG
ATCGAAATAC CTGATTGCAG CCACGCGTCG TTCTTATCCG TGATGGAGTA CGTCTACACG
GGAGCCTTAC CAAAAATGGA TATGGCCAAT CAAGATCGGG ATCGGAGCTT GACTCGCGTG
GTCGAAATGC TGGAGCTATC CGACCGATTC TTTTTAGACC ATCTGAAACA AATATGTGAG
AGCATCTTGC AGCCGGCGGT GACCCACGAT ACGGCAGAAT ATCTTTTGGG GATCGCTCAG
AAAACGAATG CGAGTCAATT GCAGTCCATT TGCGAGCACT TTGTGCGCAA CCGCAACGAG
ATTGCGTAGT AGACTATTTC GCGCGCGATG GAGCTTTGGT GTAGAGTTGT AAGATAGGTG
CGTTGTTACT ATTAGTACG
 
Protein sequence
MDRMRWTEQS ASRVGCPTIK NHSVTHFGDY LFCFGGYDGR RNHMTLLIYS ILEQRWFRPH 
HAMGTEGQGS NFLGDPSFLV QGTPPPGRNG HSATLAADPD DEENGRVIII GGWLGTGPLA
ASDMHVLDIS GAGRQLRWYQ PPIKGTPPGP CNMHSADYVS ALKEVFVFRG GNGREYLNDL
HALHVETLTW RRVETTGAIP QQRANHSSAF LEETQELFVF GGWNGTERLN DIHILDTATN
TWTCPRVGGV LPHPRAGMTL TALRGRLYLF GGSGTSAKCF QDLQILDRQT MAWLDVTQYE
TGRNGHHDSE HETTGTFRFG GSHTLDDGQS HMYGHLEETP RFTFGGAQQD NSLDANQEAV
ARPPQQGGGQ DGASLSLGSF TSSRADWHAR DMAARHRQPA LPNVSPNPND EDSVPAVLID
GTGPGRRAGH TATAVHRKIF VFGGSCGSDY LNDFFVLDTD PPPHALVSEP TSVQLFERRL
RHFFNDEEFS DVTFVVQGEK VYGHKMVLSI VSDCFRAMFT TGFRESEAME IEIPDCSHAS
FLSVMEYVYT GALPKMDMAN QDRDRSLTRV VEMLELSDRF FLDHLKQICE SILQPAVTHD
TAEYLLGIAQ KTNASQLQSI CEHFVRNRNE IA