Gene PHATRDRAFT_39864 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_39864 
Symbol 
ID7195517 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011689 
Strand
Start bp225990 
End bp229265 
Gene Length3276 bp 
Protein Length966 aa 
Translation table 
GC content54% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183952 
Protein GI219127458 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGCAG CGAGTCACAC CGCAACTAGT CGGACTCCAC TGTACGCAAC GTCGTCAATG 
CATGATATAG CCACCGCGAC TGTGGAACCA ACTCTGCTGG CACAAGCGTG GACACCACAA
TTGCTGCAAG CCGAGTCCGG GACGCCGACA CCTTCGGGAT CCACACTCGA CGACTCGTCC
GCGCTTCGGG TAGTCATCGC ACGAGCCGCG GCACGCACGC TGCCTCTTTT GACGGACGCT
TCCGCTGTCC CGTTCGTCTG TCGGTACCGG GTGGACTTGG TCAATCCGTT GACCACACGG
CAAGTACACT TGCTACAAAC CTTGTCTTCC CGACACGCTA GTCTCCAGAG TGTACGAGCG
AAAGTGCTCC GAGCCGCGGC GGTGGCACAC GACGGCAAAG GCAACGCCGA AAACGAAGCC
TGGATCAGTA AAGTGCAAAC CAGTACGTCC AAGGCGGAAT TGGAAGATTG GTACGCTCCT
TATAAGCCCC CGTCGAAAGG TTCCGTGCTG GATCGTATCC AAAACGACCA TCCCGAACTA
ATTCCCCAAT TGGACGCCTT TTGGCACGGC GATCAAGATT CATTTTCGAT TCATCGTATT
TTGAAGCAGC ATCCGAAAGA CGCCGTTTTG CACGTCTTGA GCACCAAGCT TATCGCCAAC
GAACCATCAG TAGTGGAAAC CGTTCAAAAC GAATTGTGGA AACATGCCAA AATAAGGACG
AAACCACCAG CTTCGCCGTC AAACGACCCC GCGGCCGACC AAAAGTACGT TGTATACCAC
GACTTTACTG CCCCATTGTC CCGGTTACGG GATCATCAGA TACTGGCAAT TCGCCGGGGA
GTCGAACAAA AACTACTGCA ATTGTCGTAC GAAGTGGACG GATCCAAAAT TGAAGCCTGT
ATGCGATACG CCGTGCGGCG CCGATGGCCC CGCCACGGTG ACGCCCTCCC AGCCAATCTC
TTGGACGAGG CCGTGCACGA AGCGTATACG CGTACGTTGC GCCGAAAACT ATTGACTCGA
TTGTGGTCGA AAACTTGTCT GCCCCAAGCT CAGGCCCGGG CAAGTATTTG CCGAGAACAC
CTCGCGTGCA CTCTTGGCGC CGCCTGCGAG TCGCGGCGGC GGTAGTAGTG GGAATAACGC
CAGCTCTCGA GCTTCTCCAC CACTGTATCT ACTCAGTGTT GATCCAGGAT TTCAAGCAGG
ATTGAAATGC GCTGTCCTGG ACGTTAATGG GCATGTTGCG CTACAACCGT TGACGACAGT
CAAGTACTTG GGCAATGCCC GAACGACTGG TGTGCGAACG ATGAGTACGT TGCTACGAGA
CGTTGCCGAT GCGACGCAAT CGAATACTGT GGTGGTCACA TTAGGGAACG GCCACGGTAC
ACACGAAGCA CGTGATTTGT TACGGGAAGC CGCCGCTGTT GCTACTGAGA AATTAGAATT
GGATATTCAA GTAGTCCACG AAGCTGGTGC CAGTGTGTGG AGTGTGACGG AAGCGGCACG
AGAGGAATTT CCGAATGATC CCCCTTCCGC CATTGCGGCC GTTTCGATTG GTCGACGGTG
GCAGAATCCA TTGCACGAGC TCATCAAGGT CCCGCCAGCT AGTTTGGGAT TGGGTATGTA
CCAGCATGAC GTACCGCCCG CTGATTTGGA CGATGTACTG CATCGGACCA GTGTAGATGC
CGCTGCCGCC GTTGGCGTGG ACGTGAATAC CTGTCGGGTC GAAATTTTGC GCAAGGTACC
GGGATTGGCC AAACTAGCCG ATGCCATCAT GGCAGCGCGG CCTTTGGCAA CGAGGCAGGA
TTTGCTGTCG AGGGTAACCG GTTTGGGTCC GAAAACATTC CAAGCCTGTG CCGGCTTCTT
GCGGATTGTC GACGGACCAG AACCGTTGGA TGGAACCCTG GTACATCCGG AGTCCTACGA
CACTGCACGA TGGCTACTAC AAACATTGTC GTGGGATCTG TTGACGGTAC CGACGAACCT
CCCACCGCGC GCCGAATGGA AGTCCTGCTG GAAGGATGTA CTGAACGCTG GGTCGACGCA
ATTTGGCGTT AGTCCGGAAC GCATGCTTTC CGTGTTGGAA AACCTGGTGG ATTCGTTAAT
AAACGTAGAT CCTCGATTGC GCCAAGGCAA TGATACCGGG AGCCGTCGTG GTGCGTCTCC
GCGGTGGGAC GGCTCCAACT GCGCACGATT ACCGGCGGGG CTGGCCCACG ATCTAGCGGC
CTTGCAAGCG GCGTGTCCGG TCCGCAACGT CCAAGGGACG GTTCGAAATT TGGCCGATTT
TGGTGCCTTT GTCGATTTTG GTGGACCCCA CGATGGCTTG TTGCACATTT CGCAAATGAT
GGCGAATCGT GTCGCGTTGG ACACGCTCCT GATTGGACAA GGAATCGGGA TCGACATTGT
CAGCGTCCAA GACCATAAAG TGTCGCTCGC TTTGGCCGGC GGTGCAGTAA AGAACGGCGC
GAGCGTTCTT ATGCCACCGC AGACGGGTAC GGGAGTAGGG TCCCGTCGGA CCATACAAGG
TCCGCGGATC ACAGTTCCAG TGGGTCCGAA GCCAACCGGT GGTGGCAAAC GTAGCGCTTC
TACTAGCAAA AGTGTTCGAC CAACAAATAT TGGAAGGAAT ACGAAACGGA CCAAGCGATC
GACGTGAGTC ATCTAATCAT AGATACATGC ACGTGAACTG TGTATGCCAG GAATATAGAT
CTTTTTGGAA GTGGCACGGT GCATTGGTGG AGACCTGGAT CATCGGAAAC GTTGTAAATT
TCACACGCTC TTTTTTTTTG TCTGGGATGG CTCCGAATGT CTCGACGCTC ATACCGATAA
AGCGGTAGTT ATCCTTGGAG ACTGGTGCCT TTCCTCATCA GGAACACTCA CCTCGCAATT
GTTTACCCGT CGCTCCAGGA ATAGGTAGTT GATTGCTCTC TCACTACTGG TATAGTAGCT
ATTGTAACTA TAATAGTTCA ATAACTCTAT TACTAGCGAA CGGGGAAGGT ACCAAGGATT
TCTCTGTCGG TGCGTGACTG GCAGTCCCAG AAAGCAACGC TGTTTCCTCT CGCGCTGGAT
GGAGGATCGT TGATGGCCGT GTACGATCCG ATCTGTTCCA AGTTCTCGCG AATCATGGAA
CTTGTCGGTC CTCGTGTTCA CGGGCTCTTT TCGCTTTTGC GACCTCTCGG CGTCGTCACG
TTGCTGTGTG CGATTTGTCG AGGTCGCTCG CTCTTGTTTG TCACTAACAA CTCGTTTTCC
CCGTCGGCTG ACTGTGACAA CAAACACCCC TATTGA
 
Protein sequence
MKAASHTATS RTPLYATSSM HDIATATVEP TLLAQAWTPQ LLQAESGTPT PSGSTLDDSS 
ALRVVIARAA ARTLPLLTDA SAVPFVCRYR VDLVNPLTTR QVHLLQTLSS RHASLQSVRA
KVLRAAAVAH DGKGNAENEA WISKVQTSTS KAELEDWYAP YKPPSKGSVL DRIQNDHPEL
IPQLDAFWHG DQDSFSIHRI LKQHPKDAVL HVLSTKLIAN EPSVVETVQN ELWKHAKIRT
KPPASPSNDP AADQKYVVYH DFTAPLSRLR DHQILAIRRG VEQKLLQLSY EVDGSKIEAC
MRYAVRRRWP RHGDALPANL LDEAVHEAPG QVFAENTSRA LLAPPASRGG GSSGNNASSR
ASPPLYLLSV DPGFQAGLKC AVLDVNGHVA LQPLTTVKYL GNARTTGVRT MSTLLRDVAD
ATQSNTVVVT LGNGHGTHEA RDLLREAAAV ATEKLELDIQ VVHEAGASVW SVTEAAREEF
PNDPPSAIAA VSIGRRWQNP LHELIKVPPA SLGLGMYQHD VPPADLDDVL HRTSVDAAAA
VGVDVNTCRV EILRKVPGLA KLADAIMAAR PLATRQDLLS RVTGLGPKTF QACAGFLRIV
DGPEPLDGTL VHPESYDTAR WLLQTLSWDL LTVPTNLPPR AEWKSCWKDV LNAGSTQFGV
SPERMLSVLE NLVDSLINVD PRLRQGNDTG SRRGASPRWD GSNCARLPAG LAHDLAALQA
ACPVRNVQGT VRNLADFGAF VDFGGPHDGL LHISQMMANR VALDTLLIGQ GIGIDIVSVQ
DHKVSLALAG GAVKNGASVL MPPQTGTGVG SRRTIQGPRI TVPVGPKPTG GGKRSASTSK
SVRPTNIGRN TKRTKRSTNI DLFGSGTRTG KVPRISLSVR DWQSQKATLF PLALDGGSLM
AVYDPICSKF SRIMELVGPR VHGLFSLLRP LGVVTLLCAI CRGRSLLFVT NNSFSPSADC
DNKHPY