Gene PHATRDRAFT_43504 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_43504 
Symbol 
ID7197196 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011670 
Strand
Start bp655250 
End bp658319 
Gene Length3070 bp 
Protein Length977 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177978 
Protein GI219112453 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGCCGC TTACCAAAGG AAAAAGCGGT AACGCAAAGA ATGCCGACTT CAAACGTATT 
AAGGCCAAAG TGGGCAAGAA AGCCCCAAAG CCAGCCAATG TCACGGACAC GGCGTTCCGC
GCTGCGTCGT TGCAAACTAG CTCCCAGACG TCCTTCACCA ATGGCGGCGC TCCAGCAGCA
AATCTTTCCG ATGATGTCTC ACTCTATTCA GCCCGTGGTC GATCTTTGCA AAGTTTGGCG
TCTCAGTTGT CCCATCCGGC TGCCGCAGTA CGCGCATCCG CTGCGAAAGG GCTGTATGAT
TTAGTTTCCG GAGCAGCAGC GACGGCTACC GGTGCGACTA GCAGTAGTCT CCTGCAAGCG
CACTTGTCGG CTCTCATTCC AGCCGTTGGT AAATGCGTTG TCGACGAAGA CAGCGAGGTT
CGAACTGTTG GAACGAACAT CCTCCGTGAA ACGGTAAACA AACTGAACGA AAAAGCTACG
ATGGCACTGA GGCCATTCAT CAAACTTTTG ATTGCATTCG TGGCGTCGGC ACTGAATAGT
TTGGATCGTG ATTCTCGTCG CGACGGTGCC ATCCTTGTCG AACTGCTGAG TTCGTCGGTA
CCCACATTGG TAGCACCCTA CGCGGTGGAA CTGCTACCAG CCTTGATCCG ATCGCTGGAC
GATCGGGACA CCCGTTTGCC GAAACAACCC GGCATCAACG GAAGCAGTGA CACCGGCACC
AAAAAGCGCA AGCGTAGGCT GCCGAGCGTC GTATCGAATA AATCGACCGC TAATCTTAAT
GGGCGTCACT TGTTGCTGCA GTCCCTTGTC ACCTTGCTCC AATCGGCAGC AACCCTCACA
GGTAGTAGTA AATCCGAAAC GCAACAACAG AGTGCGACAG GAGAGTACTT GTCGGAGCCC
GATTTAATTT TTGGCACTGG AGGGCGATCC CGAAATGCTG TGATTCTCCG AGGTCGACCA
ACACGACGGC TTGCTCAACT GTCTCCCATT CAGAAGCTTA CCGACTTACC ATCCTTGGAA
GCATTTCGGC TATCTTTAGC GAGAAACGAC GGCCCGATCA AAGTGGGTTT GAATACGTCT
TTACCTCCGA AAACGGTTCA GCAATTGTAT CACAAATTGC GAGATTGCTT TGTTGAAACA
ACTCAGCGAG GCTATTTTGA CGCTAAACAA GGCTACGTGA TGACTGTTAC CGATTTGTCT
ACCTTCTTAC TCGTTGCAAA GGCTTTACGA TTGACCTGGG ACGTTTTTGG CAACGAATGG
TCTCAGCTTC AAGCAGAATC GGATGTCGCT GACATGCGCA AAAGTTTTGC ACAGGCTGTG
TCTTTAATCC TGGAGATTTT TCCGATTTCG CGTGCAGACG AAGCAACACA AGTTGTGGCC
GACGATGTGA ATGGTGAGCT GTGTGTGACT CTGGTTTTGA TGGGGCCCAC CAGCGCCAAC
CAGGGGAAGG AGAAAACTGG CTGGGCAGAT AAAGTTGTTG CGCACGTCTT GACTTCCATG
GACGAAATGC ATGTGCAGTG TCACCAAGCC AGCTCCATGA GCCCAACTTC GGCACGCTCT
GTCTTTGCTG TATTGAATGG GTTGGTTTTG ACTGGGCTGT GCAATGTGAA ATCTCAGAAC
AGATTAATTA GTATGTTCAG CTCCACATTC TTTGCACCCG AAAGCAAGGA CTCAGCTCGA
ATGTGCACAA GCATTTGTCG TCAGGCGACG GATGTGGCGA ACAGGATCTT TGAAAGGATA
AACTATGATA TCGACAAGGC CAGTGAGCCA ATGCAAAAGC TTGCTGTTGA TGTCCTGAAG
ACTATACCGG CCTATTTGGT AGCTTGGGGC GCCATCTACA TACCGGAAAG TTCGAGCGCA
ATAGCATTGC TCCATCACCT TGTGCGTAGA CTGGGCGACG AAGATAAGAG CATGCTGGAT
TTGGGCAAGC TGCGCAATGA TTTGGATCAG TTGATGATGG TCTCCGAAAG CATGCAGAAG
CAAAAGTTCA CGACGTGGTC TACGGTCCTT GAATGCTATC CGCCTACGCT ACAACGACTG
TTTGTAAGTC TCATTATTAT GCTCGGAAAG CCCAGCGATA TTACCCTGAA GCTTTTGGGG
CGAATCTCTG CCCGATGCCA AGCCAAAGGC GACCCTGACA AGTTGGCCAT TTATGTTGCT
CAATCAATGT TTAGCATCCG TAAGACCGTC TCTATGTCAT CGTTTCTGAC GTTTTTGATT
GACGGCACCG GTGTCTTTCT TTTCGAAGAG TCGGCGTTTC ACAGTAAACC CAAGACCGAG
GGAGATAACC GGTATATACG CCTGTTTGAA TTGGATAATG GTGTTCGTTT GGCAGCCACG
AACCTTGTCG CGTGTGGATC CTCGGCCAAG ACACTTCCGA TGCTGGAAGC CCTCTTATCA
ACATTAATTC GAAGCTGTGA CGTAACAAAG ACGTCGACAC AACGTGAGTT GTTCCGGATT
CGAGCTGGTT TTTCGATTCT GGCACTCTTT GCCTTGGATC TCCGTCGTCA GGGATCCAGC
ATTTTTGACA TTCTGCCCCG GGCGTTCAAG GTACAAGCCA TGGCCGGCAT CGGACGGCTT
CTCGCACACG CACCGAGTCT CGACTCGGAA TCAAACGAAA ATATCGACAG GGCCCTTCAA
GCGTGGATCC GTCCGGTGGT GACCCTGTTG GCTTCGGAGG ACGGACTTTT GGTAGATTCG
TTTACGGTCC TGGTGTCGTC CTTGCATAAC TGGCCCGACA CCCACCGGAC CAGCGCGGTC
CAGACCCTGT TGCTAGTCAT ACGGGCCCCG ACGTTGGCAC CCGTATTTCG ACGTAGTGAT
ATTGCGGCCA TGGTGGCGCA GGCTAAAGTC TTGGAGCAGG CCTCGGCGGA GAGTCCGCTC
GCAAGCATCG CGGGTCAAGT CGTGGCGGAA CTCGAATTGC ACATTAGCGG GTGAGCACAC
TCAATGACAA GTCGCACGTA TACAACAGTA CAAGAAATGC TCTTTATTGA AATGCGAGCG
CAACGGTGGC GAAAAAAAAT TGTTTCCAAT CATCGAATAA CGTTTAATAG CTAGAGCTTA
TAGTAGAGGG
 
Protein sequence
MAPLTKGKSG NAKNADFKRI KAKVGKKAPK PANVTDTAFR AASLQTSSQT SFTNGGAPAA 
NLSDDVSLYS ARGRSLQSLA SQLSHPAAAV RASAAKGLYD LVSGAAATAT GATSSSLLQA
HLSALIPAVG KCVVDEDSEV RTVGTNILRE TVNKLNEKAT MALRPFIKLL IAFVASALNS
LDRDSRRDGA ILVELLSSSV PTLVAPYAVE LLPALIRSLD DRDTRLPKQP GINGSSDTGT
KKRKRRLPSV VSNKSTANLN GRHLLLQSLV TLLQSAATLT GSSKSETQQQ SATGEYLSEP
DLIFGTGGRS RNAVILRGRP TRRLAQLSPI QKLTDLPSLE AFRLSLARND GPIKVGLNTS
LPPKTVQQLY HKLRDCFVET TQRGYFDAKQ GYVMTVTDLS TFLLVAKALR LTWDVFGNEW
SQLQAESDVA DMRKSFAQAV SLILEIFPIS RADEATQVVA DDVNGELCVT LVLMGPTSAN
QGKEKTGWAD KVVAHVLTSM DEMHVQCHQA SSMSPTSARS VFAVLNGLVL TGLCNVKSQN
RLISMFSSTF FAPESKDSAR MCTSICRQAT DVANRIFERI NYDIDKASEP MQKLAVDVLK
TIPAYLVAWG AIYIPESSSA IALLHHLVRR LGDEDKSMLD LGKLRNDLDQ LMMVSESMQK
QKFTTWSTVL ECYPPTLQRL FVSLIIMLGK PSDITLKLLG RISARCQAKG DPDKLAIYVA
QSMFSIRKTV SMSSFLTFLI DGTGVFLFEE SAFHSKPKTE GDNRYIRLFE LDNGVRLAAT
NLVACGSSAK TLPMLEALLS TLIRSCDVTK TSTQRELFRI RAGFSILALF ALDLRRQGSS
IFDILPRAFK VQAMAGIGRL LAHAPSLDSE SNENIDRALQ AWIRPVVTLL ASEDGLLVDS
FTVLVSSLHN WPDTHRTSAV QTLLLVIRAP TLAPVFRRSD IAAMVAQAKV LEQASAESPL
ASIAGQVVAE LELHISG