Gene PHATRDRAFT_34944 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_34944 
Symbol 
ID7200147 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011674 
Strand
Start bp724826 
End bp728231 
Gene Length3406 bp 
Protein Length1094 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179491 
Protein GI219117393 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTACCGG CCACCAGGCA AATGACTAGC GGAGCTGCTT ACTCGCATTT TTTGGATAAT 
GTATTTTCAC TTCCTCAAGG GCACCCAATC CGACTTAGTT TCGAACAACA AGGGTATAAT
TCTGTTGATG ATCTCCTCAG TATTTTTGAG AACGAACTAG ATGCCCTTGG ATATGTGCCT
CCAGCGAGTC CTGACACCCA TGAAGACCCT CAGTGGACCC CATTGCTCAT GGCGCACCGA
CAGATCCTTC GTCATTTCCT GCGTTGGCAG GCATCACTTG AACGGCAAAA GGGAAGTCCT
TTGGAAAATT CGGAGCTTGT TGCATTGACT AGTGGAGATT TCATTTTATA TCGACGCTCA
GCACTCGGAC AAGTCTCTAA TGTTCCGGCC ACCATCAGTC CTTCTCTGAA CAACCAGTTA
AGTACGTCCA CGAAAGCTCG ATCGGCAGTC GACGAATTTA AGCGAGGAGT CAAGCGTGAC
AAGACCCACT ATCCTATCCT TAAGGATGAC CGATACTGGG ACAATTTCTA CCGGTCTTTC
GTGGTCACCG CGGTATCCCA TAACGTCGAA AAGGTACTTG ACCCATCCTA TGCACCGACG
GACCCCTCAG AGAAGTCTCT CTTTGAGGAA CAGAAGAAAT TTGTGTACTC TGCTTTGGAA
CATACACTTC AGACAGATAT GGGGAAAAAC CTTGTTCGCG AACATAGTTT TGACTTCAAT
GCCCAAGAAG TTTTCCGTAA GGTTGTCAAG CACTACACAG AGTCTGCCAG TGCCAAGATT
GGGTCCTCCA ACACTTTGGC CTACCTCACT ACGGCAAAAT ATGGCACATC CTGGACAGGA
ACGGCGGGGA AGGGTTCATC CTTCACTGGA AGAACCATCT TCGTATCTAC AATGATATGG
TCCCTATGGC AGAGCAGTTG CCTAAACAGC TTTGCCTCAG TTTGCTTGAA AACGCTGTAC
ACGACATCCC TGAACTCCGC CAGGTCAAGA TCACCGCTAC TTTAGACTTA GCTAAAGGAG
GCACTCCCCT CAACTACGAA GGCTACCTGA GTCTATTGCT TGCATCTGCT TCTCTATACG
ATAAAGGGAA CAACCTTTCC AATTCTCGTA GTGTCAAGAG CAAGCGTAGC GCCTTTCTGA
CCGACCTCTC GTATGATCAA CCGGACTTCA CCGAAGACAA TGGAATTGAC TATGATATTG
ATCTCTCTCC TGCAGTGATC TATGAGGCCA ATGCTCACAA CCGCAAAGTC AGTCCATCTG
GCCACCGTAA TCGCGATCCG GCAACCAATC GAGAGCGTCC GTATATCCCT CGCGAGATGT
GGAATCAGCT TTCAGATGAT GCCAAAGCCA TTCTCCAAGG CCTGTCCGCA CCCGACAAAG
GCCCTACTCG ATCCGGTGAT GTCTCGCAAC GTGCGTTGGA AGCGAATACC CACGCCAAGA
TATCGAACGG AAATGGCGAG TTCAACCGTA GCGAACCAGA CAACCAGCAA GCTGAAGCAT
TCCATGACTG TGATCAAACG ACGGAGCTCC TTGCACACTT GACTGACCGT GTGAGTCACA
TGGGAGACGG CGATATCCGA AAAGTCCTTG CTACATCCCG CCGTACACCA ATCAATTGTA
CCCAGTCATC GGACAATCGA CAACAGTCTG TTCAACTCAA CGTTCTGGAA TATCAAGTCT
CTCGTCATTC CGTTGAGAAC AAAACTGCTG CTCTAGTCGA TCGAGGTGCC AACGGTGGAC
TTGCTGGCTG TGATGTCAAA GTTGTGAACA AGACAGGACG GTCTGCTAGT ATAACGGGTA
TCAACGAGCA TACCCTGTCA GATTTGGATA TTGTCACTGC CGCTGGGTTT GTTGAGTCTC
ACAAAGGCCC TATCATTGTG ATTATGCACC AATACGCCTA TCTTGGCAAG GGAAAGACCA
TCCACTCCAG TGCCCAACTT GAGCATTACC GAAACACAGT CGAAGACCGG TCTCGCAATG
TTGGAGGACA ACAGCGGATT GTTACCTTGG ATGATTATAT CATTCCTCTT CATGTTCGAC
AAGGCCTCCC GTATATGGAT ATGCGACAGC CTACCGATAG CGAGTTCGAA TCTCTTCCGC
ATGTTGTGTT GACTTCCGAT ATTGACTGGG ACCCTTCTAT TCTAGACAAT GAAGTTGACA
TGGTGAACAA CTGGTACAAT GCAATGCAAG ATCTTCCGGG CAATGCCTAT GTTGAACCAC
GATTTGACAA CACAGGCCAA TACCTCCACC GCCATATAGC GTACTACAAT CTCGATCGCG
AGGACGCTAT TGATTGCATT ATCCAGTGTC GTAAGCACAA TGTCAAACGC AATGAACGGG
ATTATGAAGC ATTACGTCCC TGCTTGGGAT GGGTATCCGG TGACACTGTC CGAAAAACCA
TCATGGCTAC GACACAGTAC GCTCGCGAAG TCTACAATGC ACCGCTACGA AAGCACTTCA
AATCGCGATT CCCGGCTCTA AATGTGCATC GGCGCAACGA GGCTGTTGCA ACGGATACTA
TCTGGTCAGA CACACCTGCT GTTGACAACG GAGCCAAGTT TGCACAACTG TTTGTGGGGA
GACGTTCCTT AGTCACCGAT ATTTATCCCA TGAAAACAGA CAAGGAGTTC GTCAATGCCC
TTGAAGACAA TATTCGCCAT CGTGGAGCTA TGGATAAACT TCTGAGTGAT CGAGCCCAAG
TTGAAATCAG TAAGAAGGTT GCTGATATTA CACGAGCCTA CAACATTGAC CAATGGCAAA
GTGAACCTCA TCATCAACAT CAAAATTTTG CCGAACGCCG TATTGCTACT ATTGAAGCTA
ATACCAATAA TGTTCTTAAC AAAACCGGTG CTCCTGATTC AACTTGGCTC TTGTGCATTG
CCTACATCTG CTATGTCTTC AACCATTTGT CCCATGAATC TTTGCACGAT CGTACACCAC
TCGAAATTCT TCTTGGTAGC ACCCCTGATA TCAGCGTACT TCTCCAGTTT CATTTTTGGG
AACCGGTGTA CTACCGTCTC GAAGATCCAT CTTTCCCTTC CGATGGTACC GAAAAGAGCG
GTCGCTTTGT TGGCATTGCT GAATCTGTTG GGGATGCTCT CACTTACAAA ATCCTCACGG
ACGACACCAA CAAGATCTTA TACCGCTCCA GTGTGCGTTC CGCATTGAAA TCCGGAGAAA
TCAACCTACG CCTTACGCCA CAGGAAGGGG AGAGTAATTC TAAGCCTATC AACTTTGTCA
AGTCGCGTAG AACTGAAAAC AAAAATTCCT ATGCCTTAAA GGATCTACCC GGTTTCACCC
CTGAGGACCT TATTGGACGC ACGTTCCCAA CCGATACTCA GGATGATGGG GAGCGTTTTC
GTGCACGTAT CACAAGGAAA ATCTTAGATC CCGACAAGCC CTCTGA
 
Protein sequence
MVPATRQMTS GAAYSHFLDN VFSLPQGHPI RLSFEQQGYN SVDDLLSIFE NELDALGYVP 
PASPDTHEDP QWTPLLMAHR QILRHFLRWQ ASLERQKGSP LENSELVALT SGDFILYRRS
ALGQVSNVPA TISPSLNNQL STSTKARSAV DEFKRGVKRD KTHYPILKDD RYWDNFYRSF
VVTAVSHNVE KVLDPSYAPT DPSEKSLFEE QKKFVYSALE HTLQTDMGKN LVREHSFDFN
AQEVFRKVVK HYTESASAKI GNGGEGFILH WKNHLRIYND MVPMAEQLPK QLCLSLLENA
VHDIPELRQV KITATLDLAK GGTPLNYEGY LSLLLASASL YDKGNNLSNS RSVKSKRSAF
LTDLSYDQPD FTEDNGIDYD IDLSPAVIYE ANAHNRKVSP SGHRNRDPAT NRERPYIPRE
MWNQLSDDAK AILQGLSAPD KGPTRSGDVS QRALEANTHA KISNGNGEFN RSEPDNQQAE
AFHDCDQTTE LLAHLTDRVS HMGDGDIRKV LATSRRTPIN CTQSSDNRQQ SVQLNVLEYQ
VSRHSVENKT AALVDRGANG GLAGCDVKVV NKTGRSASIT GINEHTLSDL DIVTAAGFVE
SHKGPIIVIM HQYAYLGKGK TIHSSAQLEH YRNTVEDRSR NVGGQQRIVT LDDYIIPLHV
RQGLPYMDMR QPTDSEFESL PHVVLTSDID WDPSILDNEV DMVNNWYNAM QDLPGNAYVE
PRFDNTGQYL HRHIAYYNLD REDAIDCIIQ CRKHNVKRNE RDYEALRPCL GWVSGDTVRK
TIMATTQYAR EVYNAPLRKH FKSRFPALNV HRRNEAVATD TIWSDTPAVD NGAKFAQLFV
GRRSLVTDIY PMKTDKEFVN ALEDNIRHRG AMDKLLSDRA QVEISKKVAD ITRAYNIDQW
QSEPHHQHQN FAERRIATIE ANTNNVLNKT GAPDSTWLLC IAYICYVFNH LSHESLHDRT
PLEILLGSTP DISVLLQFHF WEPVYYRLED PSFPSDGTEK SGRFVGIAES VGDALTYKIL
TDDTNKILYR SSVRSALKSG EINLRLTPQE GESNSKPINF VKSRRTENKN SYALKDLPGF
TPEDLIGRTS RQAL