Gene PHATR_44196 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATR_44196 
Symbol 
ID7204109 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011671 
Strand
Start bp1272675 
End bp1276037 
Gene Length3363 bp 
Protein Length989 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002186216 
Protein GI219113265 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.362095 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
AGTGTACCAG TGCTAGCTTC GTTCACTGGT GTAGGGTCAT ACTAACAAAG TCCGCTATAG 
TATACAGCGA AGGACCCGAA TACATAAGTA GTAAATTTTC AAGAACCTCT GTATTGGTTT
GGCTTCACAG GCCACCATGG CACCTGCCAC TTGGCAAATG ACGGGCGGAG CGGTCTATGC
GCACCTTTTG GATAACGTGC TTCTTCTTCC CCAAGGGCAC CCTATTCGTC TCAGTTTTGC
ACAACAAGGA TACGAATCGG CCGATGACCT CCTATGTATT TTTGAGAATG AACTTGAAAC
TCTTGAATAC ATTCCTCTTG CCCCTGCTGA CGGCCCCGAA ACTACGGCAC CGGTTGCCTT
ACTCATGGCA TATTGACAGA TCATCTGTCA TTTCCTCCGG TGGCAAGCGT CCCTTGAGCG
TCAAAAGGGA ACTCCTTTGA AGAACTCCGA GCTTGCAGCC CTGAACAACG AAGACTTTGT
CCTGTACCGC CGATCCGCTC TCGGCCAGGT CTCTTCGACT GTTGCTCCAA TAGTCACAAA
CCCCAATGCT GCAATCCCCA CCGCTAAAAC TCGACCTGCT GTGGAAGATT TCAAGCGTGG
GATCAAATGA GACAAAACCC ATTACCCCGT GCTCAAAGAC GACAGGTACT GGGATAATTT
CTATCGGTCC TTCGTCGTCA CTGCCGTCTC CCATAACGTT GAGAAGGTAC TCGACCCATC
ATACTTGCCT ACTGATCCAC TGGAAAAGTC GTTGTTTGAA GAACAAAACA AGTTCGTATA
CTCAGCCTTG GAGCATACAC TTCAGACGGA CATGGGCAAA AATATCGTTC GAGAACATAG
TTTTGATTTC AATGCCCAGG AAGTTTTCCG TAAAGTGGTC AAGCACTATA CAGAGTCCGC
CTCTGCAAAG ATCAGCTCCT CTACCACTCT AGGATACCTG ACCACGGCAA AGTATAGCTC
ATCATGGACT GGCACAGCGG AGGGATTTAT CCTACACTGG AAGAATCATT TGCGTATATA
TAATGATACC GTCCCTACGG GTGAGCAGCT CCCACAACAG CTTTGTCTCA GTCTATTGGA
GAATGCTGTC CATGATATAC CCGAACTTCG TCAGGTTAAA ATCACGGCAA CTTTAGACTT
AGCAAAAGGT GGCAGCCCTA TTAGTTATGA CGGTTATCTC AGTCTACTAC TTGCATCAGC
ATCACTCTAT GACAATGGCA ACAACCTATC TAATGCTCGT GGCAACAAGA ACAAACATCA
TGTTTATTCT ACTGACTTAG TCTACCATCC AACTGACTTC GACAATGATC TAGACGTAAG
TTACGATATA GATGTGTCAC CCACAGCAAT CTATGAAGCC AATGCCCATG CACGCAACTC
CGGTAATAGT GGCAATCGCA GTCGCAACGC AGCTAGCCCC AGAGACCGAC CTTATATTCC
CCGGGAAATG TGGAATCAAC TCTCAGAGGA TGCAAATGGC CGGCCGCGCA ACCTTTTTCA
CAGGTGCTAC AAGCCAATAC GCATAGCCAT GGTAGCAGCG AAACCGCGGA CACTTTCCAT
GATTGCGCAC CGGAGACTGA GTTGTTGGCT CACCTTACTG ACCGCGTCAG TCGTATGAAC
GATGGTGATA TTCGTAAAGT CCTTGCAGCA TCACGTGACA ACGTCTCCCC ACAACCAGGA
GCGAGACCCA AATCCATGCA ATCCAATATG CTACGTTATC AAGTCTCTCG GCATAATGTC
AACGGTACCA CTGCAGCTCT TGTCGATCGT GGTGCTAATG GCGAACTTGC CGGGGCGGAC
ATCATGGTGC TCAACAAAAC AGGACGTTCC GCCAATATAA CTGGTATTAA TGATCACACA
TTGTCCGATT TGGATATTGT CACCGCTGCA GGATGTGTTG AATCCCATAC CGGTCCTATC
ATTGTAATTA TGCATCAGTA TGCGTATCTT GGCACTGGTA AGACTATACA CTCCAGTGCG
CAACTCGAGC ATTTCCATAA CAACGTTGAA GACCATTCAC GTACAGTTGG TGGAGACCAG
CGCATTGTGA CCTTAGATGA TTATATCATC CCCTTGCACA TCCGCCAAGG TCTTCCATAT
ATGGATATGA GGTGCCCAAC AGATGCCGAA TTTACCTCTC TCCCGCATGT GATATTGACC
TCTGATGTCG ATTGGGACCC GTCAGTCCTT GACAACGAGA TCGATCTGGC CACCGATTGG
TACGACACTG TACAGGATTT ACCCCAACTA CCATATGTCG AACCGCGTTT TGACCACATG
GGCAAATATC TCCATCGTCA TATTTCGCTT TGTGACACTC GCCACCATGC CGTTGACTGT
ATCCTTCAAT GTCAGCAGCA TGAAATTCAG CGTAATGACC ATGACTACGA AACCCTCCGT
CCTTGTCTTG GTTGGGTATC CGCCGATACC GTTCGTAAGA CTATACAGGC CACCACCCAG
TATGCACGAG AGGTATACCA CGCACCGTTA CGCAAGCATT ATCAGTCGCG CTTCCCGGCC
CTAAATGTCC ATCGGCGTAA CAAGCCAGTT GCCACCAATA CCATTTGGTC AGATACTCCT
GCTGTTGATA GTGGTGCCAA ATTTGCGCAA CTTTTCGTGG GCCGCCGATC CCTTGTCACT
GATGTTTATC CCATGAAAAC CGAAAAAGAA TTTGTTAACG CTCTCGAAGA CCATATTCGG
TTTCGCGGCG CTATGGACAA GCTCATCAGC GACCGTGCAC AGGTCGAGAT TAGTAAAAAG
GTCATGGATA TCACCCGTGC TTACAACATT GACCAGTGGC AAAGCGAACC ACACCACCAA
CACCAAAATT TTGCTGAACG TCGCATTGCC ACTATCGAGG CTAACACCAA CAACATTCTC
AATCACACCG GTGCCCCTGA CTCCACATGG CTTCTTTGTG TCACGTACGT GTGCTATGTA
TTCAATCATC TCGCCCATGA ATCCTTGCAC AACCGTACAC CCTTAGAAGT CCTTACTGGT
TCCACTCCTG ATATCAGTGT TCTTCTTCAG TTCCATTTTT GGGAACCCGT CTATTATCGA
CTCGAAGATG CGACATTTCC GTCTGATGGT ACTGAACAAA CGGGACGTTT CGTAGGCATT
GCTGACTCCG TTGGCGATGC TCTTACTTAT AAGATCCTCA ACGATACTTC TAATAGAATC
CTCTATCGTT CCAGCGTGCG CTCTGCAAAC CTTCCCGGTG AAACCAACCT ACGCCTTACA
TCACGGGATG GGGAGAATGG CCCTAAACCT ATCAACTTTA TCAAGTCTCG TCGAACCGAA
AATCTAAATT CCTATGATTT AAAGGAGTTG CCTGGTTTCA CCCCCGACGA CGTTTCTCAC
TGA
 
Protein sequence
MAPATWQMTG GAVYAHLLDN VLLLPQGHPI RLSFAQQGYE SADDLLCIFE NELETLEYIP 
LAPADGPETT APIICHFLRW QASLERQKGT PLKNSELAAL NNEDFVLYRR SALGQVSSTV
APIVTNPNAA IPTAKTRPAV EDFKHDRYWD NFYRSFVVTA VSHNVEKVLD PSYLPTDPLE
KSLFEEQNKF VYSALEHTLQ TDMGKNIVRE HSFDFNAQEV FRKVVKHYTE SASAKISSST
TLGYLTTAKY SSSWTGTAEG FILHWKNHLR IYNDTVPTGE QLPQQLCLSL LENAVHDIPE
LRQVKITATL DLAKGGSPIS YDGYLSLLLA SASLYDNGNN LSNARGNKNK HHVYSTDLVY
HPTDFDNDLD VLQANTHSHG SSETADTFHD CAPETELLAH LTDRVSRMND GDIRKVLAAS
RDNVSPQPGA RPKSMQSNML RYQVSRHNVN GTTAALVDRG ANGELAGADI MVLNKTGRSA
NITGINDHTL SDLDIVTAAG CVESHTGPII VIMHQYAYLG TGKTIHSSAQ LEHFHNNVED
HSRTVGGDQR IVTLDDYIIP LHIRQGLPYM DMRCPTDAEF TSLPHVILTS DVDWDPSVLD
NEIDLATDWY DTVQDLPQLP YVEPRFDHMG KYLHRHISLC DTRHHAVDCI LQCQQHEIQR
NDHDYETLRP CLGWVSADTV RKTIQATTQY AREVYHAPLR KHYQSRFPAL NVHRRNKPVA
TNTIWSDTPA VDSGAKFAQL FVGRRSLVTD VYPMKTEKEF VNALEDHIRF RGAMDKLISD
RAQVEISKKV MDITRAYNID QWQSEPHHQH QNFAERRIAT IEANTNNILN HTGAPDSTWL
LCVTYVCYVF NHLAHESLHN RTPLEVLTGS TPDISVLLQF HFWEPVYYRL EDATFPSDGT
EQTGRFVGIA DSVGDALTYK ILNDTSNRIL YRSSVRSANL PGETNLRLTS RDGENGPKPI
NFIKSRRTEN LNSYDLKELP GFTPDDVSH