Gene PHATRDRAFT_41582 details

Gene Information

Locus tag: PHATRDRAFT_41582
Symbol:
ID: 7199369
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011699
Strand:
Start bp: 253148
End bp: 256719
Gene Length: 3572 bp
Protein Length: 874 aa
Translation table:
GC content: 50%
IMG OID:
Product: predicted protein
Protein accession: XP_002185504
Protein GI: 219130715
COG category:
COG ID:
TIGRFAM ID:
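
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so, but that reading reproduces the listed gene length) and that the 874 aa protein is followed by a single stop codon. The function name `feature_length` is illustrative and not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose coordinates are 1-based and inclusive (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the record above.
start_bp, end_bp = 253148, 256719
protein_length_aa = 874

gene_length = feature_length(start_bp, end_bp)

# Coding nucleotides needed for the protein plus one stop codon (assumption).
coding_nt = (protein_length_aa + 1) * 3

# Whatever remains of the gene span is not needed to encode the protein,
# which is what one would expect for a gene flagged as spliced.
noncoding_nt = gene_length - coding_nt

print(gene_length, coding_nt, noncoding_nt)
```

Under these assumptions the span works out to the 3572 bp listed as the gene length, and it is longer than the coding requirement, in line with the "Is gene spliced: Yes" flag.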


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTGATG CTGATGCTCC CAATGCTCCC GAAGCTGGTC TCGCTCCGGC TGTTGCTGCC 
AATGCCCCTG GTATTGTTGG TGACGCCCAA GCTGCCGCTT ACGCTGCGTT TGCTGCTGCT
GAAGCTCAAA TGGCGAACGC TCTCGCTGTT CTTGCGGGGG AAGCACCCCC TGATGCTCAA
CCCTTTCCTC CACCTCCGGG AGTGGCCGAC ATTCCAGTTG GCCTCCCTCC GGTTGTGGTG
GATGCTCCTC TCATTGATCC GTACGACAGT CTTCTCATGA GGGCTGGCAT GAACTACGGA
ACTCGCAAGG CTTTTTCTAA GGAGGGCTAT GGTTTGATGT CTGACTTGGT TACTCTTAAC
CAGAAGCGAC TTGAATCCCT CATTGATATG ATGAACAAGA AGCATCGCGG TAAATCCTTT
CAAGGCGCAA TTCCGTTGGG CCTGAACCTT GTGGCTGAAG ACCTTGAAAT TGATATTGGC
CACAAAACCA AGACCACTCT CAAAGTTATT CTCCATTGGG CGGACCTCCA GAAAAGTCTT
GGATTAGACG TGAATGCCGA GGATTATACC AATGTTGTTG GTCAGCTTGC CCGCGAACGT
ATGGATGAAG AGGAGAAGAT TCTTGAGGCG GCCAAGAAGC TTACGCCTTC TAAGCCAACA
ACCCTCAAGG ATATGACCAA GTGGCGTTCC TTCTTTGAAA ACTGGAACTC GTACATGAGT
CAGTGTCGCA GTGCTGCGGC TATCCCTCTT TTGTACGTTT ACCGTACCAA CGAGCAGCCC
GAGACCGCTT TGGTCGGAAC CTATGTGAAT ATGGATGCCT ATTTGGTTGC CCAGACAGTA
CTGTCTGGTT CCAACTTTGA GATTGATAAT CAACGGGTTT TTGACGAATT CAAGGAAGCA
ATCACTACAA CCGGACCTGG TTGGTCTTTC ATCAAGACGT ACAACCGAAG CAAGGACGGT
CGTGCTGCCA TTTTGAAATT AAAGGAACAG GCAGAAGGAA CATTAAACGA GTCCGTTTGC
CGTGATGATG CCATCAAGAT CCTGTCAACT ACGACATACA ATGGTCCGAG TCGTAACTGG
AATATTGATA TGCTGTTGCA GAAATTTCAG TATGTTATCT CGGAATTGGT CGAAATTGAC
GGAGTCGCGT TGCCGGATGG GCAGCTTGTG ACTTATTTGG TCCAGGCATT GAAGGACCCA
AGTCTGAGTT ATGTTCGTGA CACAATTCGC ACCAATGCAA CTTATCGGAA CAGTTTTCCG
GAAGCGCAGC TTTTTGTGAA GACTTTTGTG TCTTCGTCCA CGAGCAAATC CGAAAACACG
CCTTGACAGG TCAATGATGT GCAAACATCA GGTAGTGGGG CCTCCGGTGG GAGTATGAAA
GGAGGTACCG GGAAAGGAGC CAGCAAGCCG ACTCCCTTCA AGGGTGCAGT CACGGCTCGC
AGTTATACTC CGGGAGAATG GAAAAGTTTG TCCAAGGACC AACAGGAAAA AGTGCGATCG
CTGCGTAATA AAAAGAAGCA AGGAGGGAAA CCCGAGGAAT CAGAGAGGAA TGTTGACAGT
GTAGCACGGG ATGAGCCTGT GGACACTAAG GAAGTCCATA CCAGCCGTGA AATGGAACCA
ACTTCAGATG CGGCTGGCCT GCAATTTGGC CGTGGTGCGT ATAAGAAATC GGTCGGATTC
ACTGCAGACA CCGCTTCTCC TTCCGAAAAC GGAACGAAGA AGCAGAAAAC GCATCACAAT
GCGTGAAACG CGGCACCCAA TGCCAGTGTT TCGGGGACTA AGCAATGCAT TTTACCAGAT
TGAGTGATAT TGAGCCTCAC CTCTACACGC AGCATTTGTG ATCTCAACGC ATGCACTCAT
CTTGGTGAGG GCCGCTGTGA GTTGGATTCA CATGCAGACA CATGCGTGGC TGGGGCAAAC
ACTGTCTTGA TTGGTGAATC GCAGAAGTCC GTAACTGTGC GACCTTTCTC CAGTGAATAT
TCTGCACTGA AGAATATCCC CATTGGAACG GTTGCCACAG CTTACACAGT ACCAGAAGAC
GGGAGAGTGG TGCTTCTTAT TATTAATCAG GCCCTATTCT TTGGGGACAG ATTGAAAAAC
ACCCTATTGA CCCCCAACCA GATGCGAGAC TTTGGCATTG AAGTTGACAA TGCCCCTCGG
CAGTACGTCG CCAACTCCAA GCACTCTTTG TATGTTCCTG ACTCCCAACT TCGGATTCCG
CTGCAGCTGC GCGGTATATT CTCGTTTTTG GAGTCGCGGA AGCCCACGCA ACAGGAACTT
GACGAGTGTG AGCACATCAT ACTCACCTCT GATGTGCCGT GGGAGCCTTG CTCAGCGGAC
TTTGCCCGTC GAGAAGAAGA GGCCGCTAAG AGAGACCGGA GCGTATCATT GGTAGACACA
ACGGGACTTT CCACTGGCCA CGCAATCCTA TCAGCACACC CATATGGTAT ACGAACTGTT
GCGGCTTCGC AGCGAATACT TGAGACTTTT TGTTCCTTGA CAGAGGTTGA ATTGTGCGAG
ACAAATCTGG CGGACCGCCT TATTGCCTGT GTTAATGTTG CGTCGGATGA TTACTGTGGA
GACGGGTTGG ACGGTAGAGC TGACTTGGAT GTGTACCCGG ACTCAGAAGA CTTCACTCGT
GTCGTCTCAG GTATGACATC AAGCGAAAGA CGGTCAGCGT TGACAGCTGA GGTTTTGTCG
AAGCCTTGGA ATATTGGCCT GGATTCGGCC AAGCGGACTC TGCAAGTAAC AACGCAGAAG
GGTGTGAAAA CGGTGATGCA TCCCTTGACC CGACGGTATC GTACTCGCCA ATCGCATTTA
CGATTTCCCA CCATTCGGAC CAAGGTTTAC ACCGACACCA TGTTTTCGTC CGTGATTTCC
ATCCGCCAGT ACAAGTGTGC CCAGGTTTTC ACAACCAACA TGGCCTATTC GCGTATTTAC
CCTCTGCAGA CCAAGCAGCA AGCTCCTGAT GCACTAATGA AGTGGATACA TGATGTTGGG
GTAATGAGTG ACCTAGTTTA TGATGGGTCT AAGGAGCAGG GAGGTGGCAA ACATTGGAAA
GAGATTGAGC AGCGTCACCA TATCCATCGC CATGTAACGG AGCCACACAG TCAGTGGCAG
AATCGAGCTG AAGGAGAAAT TCGTGAAATT AAGAAGGCTG TTCGGCACCA ACTGCAGGTT
TCTCGTGCAC CACAGCGACT ATGGTGTTTT TGTTGTGAAT GGGTGTCGGC TATCCGTCGA
TTAACTGCTC ACGACATTCC TGCACTAAAC GGTCGAGTTG CCACAGAGCT TTTGGAAGGG
GACACCCCCG ATATTTCTGA GTACGCGCAA TTTGACTGGT ATGAGCCTGT CTGGTTCATT
GACCCAACTT CTGCTTTCCC TGAAATGAAG AAGAGATTGG GCCAATGGGT CGGAGTTGCA
TCAGATGTGG GACAGGCAAT GACTTTTTGG ATTCTTCCAA AATCATGCAT CCCAATTGCA
CGTTCCTCTG TTGCTCGCGT CTTTCCAGAC GTAGCCGCTA CCGATGAATT TAAGGCTGAC
CTTGCTGAAC TTGATCTAGC CATCGAAAAT AG
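
The gene sequence is printed in blocks of ten bases separated by spaces, so it needs to be joined before it can be measured. The following is a minimal sketch of how the blocked text could be cleaned and its GC fraction computed for comparison with the "GC content" field. The helper names `clean_sequence` and `gc_fraction` are illustrative, and only the first line of the listing is used here to keep the example short.

```python
def clean_sequence(blocked: str) -> str:
    """Join a blocked sequence listing by removing all whitespace."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence listing above.
first_line = "ATGGCTGATG CTGATGCTCC CAATGCTCCC GAAGCTGGTC TCGCTCCGGC TGTTGCTGCC"

seq = clean_sequence(first_line)
print(len(seq), f"{gc_fraction(seq):.1%}")
```

Passing the whole listing to `clean_sequence` instead of one line gives the full-length value to compare against the record.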
 
Protein sequence
MADADAPNAP EAGLAPAVAA NAPGIVGDAQ AAAYAAFAAA EAQMANALAV LAGEAPPDAQ 
PFPPPPGVAD IPVGLPPVVV DAPLIDPYDS LLMRAGMNYG TRKAFSKEGY GLMSDLVTLN
QKRLESLIDM MNKKHRGKSF QGAIPLGLNL VAEDLEIDIG HKTKTTLKVI LHWADLQKSL
GLDVNAEDYT NVVGQLARER MDEEEKILEA AKKLTPSKPT TLKDMTKWRS FFENWNSYMS
QCRSAAAIPL LYVYRTNEQP ETALVGTYVN MDAYLVAQTV LSGSNFEIDN QRVFDEFKEA
ITTTGPGWSF IKTYNRSKDG RAAILKLKEQ AEGTLNESVC RDDAIKILST TTYNGPSRNW
NIDMLLQKFQ YVISELVEID GVALPDGQLV TYLVQALKDP SLSYVNDVQT SGSGASGGSM
KGGTGKGASK PTPFKGAVTA RSYTPGEWKS LSKDQQEKVR SLRNKKKQGG KPEESERNVD
SVARDEPVDT KEVHTSREME PTSDAAGLQF GRDTCVAGAN TVLIGESQKS VTVRPFSSEY
SALKNIPIGT VATAYTVPED GRVVLLIINQ ALFFGDRLKN TLLTPNQMRD FGIEVDNAPR
QYVANSKHSL YVPDSQLRIP LQLRGIFSFL ESRKPTQQEL DECEHIILTS DVPWEPCSAD
FARREEEAAK RDRSVSLVDT TGLSTGHAIL SAHPYGIRTV AASQRILETF CSLTEVELCE
TNLADRLIAC VNVASDDYCG DGLDGRADLD VYPDSEDFTR VVSVYDGSKE QGGGKHWKEI
EQRHHIHRHV TEPHSQWQNR AEGEIREIKK AVRHQLQVSR APQRLWCFCC EWVSAIRRLT
AHDIPALNGR VATELLEGDT PDISEYAQFD CHRK
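
As a consistency check between the two listings, the start of the gene sequence can be translated and compared with the start of the protein sequence. The sketch below assumes the standard genetic code, since the "Translation table" field in the record is empty, and the function name `translate` is illustrative. Because the gene is flagged as spliced, only the first line of the gene sequence is translated; translating the full genomic span directly is not expected to reproduce the whole protein.

```python
# Standard genetic code (assumed), built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(nucleotides: str) -> str:
    """Translate in frame 0, ignoring whitespace and any trailing partial codon."""
    seq = "".join(nucleotides.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First line of the gene sequence and the first two blocks of the protein listing.
gene_start = "ATGGCTGATG CTGATGCTCC CAATGCTCCC GAAGCTGGTC TCGCTCCGGC TGTTGCTGCC"
protein_start = "MADADAPNAP EAGLAPAVAA"

translated = translate(gene_start)
print(translated == protein_start.replace(" ", ""))
```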