Gene PHATRDRAFT_49009 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49009 
Symbol 
ID7195401 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011688 
Strand
Start bp293366 
End bp296485 
Gene Length3120 bp 
Protein Length1039 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183718 
Protein GI219126969 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGACC TCATTGTTTT AACCTCAAGG CTCTCTTACA AAATCCGCAC ATACCCTAGC 
AATCACACTC TCTGCTTGTG GCAACGCTGT TATCACGAAA CGAGTTCTGC CGATGACAGC
ATCGACACCC AAGCCGAGAC TTGCAGCGTA GCCTCCAAAA TGAGCTCGAG GAACAGTCAG
TTGAAGGGTC ATACTCTCCA ATTCGAAGAA GAACAATGTT TATCTTCTGC AGAGGTCAAA
GTTAGTGACA AAAAATTCGA GCCTGGGCTG ACGTTGTCGG CGTTGCCCTT TTTACAGTCG
TCGGAATCTT CCGGAGGAAG ATCGCTCTGG AAGTCTCACA AAGTCGACGA CTCAACACCG
GTAACGGTTT GGCGCCAGTT GACCCTAAGG CTATTGCAAT CCACAGAGGA GCTGACACGC
CACGAATGTT ATCTGATGGA AGAGTGCCTA CAATGGTGGA CCCGAAGACG CGTCAAATTC
GATGTCGTCG ACCCGGAAGC GAGTGAAGTT GTTTGGAAGC TCTGGTACAA ACTCTTGACG
GAATCGAACG GCAAGCCGTC GTCTTTGTTG CTCAATTCGG TATTGGATCA TTGGAGGTTG
TCGATAAAAC AAAACATCAA AGTGCCCTAC TGGCCTGACG GAGTTATCTC GCATGTTCAG
TCGCTAGCGC CGGAGCTAGT GGATGTCAAG AGTTTCGCTC TGATCCTTGG AGCGATGGCA
TATTGGTATG ACGTGGACCC CTGGAAGGCA CAAGCCTTAC TTCAAGAGCT ACCTTCACAT
GTAAAACCCA ACGCAATTGT TTGGAATTCT GCATTAACAG TTTGGGCCAA GGCGGATGCG
TTACGATGGC CCGACGCTGC CCTTCGGGCG GAACAACTTC TAGAACGGAT GAAACAACAT
CCCGACATTC AGCCCAACGA GGTGAGCTTC ACTTGTGTAT TGGAAGCTCT TGCCAACAGC
CCTTCCGCAA AGGCTCCAGA AAAAGCTGAA AGAGTCTTTC AGGATATGGA GGATGCAGGA
TTCCTGTCAC CCATCGCTTG TTTGCAAGTC ATGCAAGTTT GGGCGAAATC GGACAGCCAC
CACGGAGCTG ACAAAGCCTA TGCTCTTCTT CACGAGATGG TGCAATTGTA CCTCAAAGAC
CAATCAACCG TTAAAAAACC CATGAAGCAT TGTTTCTCAG TAGTCATGGA GGGATTTGCG
CGTCGTGGCA AACCAGAGAA AGTCGAACGA ATTCTGTTGG AGCTCCAGGA TTTATACCAA
CGCTTCCATG ATGATGACTT TGTGCCAACG GCTGCGACCT TCAATTCAGT CCTCTCTGCG
TATGCGAGAT GTGCTCTGCC TGATAGGGCT GCGCGGGCCG AGCAATTGCT GATGAGTCTA
CGTGAAATGG CCGAGTGGAG CGGTAATACT GCCTGCTTAC CCGACACTAC ATCGGTTAAT
ACAGTTGTGC ATGCATGGGC AGAGAGCAAC GATAAGGGCG CAGTCGAAGG CGCTGAAGCC
TTACTGAATG CGATGCAACT GTGGGAAGGA GTGCAGCCTG ATGCATACAC CTACACTGCT
GTCATAAAGG CTTGGGCTCG CTCCGACAGG AATGGTGCCG CAATGCGATG CGAATCCTTC
CTGAACGCAA TGTGGGCAGC ATTCGAAAAG GGGAATCACC AAGTCAAGCC AAATGATGTG
ACATATGCTA CAGTGATCTA CGCCTGGTCC AGAAACAAAG GAAAAGAAGC TCCTTACCGA
GCCGAGGCAC TTTTCCGAGA AATGATGGAA CGGCATAAGA ACGGCGATTC AACGTTGAAA
CCCAGGGAGT CTTCTTATGT GTCTCTAATG ACAACTTGGA ATCGCAGCAA CCTGCGAGAA
GCGCCCTCTC GAGTCCAATT TTATTTTAAT CAAATGCGAG GGAGTCACCT CGCAGGCGAC
AAAAGTTTGC AACCGTCCGC GAAATTGTAC AATGCTGCTC TTTTCGCCAT GAAACGGGCA
GGAGATGGGG CGGGGGCCGA AAACATTTTG GAGGTTATGT ATGCAGATTT CGAGAGGGGC
AACAACAAGG CCCAGCCAAA TACGCATGTT TTCAACACGA TCATATCTGC TTGGGCAAAC
ACTAGAACGC ATATCGCGCC AGAAAGAGCA GAAGCTATTG TTTTACGGAT GCTGGAGCTT
CATTCAGACA AAGGATGGGA TTGTAAGCCA AATGCCATAA CATACACCTG TATTCTAGAC
TGCTGGGCAA AATCAAGTCG TAGCGACGCT CCTGACCGAG CAGAGGAGAT CCTCAGACAC
ATGCAACACT TAAGCGATAA AGGTGACGAA AACGTTAAGC CAACCACCTA CGCTTGGTCG
ACTGTCCTTA CAGCGTGGTC GCGGTCGACT TCTCTGGATG CCCCTGTCCG AGCTCAGCAA
TTATTTGACG AGATGTTACG AAAATTTGAA GCAGGCGACA AATCTCTTCG GCCGAGTGGA
CCAGCATACG CTAGTGTTTT ATCCACTTGG TCGCGCAGCA ACCGACACGA CGCGCCTCAA
ATATCATCCG AAATCTTAAA ACTCATGAAA GAACGCCACC TGGCGGATAC GTTAAATGAG
AAGCCAAACC GGTACCACTA CTCAGCTGTC ATCAGCGCCT TTGCATCAAA GGGAGATGTT
CAGAATGCCG AGGCATTGTT CGAAGAAATG AAACTTTTAA AGGACGTAGA GCCACATGAT
GGTTGCTATA ATGGCCTCAT CAAGGCGTAC GGTCGTTCGT CATTACCCGA TGCAGCGGAA
CGTGCCGAGT CTTTGCTTCG CACTATGGAG AAAGAATCAG CCGTTGGAGC CGACTGCAGT
CCTACCATGA TAACCTACAC ATCGATTTTG GATATTTGGC AAAGAAGCCA AAGACCGGAT
GCAGTCGACA GAGCTGAATC CCTCCTGAAG GAAATGCTGA AACTCGCTGA ACAGGGACGG
GACAAGCTAA GCCCCAATGC GTCTACCTTT CTTGCTTTCT TACGAGTAAT TTCCAAAAGT
TGTGCTACTG ACAAGGCCGC TCGTGCTGAG GAAGTCTTGT CGTTGATGAA AGCTTTTAAA
TGCGCGCCAA CGGACGCTGT ATTTCGAGAA TTGAATAAAT GTCGAGCTGA TGCCTGCTAG
 
Protein sequence
MKDLIVLTSR LSYKIRTYPS NHTLCLWQRC YHETSSADDS IDTQAETCSV ASKMSSRNSQ 
LKGHTLQFEE EQCLSSAEVK VSDKKFEPGL TLSALPFLQS SESSGGRSLW KSHKVDDSTP
VTVWRQLTLR LLQSTEELTR HECYLMEECL QWWTRRRVKF DVVDPEASEV VWKLWYKLLT
ESNGKPSSLL LNSVLDHWRL SIKQNIKVPY WPDGVISHVQ SLAPELVDVK SFALILGAMA
YWYDVDPWKA QALLQELPSH VKPNAIVWNS ALTVWAKADA LRWPDAALRA EQLLERMKQH
PDIQPNEVSF TCVLEALANS PSAKAPEKAE RVFQDMEDAG FLSPIACLQV MQVWAKSDSH
HGADKAYALL HEMVQLYLKD QSTVKKPMKH CFSVVMEGFA RRGKPEKVER ILLELQDLYQ
RFHDDDFVPT AATFNSVLSA YARCALPDRA ARAEQLLMSL REMAEWSGNT ACLPDTTSVN
TVVHAWAESN DKGAVEGAEA LLNAMQLWEG VQPDAYTYTA VIKAWARSDR NGAAMRCESF
LNAMWAAFEK GNHQVKPNDV TYATVIYAWS RNKGKEAPYR AEALFREMME RHKNGDSTLK
PRESSYVSLM TTWNRSNLRE APSRVQFYFN QMRGSHLAGD KSLQPSAKLY NAALFAMKRA
GDGAGAENIL EVMYADFERG NNKAQPNTHV FNTIISAWAN TRTHIAPERA EAIVLRMLEL
HSDKGWDCKP NAITYTCILD CWAKSSRSDA PDRAEEILRH MQHLSDKGDE NVKPTTYAWS
TVLTAWSRST SLDAPVRAQQ LFDEMLRKFE AGDKSLRPSG PAYASVLSTW SRSNRHDAPQ
ISSEILKLMK ERHLADTLNE KPNRYHYSAV ISAFASKGDV QNAEALFEEM KLLKDVEPHD
GCYNGLIKAY GRSSLPDAAE RAESLLRTME KESAVGADCS PTMITYTSIL DIWQRSQRPD
AVDRAESLLK EMLKLAEQGR DKLSPNASTF LAFLRVISKS CATDKAARAE EVLSLMKAFK
CAPTDAVFRE LNKCRADAC