Gene PHATRDRAFT_50519 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_50519 
Symbol 
ID7199241 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011698 
Strand
Start bp294198 
End bp297146 
Gene Length2949 bp 
Protein Length982 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185412 
Protein GI219130521 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCGACG TTGTGTTCTA TTCCCTCGTA GCTCTCGCCA AGGCGGGCAT TGAATGCTGT 
CAAAACGCCC AAGTCTACAA GGCTGAAGCA GTCCGCATCG GCGAACGGCT AACGATGATA
GTGGCTCTAG CGCGTCAATG GAGAGGTGGT TGTAGCAACG AACAACTCGA ATGCTTTCAC
AGAGTCGTGG TCAATGTACA CGAATGCATG AAAGCTGCCT TTTTGTCTGA GAGCAAGAGT
GTTGCATGGA ACAGAAAGTT AAAGGTGACG TTACAATCTC AAATGCTGCT CGAGAATCTC
GTCCAAGCGG AGAGCCAGTT GAACACCGCC ATCGCTGACT TTCAAGTGAT GCAGTCCAAC
ACAATCTTTT CAGACTTTCT TGACTTCAAA GAAGGGGTCA AAGAAATGCT TGATCGGTTT
GGTTCGTGTG TTATGAACCA ATCGGGCTCC GTTACAATCA AACAGCAATT TGATCAGGTC
TTGGCGGAGA CTCAAGATCA AGCTCCCGAA ATCGGCATTT CCACTCCCGA GGATGGTAGA
TACAACGCAA CAATAAAATA TGACCCTGCT GGTCAAACGG GTGGCAAGCT ACAGCAGCGG
AAAAAGTCAA TGTTTCGTCC ATCCGAAGAA GACGTGCTCG CAATTACGCT CAGACCTTCC
TTGTTAGTCT TTTGCGACGA TCGCAACAAC CTTCTCGGAG GTGGAGGATT CGCAGAAGTC
TTTCACGGAA CCTATGACGG ACGACCAGTC GCCATCAAGC GCCTCAATGT ATCCATTCGA
GATGTGATGT CACTATCGAC TAAGCAGATT ACTAGCGATG TGGAACAACT TGCTGCCGAA
GCTATCCTAA CCCACAAATG CGGTGCACAT TCCAACATCA TCCAGGTACT TGGCTGCATC
ACCGCACTGA ATAAAACCAC GAGACCCTTG CTCGTAATGG AACTGATGCA CATTACGCTA
TTTGATGCGC TGCATGACCA CCGTGTAAAA GATAAACTGA CATTTTCCCA CTGCCTCTTT
CTGTTAAAAG GTATTGCTGG AGCCTTGGAG TTTCTCCATC TTCAAGGAAT TGTCCACCAT
GACATCAAAT CTCTCAACAT TTTGCTGAGC GAAGACCTCA CCGTAGCCAA ATTGGCCGAC
TTTGGAGAAT CGAAAGTGAA AGGTCTTAAC ACAACAAAAC CACGGCTTGA GAGAATAATG
ACAACGTCTT GTCATCAGGG CAACATAATT GCAGGGACAG CAGCCTACCA GGCACCAGAG
ATTCTCTCGG AAGATGTCAA TGACATATCA CGCGTTTGCG AAGTCTATTC TTTCGGGGTC
ACAGTTTGGG AGTGCGTGAC AAGAGAGATA CCACATATGG GTAAAAAAGA AGGGTCTATA
GCTCTTTTGG CTGCAAACAA GAAACACTTG CCTATGCTTG CGATGCCCTT GCACCCCTCA
AAGGATCTTC CAACAACAGA AATCGGTTCC TGGGAAGCGC TGAGAAAAGT CGCCGCATTG
TGCCTCTCCC GCGACCGCTC GATGAGACCC ACTGCTTCTG GAGTTGTTGA GCTTTGGCAC
CACGTCGACA CTCCTTCATT CCCTCCGGAG AGCTTGTGTT ATGCCAGTAC CAGGTGTGTT
TCCGATCCTC CACTACCGGC TCACTCGGCA TCTCAGAAAT CAACAATTCG AGGCCACGCG
TTGGATGCGC CAGATTTCGA AGAAGAATCC AAGGCAGGAG GTCTGAGGAA ACGTCGCTAC
ACTGTACTGT CAGTTATCGC AATCCTTATC GGGTTGATAG TGATCATAGT TCTTCGCTTA
CAGAGGTCCA GGGCATCTTC AGGTAAATCT ACGACATTCC CTGAATCGGA AGCGCCGTCT
GGTATTTCAA TCTCCACTCT CCCGCCCACC CAATTACCAA GGGCATCTCT GATACCGCAA
ACAGAGGCTC CGGTCTCTTT GTCCCCACCT ACAGGTCCGC CGCTGTCATC CCAATCTCTA
ACAACCCAGC CATCATCAAC ATCCATGCCG CCGAGAGCAG GTCCGGTTTC CACGACCCCG
CCCACTATTT CACCTCTCAC GGCCCAATCT CCAACAACAA GCACGCAACC ACGTCTGTCC
TTTCAAACAA CACAAGAGCT TTACGATGCT GTTGATATTT ACACTGCCGC GACGGACTCC
ACAAATTCTA CGGCGGCAGT GACGTACGGC TATCCCATCG GATCATGGGA TGTGTCCCAA
ATCACCGATT TTTCGCAAGT CTTCGATAGC TTGAATCGAA ACAGCGCGGT TGGGATTTTC
GACGAAGATG TGAGCGGCTG GGATGTCTCT GCCGCAACGA CCATGTCTGG CATGTTCAAG
GGTGCGTCTA CTTTTAGCGG CGATCTTTCG TTGTGGAACG TAAGCCAGGT AAGGGATACG
TCATCCATGT TTGAGGGAGC AACCTCGTTC GACGGTAACG TCTCGCTATG GAATGTTGGG
CAGGTAACGA ACATGTCTTC CATGTTTTTT GAGGCGAGTG CCTTCAATGG CGACCTCGCT
TCGTGGAACG TGGGGCAGGT AACGAACATG AGTTCAATAT TCTTCCTTGC GTCTAGCTTT
ACAAGCGATC TTTCGTCGTG GAATGTTGCA CAGGTAACGG ATTGGTTTGC CGCGTTCAAA
GGAGCAGCCG CCTTTACCAG TGACCTTTCC AAGTGGAATG TGGGAAAGGT CACAAACATG
CGTTTGATGT TCTACCACGC GTTTGACTTT AATAGCGACC TCACGTCGTG GGATGTTAGC
CAGGTGACGG ATTTGTCGTC AATGCTCGAG GGTGCGACCG CATTCACCGG CAACCTCTGC
TCGTGGCTCA CACAGATTCC ACCGAGTTGC AACGTGGATC GAATGTTCTC GTTCGCCTCG
TCCTGTTCAG ACCTTGCAGC CACCGTACTC CCGGACGGAC CCATGTGCCA TGCTTGCGTT
CCAATGTAA
 
Protein sequence
MADVVFYSLV ALAKAGIECC QNAQVYKAEA VRIGERLTMI VALARQWRGG CSNEQLECFH 
RVVVNVHECM KAAFLSESKS VAWNRKLKVT LQSQMLLENL VQAESQLNTA IADFQVMQSN
TIFSDFLDFK EGVKEMLDRF GSCVMNQSGS VTIKQQFDQV LAETQDQAPE IGISTPEDGR
YNATIKYDPA GQTGGKLQQR KKSMFRPSEE DVLAITLRPS LLVFCDDRNN LLGGGGFAEV
FHGTYDGRPV AIKRLNVSIR DVMSLSTKQI TSDVEQLAAE AILTHKCGAH SNIIQVLGCI
TALNKTTRPL LVMELMHITL FDALHDHRVK DKLTFSHCLF LLKGIAGALE FLHLQGIVHH
DIKSLNILLS EDLTVAKLAD FGESKVKGLN TTKPRLERIM TTSCHQGNII AGTAAYQAPE
ILSEDVNDIS RVCEVYSFGV TVWECVTREI PHMGKKEGSI ALLAANKKHL PMLAMPLHPS
KDLPTTEIGS WEALRKVAAL CLSRDRSMRP TASGVVELWH HVDTPSFPPE SLCYASTRCV
SDPPLPAHSA SQKSTIRGHA LDAPDFEEES KAGGLRKRRY TVLSVIAILI GLIVIIVLRL
QRSRASSGKS TTFPESEAPS GISISTLPPT QLPRASLIPQ TEAPVSLSPP TGPPLSSQSL
TTQPSSTSMP PRAGPVSTTP PTISPLTAQS PTTSTQPRLS FQTTQELYDA VDIYTAATDS
TNSTAAVTYG YPIGSWDVSQ ITDFSQVFDS LNRNSAVGIF DEDVSGWDVS AATTMSGMFK
GASTFSGDLS LWNVSQVRDT SSMFEGATSF DGNVSLWNVG QVTNMSSMFF EASAFNGDLA
SWNVGQVTNM SSIFFLASSF TSDLSSWNVA QVTDWFAAFK GAAAFTSDLS KWNVGKVTNM
RLMFYHAFDF NSDLTSWDVS QVTDLSSMLE GATAFTGNLC SWLTQIPPSC NVDRMFSFAS
SCSDLAATVL PDGPMCHACV PM