Gene PHATRDRAFT_49367 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49367 
Symbol 
ID7195881 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011690 
Strand
Start bp51121 
End bp54046 
Gene Length2926 bp 
Protein Length896 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184175 
Protein GI219127923 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.40946 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CACATAGACT CGTCATTGAC TCCTCGTGGG TAGCAAGGAT CGCCAGCACT AAGACGGATC 
CGCTTTACAA GCAACCGTCT AATTCATGCA GACTGCAATG CCCCGTACTC GAGCTTGGGG
AAACCTAGTG GCGAGTCGCA GCGGCCAACG ATGCCCGGCT TCCCATCACC CAATTCGTCG
ATGGTTGACA AGAAATAAAC CAAATCGAAG TCCATTCGTA ACCACAGCTT CTCTCGTGCC
CACAAATTGT ATGCATGTTC ATACGCGAAA CGTTCAATAC TTTTCTACTG GGAAGTCAAG
TGATCCCACA AAAAACTGGC GCTTTGATCA GAGAAACATG AGAAATCGTC GCGTGTCCAA
GAAGGATAGT GAACAATTTG CCAAGGCCAA TCAAAAAGCC CACCAAAAGA GCAACAAACC
TCCCAACGTC TTTATTAGGG AGCTAATGAG AGACATCAAC AATCTCCAAA AAAGACTGGA
CAATTTAGAT TCTCTTCCGG CAATCTGGGA CAGGTCGACG CGCTCAAAGC TTTACGATAC
AAATTCTCAA GAACAGCGCC TTGCGGTGTT GAACGAAGGA CGTCAGCTAT TGGAAACGAT
TCGTCTTGCG GTTGAAAAAG GCACGTTAAA GCCTTTGGAA GACCACGGTC ACGAGACGTC
TGTATTGATG GAAAGAATTC TCAAGCTGTA CAGTGAAACC CCGTCCACAA CCGATACGCA
TTCATCGTTT GACGAATGCC AAACCGTGCT TTCTGTAATG GATGCTTGGA AGATTGATCG
GCAACATCTG CACGTTGTGT ATTCTATTAC GGCTGCTACA CGGGAAGGGC GGTGGGAGGA
CGCTTCCAAC TTATTCTGGA GTCACATAGA CCCAGAGGAC TCGGGCTACC GACCATATGA
TGAAGACGTT GCCAATCCCG TTGGTCTTTA CGCAGTTGCA CGGTTTGCTC AAGAACGCAA
CATGCCCGTT ATAGAACCTG TCATGGATAG TGTGTTGCGT ATGACTATGG TATCGCCGTT
TGATCAAGAC AAATGTATGT TGTAGAATGA GCCTTTGCTT GAGCGACAGT TTGATGATTC
ATCTTAACGC ATGTTTCTTT CTCGTAGATT TGCTTGCCGC GGGAACTGCA ATTGGCCATG
TTGGTGAATG GGAGGCATTT ATCAAATACT TAAAAAATAG CTTCGATGCA AGTCGGCTTG
GACAGGTATG TACGGTTGCC CCTGCCTGTG TTGACCTAGA AATAAAGCAG ACTCTAACTG
AACAAGGTGT TTTGAAATGT CAGCCCCTCG TGGCTGCTGT CATGAAGGCT TGCATTGCAA
ACGACTCCTC ACAGCAGGCA ATGGAAATAT TCGATGAGTT CGTGGTCTCT AAACTGAGCA
TCGCTGGAGA GTGGCAGTGG GCTGGAGGCG AGAATCCCCT TCATCCATTG TGTCGAGATC
TTGCCCTTCA AGCCTTAGGC CAATCTTTTG AAGAAGGATC GAGCAGGCGC GCATTGGATT
ACTACAACCA AATAGCTCAG GAAAAGTACA CAATCAGTAT TGATGCACTT GTTGGAGTTT
TTGAGGCATG TGTCAATGAT GGACAGTGGC CTGACGCTGT GGACACACTC CTCTCAATGA
TTGCAAAATC ACCATCGGAT CGATGGGTTG TCACGACGGA ATCGCTTGCT ATTGAGGTAA
TGGAGGATTC TGCAGCGAAG TCGATGGACC TCCTTACTCG TCTTGGTTCT TGCGTGGAAG
TCACAATGCG TGGCTGTAAC TCAGACGGTC AGTTTGGGGT TGCCATGGTG TGCTTGCGAT
TGTTCCAAAT ATGGATGCCT GCGGATTATT TACTGCCGTA TCAGGAAGCA ATGTCATCTA
AAGGACATGA GCATTCCGAG AACTCGCTTA TTGAGGCGAT AGCTCCTATA ATCTGTGCCT
TAAATGATGC TGATGGGGTT CTCTCTGCAT CCATGGTCGC CCTTTGCGGT CTTGGTTTGA
ACCAAGAGGC ATCAAATTTA TTTAAGTTGG TAGAGGGCAT CTTGGTGTAT GTTGATGGAA
AGATAGAAGT AGATCGACTG GAGCAGGCTC GAGACCTTGT ATCTTATGCA AACGAAGGTG
CAGAAAGATT CAAACCGACA AAGCTGCACG GGGGTTGGGA GGTGTCGTAC AGGCACATTC
ATCGCCTGAC ATCTGCATGT TCAATATTAG TCCACAGTAA ATCCTGTCCC TCACAAGAAG
AAATGCGACT GCTTTCTACC GCTGCAGCTG TCGCTGTTCG ATCATGTACC GTTGAGAAGC
AAGCCGATGC CGGTCTTGTT CTTCTAAAAT GGATTGAAGA AACCTTAGCT TCACATGCGG
CGAGTCTTCG TTCACATGAG TCTGCCCTCG CTTTGCCTCC AACTGACGCT CTACTATCGG
CGCGAATGGG AGCGTACTTA TCAATAGAAG ATGCCGACGC GGCTTTGGAA TTGTTTCAAA
ATGAAATCAA AAACAGCCAG GGGATGACAA AGTGGAGACT TAGCACCTGC GAAGCTATCA
CAGCATTTTT TCGGGTAGGG CATGCTTCTG ACGCGATGGA CTTTTTTGGG AAGGTGGTCG
CCGAAAATAG AAGTCCTGAT ATCTTTTGTA GGGTTGCCGA AGGACTTGTG GACGAGAAGG
ACTGGGATGG TGTTGCCAAG GTTTATAAAT CGGCCCTGTC ATGTGGATGC CTTTCAGAGG
AACTTTCGAT TCTTGCAATG AAAGCAGCCG CTGCTGGACG GATAGACGGT AGAGTCCGTG
TTTTCCGCAA CATTTTAGGC GAAACTGCTA AGTTTGTGGG GACACAACCT TTGGTGTGGG
CAGTAGCAAA GTACTGGATA CTCAAGCGGG CCATAGGGTT TCCAAATATT TGCATTGTAA
TGTGGTGGAA CACTAGCAAT CCACATCTCA ACGAGGTAGA GCTTGC
 
Protein sequence
MQTAMPRTRA WGNLVASRSG QRCPASHHPI RRWLTRNKPN RSPFVTTASL VPTNCMHVHT 
RNVQYFSTGK SSDPTKNWRF DQRNMRNRRV SKKDSEQFAK ANQKAHQKSN KPPNVFIREL
MRDINNLQKR LDNLDSLPAI WDRSTRSKLY DTNSQEQRLA VLNEGRQLLE TIRLAVEKGT
LKPLEDHGHE TSVLMERILK LYSETPSTTD THSSFDECQT VLSVMDAWKI DRQHLHVVYS
ITAATREGRW EDASNLFWSH IDPEDSGYRP YDEDVANPVG LYAVARFAQE RNMPVIEPVM
DSVLRMTMVS PFDQDKYLLA AGTAIGHVGE WEAFIKYLKN SFDASRLGQP LVAAVMKACI
ANDSSQQAME IFDEFVVSKL SIAGEWQWAG GENPLHPLCR DLALQALGQS FEEGSSRRAL
DYYNQIAQEK YTISIDALVG VFEACVNDGQ WPDAVDTLLS MIAKSPSDRW VVTTESLAIE
VMEDSAAKSM DLLTRLGSCV EVTMRGCNSD GQFGVAMVCL RLFQIWMPAD YLLPYQEAMS
SKGHEHSENS LIEAIAPIIC ALNDADGVLS ASMVALCGLG LNQEASNLFK LVEGILVYVD
GKIEVDRLEQ ARDLVSYANE GAERFKPTKL HGGWEVSYRH IHRLTSACSI LVHSKSCPSQ
EEMRLLSTAA AVAVRSCTVE KQADAGLVLL KWIEETLASH AASLRSHESA LALPPTDALL
SARMGAYLSI EDADAALELF QNEIKNSQGM TKWRLSTCEA ITAFFRVGHA SDAMDFFGKV
VAENRSPDIF CRVAEGLVDE KDWDGVAKVY KSALSCGCLS EELSILAMKA AAAGRIDGRV
RVFRNILGET AKFVGTQPLV WAVAKYWILK RAIGFPNICI VMWWNTSNPH LNEVEL