Gene PHATRDRAFT_47894 details


Gene Information

Locus tag: PHATRDRAFT_47894
Symbol:
ID: 7203161
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011683
Strand:
Start bp: 373706
End bp: 374967
Gene Length: 1262 bp
Protein Length: 254 aa
Translation table:
GC content: 47%
IMG OID:
Product: predicted protein
Protein accession: XP_002182381
Protein GI: 219124166
COG category:
COG ID:
TIGRFAM ID:
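
As a quick consistency check on the coordinates above, the following Python sketch recomputes the gene length from the start and end positions. It assumes the coordinates are 1-based and inclusive, which the record does not state. It also computes how many bases a 254 aa protein plus one stop codon would occupy. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 373706
end_bp = 374967
protein_length_aa = 254

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Bases needed to encode the protein plus one stop codon.
coding_length_bp = (protein_length_aa + 1) * 3

print(gene_length_bp)    # 1262
print(coding_length_bp)  # 765
```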


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGTGGA CAAAGACACT GACATCTTGC TTCACCTATA GGATAGGCAA ATCTTTGTAC 
GTTCCGCTCA CCAGTCGATG CAATAGTCGG ACTCTACCGC TTACCCGGGG TCCCAACTTC
TTGCTCCCAC CGGAAGTCGT CGTCGCCCTA TGTCGCGTGC GGGATGCTGA AGGAAACGCA
TCTCAATGGA AACACTGGTG CATATGGTTA GAAACACAAG AACGTAAACA AAAGCTACCG
GAGCCATCTG ACGATGCTTT TGTCGTGGAG CTTCCTTCAG ACTTTTTATC GGGACGCCCA
ACAGTAGAAG AACTTCTGAT GGAGATACAG GAAATTAACC TCTCCGAATT TGAATCCATA
GTGATTGCGG GAGAAGGAGA ACCCACACTG AGATTCAACA TATTATCTAA ATTTGTACAG
CACGTACAAG AACTGTGTGA TCTACCTGTT CGGCTTTCGA CCAACGGCTT GCTTTCGTCA
ACCAGAGCAA AGGACTTAGT GGAATGCGGT GTGGACTCGG TTAGCGTCGC ACTCATGACG
AGCGACGCAG ACCAGTACGA TAACCTGATG AATCCGCAGT TGCCTTCCGA ATGCTCGTCG
AGGGCACATC AAATGCTGTG TGATTTCGTG ATTGCCGCAC AAAAAGCTGG GCTTCAGGTT
GAACTTACGG CGATCGATCG GCCCGAAGTT GACCGTGAAC AAACACAGGC GCTGTCAACA
CGATTAGCTG GTGTGGATGT TCGATGGCGG TCATACTTCC CGTAGAACAC TTTTTATCGT
TATTTGTATT TTTAAAGGTA CGAATATGCT ATGTAGATTA TAGCAACAGT GACGATCATA
TAAAGTGCTT TTACCATCAT TCGAAAGCAA TGCTGATGCC GCTTCGTATA TTTCCACGCA
ACGAACACAC TTCCTCATCT CCATTCTCTG TGTCCTATGT TTTCCCGGCG GCGAATCATG
CCCTGATAAA TCAAAACGAC GTTGCCTTGC TGGAATGTGA CACAGATGTT ACGCTTCAGT
TGCATTGCAT TCTTTCTTCG CCACTGGCGC AATTTCTGTA TCTATTGTTG CCATTGAATT
TTGGTTCGCA CCCTGCACAA GCATTTTTTC CGTGGTAACC CAAGTTTCTT CTTTTATGAT
TTTGTGCTTT TTCTGCAACT GCCCTACGCG ACCTGGATTC GACAGATTCT CGTCCCTCAC
TGTCAGCTGT AGTCACCCCT GTCATGGGTA GCAGTGGTAA ATGTATGTCG TTATCGTGTG
GC
 
Protein sequence
MKWTKTLTSC FTYRIGKSLY VPLTSRCNSR TLPLTRGPNF LLPPEVVVAL CRVRDAEGNA 
SQWKHWCIWL ETQERKQKLP EPSDDAFVVE LPSDFLSGRP TVEELLMEIQ EINLSEFESI
VIAGEGEPTL RFNILSKFVQ HVQELCDLPV RLSTNGLLSS TRAKDLVECG VDSVSVALMT
SDADQYDNLM NPQLPSECSS RAHQMLCDFV IAAQKAGLQV ELTAIDRPEV DREQTQALST
RLAGVDVRWR SYFP
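
The relationship between the gene sequence and the protein sequence above can be spot-checked with a minimal Python sketch that translates DNA in reading frame 1. It assumes the standard genetic code (NCBI translation table 1), since the record leaves the translation table field blank. The helper name `translate` and the constants are hypothetical and not part of any database tooling.

```python
# Standard genetic code (NCBI translation table 1), assumed here because
# the record leaves "Translation table" blank.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    # Drop the spaces and line breaks used to group the sequence in blocks of 10.
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above (60 bases).
first_line = "ATGAAGTGGA CAAAGACACT GACATCTTGC TTCACCTATA GGATAGGCAA ATCTTTGTAC"
print(translate(first_line))  # MKWTKTLTSCFTYRIGKSLY
```

The printed residues match the first 20 residues of the protein sequence listed above. Passing the full gene sequence to the same function would translate up to the first in-frame stop codon.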