Gene PHATRDRAFT_50522 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_50522 
Symbol 
ID7199296 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011698 
Strand
Start bp303560 
End bp305344 
Gene Length1785 bp 
Protein Length525 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185466 
Protein GI219130634 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones47 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTAGATAGTC TTTTCGGTGA CGAAAGTGTG GAAGGGCCGG CCCCCTTCTC GACGCGATCA 
TCCAACATCT TATTTCCTTC TTTTTATGTT TTCATCGCAG TGCAGCTTAT CCTCGTCTGC
CTTGTCGTAC ACACACACTG CAATTGAATG AAACCAGAAG CGAATGCCGC TCCGCGTTGG
CGGAACGCTC CCGCGACAGA CCATTCTTTT TTCAACAATC TTTCCCATCC GCCGTCCACT
TCCTTGGATT CAAACAGCAA TTGGACTCCC ATGCACACCG CCGACTTTGT TCCAAATGCT
TCGCTAGCGT CTACCGAAAC TGCGAACCGA TCGTTTGTCT CGACACCAAC TCGACCGATT
TCTCCGCGCA GCAGTAGGAC GGTCGACCAC GGCGACGGCA CGGAACTACG GCCTGCAACA
ACAACAACAA CTTCAACCCC ACCCAATCAG CCACAAACAC AAATGTACTT CCCTATGCTA
CCGCGTAACG TCGAAAGCTC CCAAATTGAC CCCGCAAGGA TTACACAGTA CCCGTTAGGC
CAAAAGCAGG TCTCCACCAA ACAAAGAAAT CATCAGCAAC TGGAATCGGA ACAGTTTGAT
TCCTTGCAAA ACAAGTGGGC GGTGAAAGAC GCAGCGAACG AGAAGAGCGG TATCTACGTC
TCGGGCTCCG ATACTTCCTT GTTGCATTCA AGCTTTCCGT CTCGCACCAC AAATACTATT
CGACCGTCTC CACTCGTCAC CGAAATCATG AGGCTTCCAG CGAAGCGAAA CTTTGATACG
ACAAAAGATC ACAAGACGAA CGCGCTGTCG AGAAAAATTG AATTTTCCGG AGCTCCGGTG
GCTGTGACGA CTAGTGGCTC TGCTGGGAAT GCGCCTTTTG CGACAGGGCA TTCGCCCATA
CTGCAACCCG GTGGGCTTCA ATCCTTGCAG CCTCCACCTC GATTCCGATC CATATCGGAT
ACTGTCGCAA TAGGTTCAGG GGTTTCCCCA TCAACAGGCA GTGGCTCTGC ACAAGCCGAT
GCACAGCTGC GCATCTCTAA TCCCTACGCT ACGACAGAAT TTGGATCTCG ATCCAATTCG
CCGCTAGCTA CAGCGTCAGA CACCATGGAT TATAATTCCC TGCTTCACAA TTCCTGCAAG
CTGTACCCTA CAACAATCAC GATAGTGGAA AGCGCTTTGC GCTTCGATCC CGAAGGTATT
CGCAGAAAGG TTTCGATTGT ATGTGAGAGA AACATGGGTG GTCAAACGAG CAAACTGCAG
GCGGTGGAGC GATACGTTTA TCCGATAAAC ATTGCGCTCA GATTCAATGC CGCCTTGGAC
GTACTGCAAC TACTTGCATC CAAGGGACCA GAAGTATTGA TGGAATCAGA TGGTTTGGAT
CATATGAGTT CGCTGGGTAT CGCGCTAGCT CTTGGGCACC AAACGAAGGT TATCTACTTG
TTACTCTCGA CGAATCCCCG CAGTGCACGA ACCAGAGATC GTTACTCCAA TTTGCCCCTC
CATGTTGCTG TGCGACAACC TAGCATTACT CTCGAGATTG TCGAAATGGT ACACATGGCC
TTTCCCGAAG CAATCAAAGC CCGCAACTTT CACGGCCAAA CTCCGTTGGA CGTAGCTGTC
CGAACGGTAG CTTGCCCAGA CGCCGTGCTA AATTATTTGC AAATATCTGC TTTCGGGCAG
CTCGAAGCGG CTGCCGACCA TTTTGACGAC TCCGATTTCA TCTAGCTAAA AAAGGACTTC
AGAGAGCGAA AAGTTTCTTA TGCTAGAACT TGAGTACGTT TTGCT
 
Protein sequence
MKPEANAAPR WRNAPATDHS FFNNLSHPPS TSLDSNSNWT PMHTADFVPN ASLASTETAN 
RSFVSTPTRP ISPRSSRTVD HGDGTELRPA TTTTTSTPPN QPQTQMYFPM LPRNVESSQI
DPARITQYPL GQKQVSTKQR NHQQLESEQF DSLQNKWAVK DAANEKSGIY VSGSDTSLLH
SSFPSRTTNT IRPSPLVTEI MRLPAKRNFD TTKDHKTNAL SRKIEFSGAP VAVTTSGSAG
NAPFATGHSP ILQPGGLQSL QPPPRFRSIS DTVAIGSGVS PSTGSGSAQA DAQLRISNPY
ATTEFGSRSN SPLATASDTM DYNSLLHNSC KLYPTTITIV ESALRFDPEG IRRKVSIVCE
RNMGGQTSKL QAVERYVYPI NIALRFNAAL DVLQLLASKG PEVLMESDGL DHMSSLGIAL
ALGHQTKVIY LLLSTNPRSA RTRDRYSNLP LHVAVRQPSI TLEIVEMVHM AFPEAIKARN
FHGQTPLDVA VRTVACPDAV LNYLQISAFG QLEAAADHFD DSDFI