Gene PHATRDRAFT_38585 details


Gene Information

Locus tag: PHATRDRAFT_38585
Symbol:
ID: 7203321
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011684
Strand:
Start bp: 452553
End bp: 453935
Gene Length: 1383 bp
Protein Length: 460 aa
Translation table:
GC content: 47%
IMG OID:
Product: predicted protein
Protein accession: XP_002182537
Protein GI: 219124494
COG category:
COG ID:
TIGRFAM ID:
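
The coordinate and length fields in this record agree with one another, and a short check can confirm that. The following is a minimal Python sketch. It assumes that the start and end coordinates are 1-based and inclusive and that the reported gene length includes the stop codon; the record states neither convention. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 452553
end_bp = 453935
gene_length_bp = 1383
protein_length_aa = 460


def span_length(start, end):
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Both comparisons hold under these assumptions: the interval spans 1383 bp, and 1383 / 3 = 461 codons, one of which is the stop codon, leaving 460 residues.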


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGAGCAT CGAATGCGGC TTTGTGTGTC CTTCTCATCT TGACAGGATG CCAACTTGAA 
GGAAGCAGGA AATACACCAG CAAATTCAAC CCGATAGTAT GCGATGATGG TTGCGTTTCA
AAGTCTAGGA TTACACCACT CAACGGTCAG AAAGAATTCC ATAGCCATCA ACGCCCCAGC
TACTATCCTG ATTCGTCCGC TTTGCAAAGT CGACTTTCGT GGTCGTCGGT GCCAAGAGCG
GATAAGATGC ACAACGAAAC ATTTGTACAT CGACAGAAAC GAACTCCTCT GAAAAGCAAC
CAGCCTATTG TCTTTTTGGA CGTAGGTCCG CAGAAGACAT CCACTTCGGG TATTCAGAGC
TTTCTGGCGG ACAACGAAGA CGGATTGAAG CTGCATGATA ATATCACTCT ACCTGCGCCT
TTCCCTATAA GAATTGGTAA CTCCACAAAA ATGCTGAGGT TTATAAACTC GAGACATTTA
ATCTTTTGCT TCTCCAGTAA AGAGACTAGG CCCAAGCAAT GGAGACAATA CCTGAGCTTA
TCCCCCCCTC TCTTTCATTG TGAGGAGGTT TTGAGCGCAT TTCTCGACTT TGTCCGCACC
GCACGAGCAA GTTCCAAAAA CATTCTCATG TCGGTCGAGT GCTTGTCATT TTTAAACGCT
GCGCAGATCC AGCACTTTGT CAAAACCTGC TTTGTGGGAT GGGAAATTCG AGTCATAGTC
GTTTACCGCC GTTTTGATGA GTGGCTTCCA AGTTTTCACT TTCAAGATAC ACGAAGTGAT
CGGGCTCATT TACGCTCCAC GCTGGTGGAA TATCTGGACA GCCCGGAGAG TCTACATGCA
GCCGAGTTTG CGTACAGCCA TGCAGTGGCC CAGCGATACC GGAGTATCGA TCCTAATCTC
ACGATGTTGA ATTTTCACCA CATAGACAGT AATCGCAGTC TGATAGAGGA ATTCCTCTGT
CGTGGGTTGC AGGGTCTGGC ACCGCATACG TGCCGTATCG CGGAGAAATG TGTCGCTCCC
AAGGAGAATT CTGGATATTC GTTGGACGCC GGATTCGTAT TGGCCGTAGC CTTGAAAAAG
AATTTGCTGT CACGAAACGA CAGAGTCACC AACGGTACCA TCTCACTCCA GTACGAAAAG
CTGTTGTTCG AAAGTATCGA CCAAAAGCTG CAAGAAAGTA CGAATCTTCC GTTTTTTTGT
GCGACAGAGC CTGTCCAGCA GTACATCAGG AACCGGACCA TGGAATGGTT TCCATTCGAT
CTAGATTTTT TAGCCGCGAA ACAAAGCACA GGATACAACA AAAATCGCCC GCCCTTTTGC
TCCTTGGATG CATCAGCACT TTGTGAGCGT GACGATTGGC AATTGTTTCT TCGTGGACTT
TAA
 
Protein sequence
MRASNAALCV LLILTGCQLE GSRKYTSKFN PIVCDDGCVS KSRITPLNGQ KEFHSHQRPS 
YYPDSSALQS RLSWSSVPRA DKMHNETFVH RQKRTPLKSN QPIVFLDVGP QKTSTSGIQS
FLADNEDGLK LHDNITLPAP FPIRIGNSTK MLRFINSRHL IFCFSSKETR PKQWRQYLSL
SPPLFHCEEV LSAFLDFVRT ARASSKNILM SVECLSFLNA AQIQHFVKTC FVGWEIRVIV
VYRRFDEWLP SFHFQDTRSD RAHLRSTLVE YLDSPESLHA AEFAYSHAVA QRYRSIDPNL
TMLNFHHIDS NRSLIEEFLC RGLQGLAPHT CRIAEKCVAP KENSGYSLDA GFVLAVALKK
NLLSRNDRVT NGTISLQYEK LLFESIDQKL QESTNLPFFC ATEPVQQYIR NRTMEWFPFD
LDFLAAKQST GYNKNRPPFC SLDASALCER DDWQLFLRGL
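
The sequences above are displayed in space-separated groups of ten characters, so they need the whitespace stripped before reuse. The following Python sketch shows one way to do that, along with a GC-content helper and a translation step. It assumes the standard genetic code (NCBI translation table 1), because the Translation table field of this record is empty. All function names are hypothetical.

```python
def clean_sequence(text):
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(text.split()).upper()


def gc_percent(dna):
    """Percentage of G and C bases in a DNA string."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# Standard genetic code, listed in TCAG order for each codon position.
# ASSUMPTION: the record does not state which translation table applies.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(dna):
    """Translate a coding sequence from its first base, stopping at a stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three display groups of the gene sequence above.
first_groups = "ATGCGAGCAT CGAATGCGGC TTTGTGTGTC"
dna = clean_sequence(first_groups)
print(translate(dna))  # MRASNAALCV
```

The ten residues produced match the first display group of the protein sequence. Passing the full gene sequence through `clean_sequence` and `gc_percent` would give a value to compare with the GC content field.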