Gene PHATRDRAFT_35852 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_35852 
Symbol 
ID7201057 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011676 
Strand
Start bp897427 
End bp899026 
Gene Length1600 bp 
Protein Length422 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180342 
Protein GI219119151 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.318034 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGGGGG AATGTAATAC ATCCCGAAAT CATAAGACCC AAATCCTTTC ATGGGACTCC 
GGAAACGTCG TTGAGAAAGC GTTGTTTTAC CCGTTCTTTT TTCGAATTTC TTCGAGGTCA
AGCAGGCAAA GAACAATTTC TGGCTGGTGT TGCTTACTTT TCTAATAATC TCACCGTAGT
GGATACGTAG GTCCATATTT ACAGTACATT CCATACAGGA CGTGATGATA CGCAACATAA
GATTAAATTA TACGGGCTGT GACATACTAC CTATGTCACA TCACCAGATC CACTACCACT
GGATTTGCTC ACTTTTGTAC GTGGAATGAA GAGTTTGGCT CAAGTGCTTT TGTGCACAAT
TTGCATTCTA CCAGGGTGTC CAAGCTTGCA GAATTTGAAA CGGAAGCTGC TGATTCAGTG
AATAAAGCTA TGCTAGAAGC ATTGGAGAAG ATACAAGTCA ATATTGCAAA AAGGGCTGGT
AAGATAAATT GGTCTAAAAT TGAAGTAAAA AAAAAAGACG ACTCCCAACC AGTCGTATTG
CTCCCCATGC CAATTGTTTG GCCCAAAACA TCACCAGCCT TGAACACAAA CCTGACTTTC
GACACGCCGT GGCTCTCCGC CGCTCAAGTG GCCACCAAGA AAAGTGACCC CGTCGTGAAA
AGGAATAAGG ACAGCGAACA GATTTCCGCT AGCACACTTG TCGCCTCGCG ACCTAGCGTG
GTCGATCCAC GGTTGGCTAC GTTCTTGGAA CGCTTTGACA ACGCCGAATG GCGATTGCAA
GCCCTGCAGG CCAAAAACGC CGAACTCCAA GCCAAAGTGC AGGAAGCCGA ACGGCAAAAA
CGCGCCATGC AACAGTTGGC CAGGTAATGC AAGTGCCTCG ATAGAAATTC TCATAGATAG
AATCGGCGCG GCGATCTATA GTGTCGTAGC TATGCACACG GGGCCCGGAG TAACATAAGT
CAGAGTTGAA CGGTCTCGCT CGGTAGCTCA ACATTGCATC GTTCGGACTA CTCCGACGAT
CAGACGCCGT TTGTCCCAAA GTGTTGTGAA TGTGGCAGGA GTAGCGATCG TAGGGAAGCG
TCCCAAAAAT ATGGCGTGGG TGACTTTGCG AGCTCGTCGG CATGCAGGGA CGACGTTGCA
CCACCACTGC AACAGTAGCC GATTCCTCTG TACCGATCGT GTTGTAATCG AGGAAAGGAC
AATGACTCTG AATCCAAGGA ATCGTCCGAA CCGTCATAGG TTCCTTTGCG AATACATATC
TGATGTGGAT GCGTGCGGTC GTCGTCGCTG GAACCGGACA AATCCACGGT CCATACGGGC
CGTCGCGAAG TCAAGACGCA CTGTGTGTGC GACTGCTGTC GTTGCTCCTC TCCTTCGGAT
ACGTGAGCAG GATCTCGCAC ATTCTCTCCT TTCGAAAGGA TTCCTATCCC GGGCAGTCTC
CAGGCGCATA CCCCACGAGC GAGACACAAT TCGTCCAGCG ATGGTAAAGT TGGCAAGCCT
TCCGCACGGC ATGGGTACAA CATGGAGTCG AGGAAAGGTT GTTGAAGTTG TTCCAAAACA
CCGATCCTGT GGAGAGCTTG AACGGTGTTC ACGTATCTAG
 
Protein sequence
MMGECNTSRN HKTQILSWDS GNVVEKALFY PFFFRISSRS TTTGFAHFCT WNEEFGSSAF 
VHNLHSTRVS KLAEFETEAA DSVNKAMLEA LEKIQVNIAK RAGKINWSKI EVKKKDDSQP
VVLLPMPIVW PKTSPALNTN LTFDTPWLSA AQVATKKSDP VVKRNKDSEQ ISASTLVASR
PSVVDPRLAT FLERFDNAEW RLQALQAKNA ELQAKVQEAE RQKRAMQQLA RRRLSQSVVN
VAGVAIVGKR PKNMAWVTLR ARRHAGTTLH HHCNSSRFLC TDRVVIEERT MTLNPRNRPN
RHRFLCEYIS DVDACGRRRW NRTNPRSIRA VAKSRRTVCA TAVVAPLLRI REQDLAHSLL
SKGFLSRAVS RRIPHERDTI RPAMVKLASL PHGMGTTWSR GKVVEVVPKH RSCGELERCS
RI