Gene PHATRDRAFT_14391 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_14391 
Symbol 
ID7203138 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011683 
Strand
Start bp542454 
End bp544136 
Gene Length1683 bp 
Protein Length525 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182249 
Protein GI219123890 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0183197 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAACCA ATGTAGACGA AAACAGGGGA GACGCTCGTG GAGCTGCACT CTTGTTGGAG 
GGAGTCACAG TGTATCGCGG TCCAGCCGAG ATTCTTAGAA ACATCGACTG GCGCGTGGAA
CCACGAACTA AGTGGGCTCT AGTTGGTGCA AATGGAGCCG GAAAATCAAC ACTTTTAAAA
GCCCTTGTTG GGGAAGTGGA TTCGCGTGGA AAAATTGTGA TCGGAAACAA GGAACAAGTG
GGGTACTTAC AGCAGACAGC TGTTGCTGGA AGCAATGGCA CCGTCTTTGA AGAGGCCTCA
TCTGGAATGC GCGAACTGAA TACGGCTAAA CAAGCAATGG AAAAATCTCA AGAAGTGGGT
GATTTACAAG CCTTAGAGAG GGCAACGACA AGATTTGAAC TCATCGATGG CTACAAACAA
GAGCAGAAAG TTGCCAGTGT TTTGAAAGGT CTGGGGTTTA CAAACTTTGA AATGCGTTGC
CACGAGCTGT CCGGTGGATG GCAGATGAGA GTAGCTTTCG CACGATTGCT TCTCAGTGAG
CCAACTCTTT GCCTGATGGA CGAACCCTCC AATCATTTAG ATGCGGCTGC CAAGAAATGG
CTTGCAAAGT ATCTTGCTAC GTACGATGGA GATGGAGCCA TGATTCTAGT CACCCATGAT
GTGGACCTAC TTAAATCTAT GGATCATATT GCTGAGGTTG TACCTGGAGC AGGAAGCTTA
CAGATTTACA AGTCGTGCAA TTACAACCAG TACTTGGATT TGAAAGAGCA ACGGGCAGCT
GCCGCAATTT CTCAGTATGA ACGAAGTACG GAAAAAGCTG CCAAGCTACA AGCTTTTGTG
GACCGCTTCG GTGCTTCGGC AACGAAAGCT TCAGCCGCAC AATCCCGTGT CAAGATGCTT
GAGAAAATGA AGCGAGACGG ATTGCTGAAT GCACCAGCGG ACGATATCAT TGCACAACGC
TTCAAGCCTT CGTTAATACT CCCGGATCCT CCCCGAGCCA TTGGTGAAAA GTTGATCTCT
CTGCAAAAAG CTGGTGTGGG CTATGATGGA GAGGTGCTTG TATCAGATAT CAACATTGAT
ATAATGAAAG GTATGAAACT TTTAATTCGT GGGCCGAACG GAGCTGGAAA GTCGACGGTG
ATGCATTCCC TTCGTGGCTC AATTTCATTG ATAGATGGTG ACAGAAGTAC AAACCCCGAC
TTGCGGCTCG GGGTGTTCAC CCAGGATTTA GCTCAAGAGC TTGACCCCAG TGCCCGGGCT
GTCGACTTAG TCACAGCGTA TGCTCGTACA GGGCTGGATG GAGATATTAC TGTCTCGGAA
CAAGAGGCAC GGGCGGCGAT GGGTAGACTG GGTCTACAGG GCGAAAAGGC TTTACGTCAC
ATTTGCGATT TGAGCGGTGG AGAAAAGGCA CGTGTAGCTT TGGCGATGTT CGCTTTGAAG
GCTAGCAATG TTTACTTACT GGACGAGGCG TCTAACCATC TCGACTCAGA ATGGTACGTT
ATAGAAGCTT CTTTCAGGAG GTTCTGTTGT AATTTTATGC ATCTATCTTT TATTGTGTTT
TACGCATCTG TCTTACTCAA CAATGCACTG CTTTTACATA GCGTTGAAGC CCTTGGTGAA
GGGCTCGGAT CCTGGGGCCA CGACACTGGC GCAATGGTCG TAATTTCTCA TGACAAGTCG
TTT
 
Protein sequence
MTTNVDENRG DARGAALLLE GVTVYRGPAE ILRNIDWRVE PRTKWALVGA NGAGKSTLLK 
ALVGEVDSRG KIVIGNKEQV GYLQQTAVAG SNGTVFEEAS SGMRELNTAK QAMEKSQEVG
DLQALERATT RFELIDGYKQ EQKVASVLKG LGFTNFEMRC HELSGGWQMR VAFARLLLSE
PTLCLMDEPS NHLDAAAKKW LAKYLATYDG DGAMILVTHD VDLLKSMDHI AEVVPGAGSL
QIYKSCNYNQ YLDLKEQRAA AAISQYERST EKAAKLQAFV DRFGASATKA SAAQSRVKML
EKMKRDGLLN APADDIIAQR FKPSLILPDP PRAIGEKLIS LQKAGVGYDG EVLVSDINID
IMKGMKLLIR GPNGAGKSTV MHSLRGSISL IDGDRSTNPD LRLGVFTQDL AQELDPSARA
VDLVTAYART GLDGDITVSE QEARAAMGRL GLQGEKALRH ICDLSGGEKA RVALAMFALK
ASNVYLLDEA SNHLDSECVE ALGEGLGSWG HDTGAMVVIS HDKSF