Gene PHATRDRAFT_45357 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_45357 
Symbol 
ID7200041 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011674 
Strand
Start bp937529 
End bp939346 
Gene Length1818 bp 
Protein Length509 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179541 
Protein GI219117493 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0976017 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CCGGACGCTA CCAACACACC GCCGCACGAC AAGACTCGAC CAAGAGAAAC GAGTCTGTAG 
GAGAGGATTA CACTCCTACG GTACCGCTGA CGGCAACTGC AATTGACCGC AGTACAAGCA
TACATGCATA CATGCACAAA CACAGACACA TAGCACATAG CATTGACAGT ATGAGACTAT
TGCACCGACA ACGGAGTGTT TCTGGAAGCC TCCTTTGCGT TCTGCTCGTT TTTCTTTCGG
CAGTTTCCAA ATGTCAATGT TTCTCCACGC ATAGTTCGAG TAGACGAGAC TCTTTGCAAT
TGCCTTTGCA ATCAACAATG CCACTGAAAG TCGCTGCCGC TCCCATATTG GAAGCGTCTC
CGCTAGTAAT AGAGCCAACG AATCCCAAGG TGGGGGTACT ACTACTGAAT CTCGGTGGAC
CCGAGACCGG AGATGATGTC GAAGGTGCGT TTGCTGTCGG TCCATCACCA GTAGAAAGGC
GAGAGATGAC GGATTACCCT CTGCTATCAT CGTCATCATC ATCATCATCA TGATAGTCAT
TAACATCATG GGATCTCCAT GGACCAAACG TTGACTGTAT CTCACTTTGT TTGTTCCATA
GGTTTTCTGT ACAACCTCTT TGCCGACCCT GATATTATTC GACTGCCGGG GCCGTTGGCA
CCGTTGCAAA ACTTGATTGC CTTGTTCATT TCCAAACGAA GAGCTCCCAA AAGTAGAGCC
GCCTACGAAT CCATCGGAGG AGGCTCCCCG ATTCTCAAAT ACAGTAACGC CCAAGCCGAT
TTACTGTGCC AGAGTTTGCA GCGTCGGTAC GGTATGGACG TCAAAGCCTA CATAGGCATG
CGCTATTGGC ATCCTTTTAC GGAAGAAGCA CTGGACCAAA TTCAAAACGA TAGAATCGAA
GCTTTAGTGA TACTGCCTCT CTACCCACAA TTTAGCATTT CCACCTCCGG AAGCTCCTTG
CGTGTGCTAC AGGAAGAATT TTCGAAGCAT TCCGAGAAGT ACAGCAAAAT GATGCACACC
GTGGTCCCTT CTTGGTACGA CCGTCCCGGT TACGTCAAAG CCATGGCAGA TCTGTTGAAA
AAGGAACTCG ACAGCTTTAC TGATGCACAA ATCGCTGAAG CGAAGCAAAC CTCACCCGAT
CAAAAGCCCC TACACGTGCT TTTCTCGGCC CACGGCGTAC CGCAAAGTTA CATTGAAGCC
GGCGATCCGT ACCAACGTCA AATCCAGGAA TGTGTCGCTA AGATTAGTGC TGAACTACCC
TACGAAAATG TACAAGTCCA CCTTTCGTAC CAATCGCGCG TTGGTCCGAT CGAATGGTTG
AGACCGTACA CGGACGATGT CCTGCCCGAA CTGGGCGCCT CGGGTGTCCG TAATCTTGTG
GTTGTTCCAA TTAGTTTTGT ATCCGAACAC ATCGAAACCC TGGAAGAAAT TGATATCGAA
TATCGCGAGT TGGCAGAAGA ATCAGGCATA TCCAATTGGC GACGGTGCCC CGCACTCAAT
ACGGATGCCA CCTTTATTGA TGATATGGCT GATTTGGTCG TGGACGCTTT AGCGGAACCG
GCTCAGTCCG TAACTGAAGC CTGTGTCGCG AACATGGTGG GCGACGTGGA ATTGCAGCCT
TTGGACGAGC GGTTGGGTAT CAATACCGGT GGTGTTCAAG GGGTTGGTGC CGAAGGACTC
TTGTACCAAG CCAAATTGCA AGAGAAGGAA CGAGTGAATG CTCGGGTAGC CATGATGGGA
GTGCTGATTA CGCTGATGGT AGAATTGGCG ACGGGCAAGC CATTCACCCA AATGCTGTCG
AGTCTATCGG GAAAGTAA
 
Protein sequence
MHKHRHIAHS IDSMRLLHRQ RSVSGSLLCV LLVFLSAVSK CQCFSTHSSS RRDSLQLPLQ 
STMPLKVAAA PILEASPLVI EPTNPKVGVL LLNLGGPETG DDVEGFLYNL FADPDIIRLP
GPLAPLQNLI ALFISKRRAP KSRAAYESIG GGSPILKYSN AQADLLCQSL QRRYGMDVKA
YIGMRYWHPF TEEALDQIQN DRIEALVILP LYPQFSISTS GSSLRVLQEE FSKHSEKYSK
MMHTVVPSWY DRPGYVKAMA DLLKKELDSF TDAQIAEAKQ TSPDQKPLHV LFSAHGVPQS
YIEAGDPYQR QIQECVAKIS AELPYENVQV HLSYQSRVGP IEWLRPYTDD VLPELGASGV
RNLVVVPISF VSEHIETLEE IDIEYRELAE ESGISNWRRC PALNTDATFI DDMADLVVDA
LAEPAQSVTE ACVANMVGDV ELQPLDERLG INTGGVQGVG AEGLLYQAKL QEKERVNARV
AMMGVLITLM VELATGKPFT QMLSSLSGK