Gene PHATRDRAFT_50367 details


Gene Information

Locus tag: PHATRDRAFT_50367
Symbol:
ID: 7199189
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011697
Strand:
Start bp: 95115
End bp: 96449
Gene Length: 1335 bp
Protein Length: 230 aa
Translation table:
GC content: 51%
IMG OID:
Product: predicted protein
Protein accession: XP_002185327
Protein GI: 219130344
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTTCATCATT AAGTTGTTCT ATTTTATAAG TAAAATTCAA TGTAAAGCAT GTAAACAAAG 
GCATCTACCG TAGAGGACCT CCTTCTGGTT TTCTATTTTA CTCTGGTTTT CCATGCAAGC
ATGAGGAGTG TTGCGATTGC GGGGACGCCA AAGCCTGGAA CACTGCTGGT TGCAAACAAC
ACCGTCCTCC ACCAGCGCTA CTTTCCGAAA ACGCCACGCA TTACCCAGGC AATCCTTTTG
AAGCCATCCA CGCCAGCCGC CGCAGATGCA CACTGCCACC TTGGCCGTTG TGGTCGGAGC
CGCCGTACAC AGCCTTTTCA ATACCGTCAA GTGCGCCGGG ATTGGCACCA ATCCTGTACA
GTGGAGACAA CAATGGGCCG TTGAGGTGGT CAAAATTGCC AATGGCATCG TGCATCCCAA
AGATTACGCC CGCATCCAAC CCAAGGAGCG TCGCGGCAAT GACGACAACA GCAACCCCAC
AATATTGACC CGCCCGTTTC TCTGTCCACC TTCTGGGTGG AGGTCACCAC CACCCCTGCC
TTCCCAACCG GATTTCATTT GCAACTACAA CTCCACAATG ATAACGTATC CTCAAGAAGC
ACAGGCGTCG ACCCAACACC GCCACATGCC AAACAACCCC TCCTGCAGCG GTTGGGTGCC
ATGGCAAGCA TGACTACAGA TGGTGCGGAC AACACAATGA ATTGTGCGGA CTGGTGGAAA
AAGCGTGCAG AACACGTTGA CGGCATTGGT GTACGCGTTT GATACCGCTT CGCAGAAACA
GCAAGAGGCG GACCGGATCA CGTCAAGTCT CTTTCTGCTT TGCTTGCCTG GCTTGCAACT
GGTCAGCGGC GTCGGCACGA TTTGTCTGGA TCAACTCGAG GCACGTCAGT ATGATTTTGA
ACATACGGCT TCCGGGATGG ATGATGCGGA AGAAGATTTC GTCCATCAGC CGTCGAACGA
TAGCTTGTTG CCGGCCTTGA CGTACGTGGC GGGAGACCAT CCTCTCATGA CATCCTTACT
AGCGGCTCCG GACGACTGCT TTTGGATTCC CGGTCGACCC GTCACAAGCG GCTGCCGCAT
ATGCTGTGCA ATTTAGATTA CGTTATTGAG ACACTCGGTA CTGCGATTCG ACTTCTCTTA
CCACGACGCT TTCTGGTATA TCACGGGCCA CGTATGGAGC CCACACACAT TGTGATTCAC
TTGTGCCAAG TACTGACTGC ATCAAACCAA CAGCTACTGA TGCCAATGCA TGAGGAATAT
ACTCTACACA ATCATAGACA AATATAAAGG AAAAAATGCT AAATGGTTGT GTTACTTAAG
AGGTACTTCG CGCTC
 
Protein sequence
MHTATLAVVV GAAVHSLFNT VKCAGIGTNP VQWRQQWAVE VVKIANGIVH PKDYARIQPK 
ERRGNDDNSN PTILTRPFLC PPSGWRSPPP LPSQPDFICN YNSTMITVQN TLTALVYAFD
TASQKQQEAD RITSSLFLLC LPGLQLVSGV GTICLDQLEA RQYDFEHTAS GMDDAEEDFV
HQPSNDSLLP ALTYVAGDHP LMTSLLAAPD DCFWIPGRPV TSGCRICCAI