Gene PHATRDRAFT_50610 details

Gene Information

Locus tag: PHATRDRAFT_50610
Symbol:
ID: 7199446
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011700
Strand:
Start bp: 77337
End bp: 78523
Gene Length: 1187 bp
Protein Length: 317 aa
Translation table:
GC content: 51%
IMG OID:
Product: predicted protein
Protein accession: XP_002185562
Protein GI: 219130839
COG category:
COG ID:
TIGRFAM ID:
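
The gene length listed above agrees with the start and end coordinates if both are read as 1-based, inclusive positions. The record does not state its coordinate convention, so that reading is an assumption. A minimal Python sketch of the check, using a hypothetical helper name:

```python
def gene_length(start_bp, end_bp):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


# Start bp and End bp as listed in the record above.
print(gene_length(77337, 78523))  # 1187
```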


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCCGCT CCAGTTGCTC TTTCGCTGCA ATCATGCAAA TGAACAACGC TGCCTTGTCT 
CTGGTGCGGC AGGGGCAGCA CGCGCGAGCT AATAGACTCT TGCAAACCGC CATGGACTGT
CTCAAAACCG AGCTCGCTCA ACCTGCGGAA GATACTACTG TTGGAGGGAT GCAGCAGTCC
CCGGGAACAG CGTCCCTGCA GGTTGTAATG AGTATTCCGC TCCCACCGTA TCCTCCTCGT
GTCGGAGCAA CTCTACCGGG ATTCACGGCG ACGGAAACAA ACACTAACCT TCCCATACGC
TACGTCAATA ACAACGGCCA CAGTGATTCT CGGATGGAAA CTCCAGTAGC GAACGAGCAA
TACAACTGCG AGGACTTCAC CTCGGTCCCG CATGGATTTT TCAACAGAGC GTTTGCCGTC
GCACCTGAGG ATACTGCAAC CGGTGGCCGC CGGGACCCGT TTGCTGCCTC TGTCTCCGGG
ACCGTTGCCT CGGCAACTTC GGAGGAACTG GTGATTCTTA TCGTGATCTT GTTGTTCAAC
ACGGGGCTCA TCTATCATCG TGAAGGAATC GCGACGCACG CTGATCGACT CTCGACCACC
AATTACCAGG TGTCAAGGGA GAACAATCCC TTACTCAAGG CTTTGGTTCT GTACGACAAG
GCCATCCAAG TCGCCGAAAG CGTGGATTGG TCAAAAAGCA ACAGTTCTGA TACTTTCCAT
TTGTCCACGT TGCTTGCGGT GCCGTACATG GCGCTTTGTA TCAACAAAGC CCACATCAGC
GGGGAGGCTT TTCTGAACCG CGAACAACGA TCACACGCCG TACACAAGTT CTCCGAAACC
TTTCGAGCGC TTGGTCCAGC CAAAAATATT CTAACGCCCG AAGAATCCGC TTTCTTTTTC
ATTGAAAGCT GGTTTGTATC CGTGGGTTGG AATCATTTCG CTGGAGCAGC TTGACCTGGA
TTGCAAGGAA TTCTGGTGAA CAGAGAGTCG TTCGCAGGAA TGCTTACTTC GGCTTCAAAG
GATGAAGAGC AAGTGCGCAG GCCCTTTTTT CCTCATTGCT ACCGATTTTG AATTATGTTG
AAAGAATCGT CCACTTGTCC AAATCACGCG TGAGAAACCA TTTTTGCAAT GCGCCAAATT
CATATATATA CATACATCTT GACTAAAGTA AGTTGTACCT TCTGAGC
 
Protein sequence
MSRSSCSFAA IMQMNNAALS LVRQGQHARA NRLLQTAMDC LKTELAQPAE DTTVGGMQQS 
PGTASLQVVM SIPLPPYPPR VGATLPGFTA TETNTNLPIR YVNNNGHSDS RMETPVANEQ
YNCEDFTSVP HGFFNRAFAV APEDTATGGR RDPFAASVSG TVASATSEEL VILIVILLFN
TGLIYHREGI ATHADRLSTT NYQVSRENNP LLKALVLYDK AIQVAESVDW SKSNSSDTFH
LSTLLAVPYM ALCINKAHIS GEAFLNREQR SHAVHKFSET FRALGPAKNI LTPEESAFFF
IESWFVSVGW NHFAGAA
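
The protein sequence can be related to the gene sequence by translating codons from the first base. The sketch below is illustrative only: it assumes the standard genetic code (the Translation table field in this record is empty), and the names `CODON_TABLE` and `translate` are hypothetical, not part of the database. It translates the first 60-base line of the gene sequence and stops at the first stop codon.

```python
# Standard genetic code (assumed), with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate DNA from its first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence as shown in this record.
first_line = "ATGAGCCGCT CCAGTTGCTC TTTCGCTGCA ATCATGCAAA TGAACAACGC TGCCTTGTCT"
print(translate(first_line))  # MSRSSCSFAAIMQMNNAALS
```

The output matches the first 20 residues of the protein sequence listed above.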