Gene PHATRDRAFT_37892 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_37892 
Symbol 
ID7202674 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011682 
Strand
Start bp336494 
End bp337795 
Gene Length1302 bp 
Protein Length433 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182050 
Protein GI219123476 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGCTTA TGCCGGCCAA AGTTGAGAAT GATTTCCGCC TTAGTCCAAA AGTACACAAA 
CACTTTCGGC CTACGATGCC AATGTCGCCA CTGAGCAAGA CCTCCCCACC TCGTCTGGGT
TACAAATTAG GCACTCGCTT GTACAGAGCT CCGCAGGCCG CGATTCGGAA CCATTCTGTG
GTTAGTTCCA GGGATGACGT GATGAAGGGC GCACGTCGGA GACGGCCCAG TCGTGGCATT
AGCCATGGCA GCCAACGTCA GCAGCAGCAA GAAGGCATGG GATTTGCGTT AGAGCGACAA
TCGTCGGATA GGTCGTTGGG GTTGGAAGAA ATACTAGAAC CGCCCTTGTG GCATGAGCAA
AAGGATCCAT ATCGAAGACT CAACGGAAGT GGAAGAGAGC TAGGTGGTAG TGTGCCAGAA
ATAAATTGGA AACGGACACC ACCGGGTCGC AAGCTTAGAG CCTTGCACAT ACTTCAAGGC
AATGAAGATT ATTCAGAGTA TTTGGAGGAT GAGTCTTTCC GAACTTACAA CTCCTTTGAA
TGCCGGAGTT CAAAGGAAAA GGTTGCCAGT CCCGTAAAAT CACTATGGAC TATTACAAAT
TTATCCCAGC TCCTCCTTAT TGTTATGCTG GGTGGCTTTA TTTTTGACTC GCGCCGCAAA
GGAAAAACTC ACAAGGCTCA GCTGCAGCAG TATGACGAAG AACGAAGCCA TTTGCTTGAT
CAGATGATGT GGATTGACAA GGCTGCCAAA AAAGTCCATC AACGGTACCC CGTTCAATCT
CCAATAGATT TGGATCAAGA GACTAAAGAG CAGCTCAAGC AGGAAGTTAG GGATGCACAA
GATTCTTTGC AAAAGCTTCA GCTACGGGTC CAGCTGAATG ATAGACAGTT TTTACACGAA
AAATTTGGAG ACAAGCCGTT GCAAGTGGGC TTGAGCTTGG ATGCGACGGG GACGGAACGT
ATCTCCATTG CGTTGTCTGA TGACACTCCC CACGCTGTCT CAATATTTGT ACAGCAGGCT
GACAAGAATT TGTGGAGTGA CCTTCGTTTC GAACGACTTC TTTCAGGATC TATAGATGTA
TTCTCTACCC AAGCAACCAC AACTCCATTG CTAGAATTTC TCGAGCGCTC GCGTGGTTGT
CATGAACGCG GCGCTGTTGC CTTGAGACAA GAGGAAGACC GCGACATCAT GTTTTTGGTT
CTTCGGATAA ATCTCGAAGA TCAGTCTCCG CTCTCGAACA CGGACGTGTG TATTGGACGC
GTAGTTAAAG GCCTAGATTT GCTGACAAGT CGCGTTTCCT GA
 
Protein sequence
MQLMPAKVEN DFRLSPKVHK HFRPTMPMSP LSKTSPPRLG YKLGTRLYRA PQAAIRNHSV 
VSSRDDVMKG ARRRRPSRGI SHGSQRQQQQ EGMGFALERQ SSDRSLGLEE ILEPPLWHEQ
KDPYRRLNGS GRELGGSVPE INWKRTPPGR KLRALHILQG NEDYSEYLED ESFRTYNSFE
CRSSKEKVAS PVKSLWTITN LSQLLLIVML GGFIFDSRRK GKTHKAQLQQ YDEERSHLLD
QMMWIDKAAK KVHQRYPVQS PIDLDQETKE QLKQEVRDAQ DSLQKLQLRV QLNDRQFLHE
KFGDKPLQVG LSLDATGTER ISIALSDDTP HAVSIFVQQA DKNLWSDLRF ERLLSGSIDV
FSTQATTTPL LEFLERSRGC HERGAVALRQ EEDRDIMFLV LRINLEDQSP LSNTDVCIGR
VVKGLDLLTS RVS