Gene PHATRDRAFT_21557 details

Gene Information

Locus tag: PHATRDRAFT_21557
Symbol:
ID: 7202423
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011681
Strand:
Start bp: 448509
End bp: 449795
Gene Length: 1287 bp
Protein Length: 348 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002181728
Protein GI: 219122803
COG category:
COG ID:
TIGRFAM ID:
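
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The Python sketch below assumes the start and end positions are 1-based and inclusive (an assumption, although it is consistent with the listed gene length), and that the protein is followed by a single stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 448509
end_bp = 449795
gene_length_bp = 1287
protein_length_aa = 348

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Bases needed to encode the protein: one codon per residue plus one stop codon.
coding_bp = (protein_length_aa + 1) * 3

# The gene is listed as spliced, so the genomic span can exceed the coding
# length; the remainder is intron sequence plus any flanking untranslated bases.
non_coding_bp = span_bp - coding_bp

print(span_bp == gene_length_bp)  # True if the table is internally consistent
print(coding_bp, non_coding_bp)
```

Under these assumptions the span matches the listed gene length, and the remaining bases are what the "Is gene spliced: Yes" entry would lead one to expect.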


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTCTATCTGC AATCCAACAT GGCCAAAAAG AAAGATAGTT TCAAACGCCG TCGAATTAGC 
GGTGATCTTG TCGAAAAGGC TGGAAGCCTG GATGCAGGGC CTCAACAAAA AGTCGACGAA
AAGGTTGTCA CGGAGGATAG TTCTTCAAGG AGAAAAGCTG AACTTCTCCA AGCCGAAAGT
GAAGTACATA GCGACGGTGA GGATATCGAC GATACTACCG TGCGGAATGA TGGTCGGTAT
CGAAACAAGC AGCGTTGTTT GACCCTTTGC TCTCGGGGTG TCACGGCACG CTACCGCCAT
CTATTGGAAG ATCTGCGCAC GCTCATGCCG CACCACAAGA AGGAGTCCAA ACTTGATCCA
GGTGAGGACG GGGTTGGCCA GGCTGTCAGC GATATTTGCG AAATGCGTTC CTGCAACACG
ACTATGTTCT TGGAATGTCG CAAAAGACAG GATGCGTACA TGTGGCTGGG CCGCGTCGGT
GGTCAGTCGC CCGGCCCGAG TGTGCGTTTT CATGTCACAA ACATTCATAC GATGGATGAG
CTGCGTTTGA CGGGGAATTG TATGAAAGGA TCCCGGCCGA TTATGACGTT TGATGAGAGC
TTTGGGCGTG TGGATCACTT GAAGCTATTA AAGGAACTCT TCATTGACAC ATTTGGTACC
CCGCGGGGCC ATCCGAAGAG CAAGCCATTT GTTGATCGGG TGATGGCGTT TTGCTATGCG
GACAACAGGG TAAGTCCTTC TATTACACTC TACTGTCTGC TACGTCCGTA AAACAGAGAT
CATGCTAGTG ATTACAAAAA TGATGAGCGT ATGTCAGAGA TCACTGATCA ATGAGAATAC
CATCATTATA GACTACGATG GAATGAAGCA ATGTCGGGCA ATCAATCCTT AGCCGCTTCT
CTGTTGCAGC ATGGCACTCA CATCTGCATT CACTTTTGCT AAAACTTTTA GATCTGGGTG
CGGAATTATC AAGTTATTGA AGAACAGCCG TCAAACGCCA AGGAAGCTCA TCAAATTAAG
AAGAATTCAG GAAGAGAAGA GGCTACTTCC ATGGTGGAAA TTGGCCCGCG TTTTGTTTTG
AACCCGATCC GCATTTTTCG AGGATCGTTC GGAGGTCAAA CCTTGTTCCA GAATCCTGAT
TTTGTGTCAC CTAACGAGAT CCGTTCCTTG GAACGAAAGA GCAAAGGAAG TCAATATGAT
CAGCGGAAAA ACTCGCAGAA GGAGCGACAC GAGCGGAAAT CACAACTGGT TTTGCCGGAA
GACCCGTTGA AATCCGTTTT TCGGTGA
 
Protein sequence
MAKKKDSFKR RRISGDLVEK AGSLDAGPQQ KVDEKVVTED SSSRRKAELL QAESEVHSDG 
EDIDDTTVRN DGRYRNKQRC LTLCSRGVTA RYRHLLEDLR TLMPHHKKES KLDPGEDGVG
QAVSDICEMR SCNTTMFLEC RKRQDAYMWL GRVGGQSPGP SVRFHVTNIH TMDELRLTGN
CMKGSRPIMT FDESFGRVDH LKLLKELFID TFGTPRGHPK SKPFVDRVMA FCYADNRIWV
RNYQVIEEQP SNAKEAHQIK KNSGREEATS MVEIGPRFVL NPIRIFRGSF GGQTLFQNPD
FVSPNEIRSL ERKSKGSQYD QRKNSQKERH ERKSQLVLPE DPLKSVFR
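
The sequences above are laid out in space-separated blocks of ten characters. A minimal Python sketch for turning such a layout back into one continuous string and summarizing it is given below. The helper names `join_blocks` and `gc_fraction` are hypothetical, and the demo input is only the first line of the gene sequence, not the full record.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the listing above.
first_line = "TTCTATCTGC AATCCAACAT GGCCAAAAAG AAAGATAGTT TCAAACGCCG TCGAATTAGC"

seq = join_blocks(first_line)
print(len(seq))                    # number of bases in the joined line
print(round(gc_fraction(seq), 2))  # GC fraction of this line only
```

Applying the same two helpers to the whole gene sequence gives figures to compare with the Gene Length and GC content entries. `join_blocks` also works on the protein sequence, where the resulting length can be compared with the Protein Length entry.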