Gene PHATRDRAFT_44554 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_44554 
Symbol 
ID7197799 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011672 
Strand
Start bp889762 
End bp890858 
Gene Length1097 bp 
Protein Length351 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002178597 
Protein GI219115603 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CCACGTCCAA CCCGTCTGAT TTCGGCATTG TCACCAGTAA CATGTCCGGT AACGCATCGG 
CTGCACAGTC CGATGATGGC GACAAAGATC CCGTTGTTAC AGACAAGAAG GCAGAACCAA
AGGTGACGAA AGGGAAAGCT AAGTCGGCGA AGAAAGCTAC AAAAAATAAA AGGTTTCGTT
CCAAGAAGCC CAAAGACATG CCGCCTCGTC CGCTGAGTGC TTACAATCTT TTTTTTAAGG
AAGAAAGGGT GCGTATGTTG ACGGGAGATG TAGCGAGTGA GGGCGACGAC GCTCCAAAAC
GTATTGGGTT CGAAGCTATG GCCAAAACAA TCGGGAAACG GTGGAAGGAA TTGCCCGAGA
TTGAACTTGC TCGATACAAA GCCGAAGCGA AAGATCAAAT GGATAACTAC CGCAGGGAGA
TGGATAAGTA TCATATTAAT GTCGCAAAAC GCGCTCGTCT GGAAAAGGAA CAAGCGGCAG
CGCAAAAAGC CGAAGAGGAA GCCGCTGCGG CAGCGGCTAG AAAGACAATG TTTGAGTCGA
ACCCGCAAGG GGATATGCAG CAGATGGTAC CGGGTCTCGC TTCTACAATG GGAGGTTCAA
ACAGTGCCGC TATGGGAGGG TCGGGGAATT TCGATATGGA ACAACTTTTG CGCGCTCAAC
AAGGTATGCT TAATCCGGCG ATGAACGTTC CAATGATGGG ACTTGGAGCG AACTTTTCGC
AATTCTACGG GTCGTCCGGA CTTCCTGCTG GTATAGGGAC TGGTGGACTT GGCATGGGGA
TGGGACAGAC CAACTCTCAA TTTTTCCCAG GGTCTATGAT GCAGGGCAAT TCATTCCCGC
AGCAAGATAA CAGTCAGCAC ATGATGCAAA ACCAGAACCC CATGGCTTTC CAGCTTGAAC
AACATCTACA ACAGCAGCAA TTACAGATGC TTCAACAGCA GCAGCTACTC ATGCAAATGT
CTGGAGCTCA AGGTCAGCAA CCACCTTATG GAAGCGGAGG AGGGAGTGGG GATATTTCTG
GCTTTCCGGC AGGGCCTGAT TCGTCTTTTG CTTACAGCGA CCAAAATACC TTCCATGGCA
ACCCTGGAGG ATCTTAG
 
Protein sequence
MSGNASAAQS DDGDKDPVVT DKKAEPKVTK GKAKSAKKAT KNKRFRSKKP KDMPPRPLSA 
YNLFFKEERV RMLTGDVASE GDDAPKRIGF EAMAKTIGKR WKELPEIELA RYKAEAKDQM
DNYRREMDKY HINVAKRARL EKEQAAAQKA EEEAAAAAAR KTMFESNPQG DMQQMVPGLA
STMGGSNSAA MGGSGNFDME QLLRAQQGML NPAMNVPMMG LGANFSQFYG SSGLPAGIGT
GGLGMGMGQT NSQFFPGSMM QGNSFPQQDN SQHMMQNQNP MAFQLEQHLQ QQQLQMLQQQ
QLLMQMSGAQ GQQPPYGSGG GSGDISGFPA GPDSSFAYSD QNTFHGNPGG S