Gene PHATRDRAFT_50388 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_50388 
Symbol 
ID7199202 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011697 
Strand
Start bp198158 
End bp199487 
Gene Length1330 bp 
Protein Length354 aa 
Translation table 
GC content57% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185291 
Protein GI219130269 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTCACTAAAT GTTTGATAGT GCGCCACGAA GGAATATTAT ACCGCGATTC ATACGTTGAT 
CCATTGATAC ATCCATCCTT TCTTCCACAC TTTCTTGTCA CCTTTGCAGG TATTAGCGCA
CCCCCGAATT CCATCCGTAG TGCTAGCAGC GTGCGCTGGT GTGTGCTGGT TGAACCTGCA
CCGACTCGCA AACAATTCGG CTTTTGTACT AAAGTCCGTC GCCGAAAATG TCGTCTGCTT
CCGATCCACA AGGCTGGGTG ACGAATCCAC CATCCGTGCT CGTCAAGGTA CGAGATATTA
CCGCCAAACC CCACTTGAAC GGCGAGCTCG GGGTCGTCGT CGGCTACCTT CCGGACCGCA
CGCGCTACGT CGTCGTCACC TGTCGCCAGC AGGAACAGCT GTCGCTCAAG CCGGAGAATC
TTCACAAGGC CAACTTCCTC GAACAAGCCA AAGGACAATA CCAGCTCCTC ACGAATGATC
CTCGTGTGCG ACGACAATTG CAGCAAGTTT ATCATCGCGT ACAGACAAAA CTGCCGGCCC
CGTTGCAACC GGAACACGTT GCGGTCGTCT TGCTCCTCCT TATTCTAGCC AGCGGATACT
TTCTGGGTGT CAGCAAAACA CTCATGATCG TCTCTCTTTT ACTCGGCCTA GCCACTCTGG
CGGGTCCGGA AATTGCGGCG GGCAAGAGTT GGGAACAAAT TGGGCGGGAC TTGCCGCGTA
GGGCCACGAG CACCTTGCAA GACACCATCC GACAGAGCGT TCCCTACGTC GGACCCAAGT
TGGCCGATAC GCCTTACGTC GTCCCCGCAC TTCTCGGTAT CCTGCTCGCC GGGACCGTCA
AGGTGTTACT TCTACCGGCT CAACCCCGCG TGCCCTTGGA GACGGCGGCC ACGGCGTTGC
GGACGGAACC CGGGGCGCCC AGGCGTGGCG GGGCCCTACC GGACGCGGAA GAACTCTACA
AGCTAGGGTT CGACGACGCC ACCAGTAATT TGGCCTTTGG GACGTCGTTG TCACCCCCAT
CCGTGGTGCC GGACGACTTT CTCGTCAACG ATGACTACAG CGATATGCCG TCCTTGACGA
CGAATACGCG GGTATCCCCG TGGAACTGGA GTACGCTCAT GAGTGTTTTC TATCTCGGAC
GGACCGTGTA TGCCTTGGGC TGGGATCCGG TACAGGGCGC ATGGAGTTGG GGACGTGCCA
AGGCCAATTT GGTGACCCAG CCTACCTATC AATCCGCCTT TTTGGCCTTG TCCGTCTATC
GGGTTGTCAG TGCCATAGCG GCTTCGCGGT AATGGCTTAC GACCTGGACG ACACCGTGAC
ATGAAAGGAG
 
Protein sequence
MSSASDPQGW VTNPPSVLVK VRDITAKPHL NGELGVVVGY LPDRTRYVVV TCRQQEQLSL 
KPENLHKANF LEQAKGQYQL LTNDPRVRRQ LQQVYHRVQT KLPAPLQPEH VAVVLLLLIL
ASGYFLGVSK TLMIVSLLLG LATLAGPEIA AGKSWEQIGR DLPRRATSTL QDTIRQSVPY
VGPKLADTPY VVPALLGILL AGTVKVLLLP AQPRVPLETA ATALRTEPGA PRRGGALPDA
EELYKLGFDD ATSNLAFGTS LSPPSVVPDD FLVNDDYSDM PSLTTNTRVS PWNWSTLMSV
FYLGRTVYAL GWDPVQGAWS WGRAKANLVT QPTYQSAFLA LSVYRVVSAI AASR