Gene PHATRDRAFT_47699 details

Gene Information

Locus tag: PHATRDRAFT_47699
Symbol:
ID: 7202706
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011682
Strand:
Start bp: 569189
End bp: 570379
Gene Length: 1191 bp
Protein Length: 231 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002182089
Protein GI: 219123557
COG category:
COG ID:
TIGRFAM ID:
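
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes 1-based inclusive coordinates and a coding region that ends in a single stop codon. Neither convention is stated in the record itself.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return abs(end_bp - start_bp) + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting one stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


gene_len = span_length(569189, 570379)  # Start bp and End bp from the record
cds_len = coding_length(231)            # Protein Length from the record
print(gene_len, cds_len)
```

Under these assumptions the span matches the listed Gene Length of 1191 bp, and the coding region is shorter than the full gene span, so the listed gene sequence would include flanking non-coding bases.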


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CTCGAAAGGC GGGATTTGGC GGGTTCGCTT CGTTCGGTTC GCATAGTCGA TGTTCCAGGA 
TGCTTCGTCT TCCTTTCTAC GAACGCTCGA TAGGCAAACA CTGACATTTT ATGAATAAAT
GGGTGCCTCG TTAACTATAC TGTTCGAGAG CACGACACTT GCAGTTAGTC CAGTCTGTGA
AGCCGGTATT TCTCTAGCGT ACCTTGAATA CGAACACCAT TCAAAAATGC CATCCACATT
CTCAAAACAT CAACACAAAC CATTTTTGCA AGAAAAGGTG TGCATGTAAA TATACTGTCA
CACGTATCGC CCTTCACAAT GAATGACGGA TTGAGCACAA CAATCGTACT CCAAGATAAT
GAAACTACAA CAAAACCACC CACAACTGCC GAAGGCGGTA TCGGGAATGA TGCAACGGAC
GCGGCAACTT TGGTTCTTGT CGTCTTTCTG ATAGGCTTCT GCATGATAAT CTGTCGAATG
GCCTTGCTAC GACAAAACGC CACAGATCAA CATGACCAAC AAGTCGAATC CAAACAGACA
CAGAAAAAGA GAATCGCGGA ACGCAAAGAG TACATTGCTT CGAACATGAT GGTGCGAGAA
TGGAAAAGTG CTGCTATCAG CGATGATTCG ATGACGAGTG CGTCCGACGA CTTGACTTTA
GAATATGGGG ATGAGACCGA CAATGTTCCT ACCGGATTAA GACCTTCGTC AACGAGCGGC
GACTCTGTTA ATCGAAAGGA GCAACTTGAA AACTCCGACT CCCATTCACG AATAAAGGGA
GATGACTTTC CCAGCTATCG CGAATGCACT CCTACGGCCG CGTTGAGTGA CTATGATTCC
TTTTGTGAAG ACACCGGGTG CGCCATTTGC CTTTCAAATT ACGAACCATG CGACCGCGTC
TGCGAGTCAG TCTCGTGCAA GCACATCTTT CATGAAGCTT GCATGTCCGC TTGGCTCATC
AAGCACGATC GGTGCCCAAT TTGTCGCGAG CCCTATTTGG TAGAAACGGC GTGAAGAACG
AACGTATCAT AAATCGGACC CAGCGTGATT CCGGATAGAA GCGAAAGCCG AATCGGCGTT
TTTGGTGTTT GGTTATGGTG ACGCTCCCTC TGTGACCAGG AGCGGCAATC CCCATTAGAG
ACAAACCGCC TTCGCTGTTA ACAGAAATAT AAATCCTAGG GTACACTTCA C
 
Protein sequence
MNDGLSTTIV LQDNETTTKP PTTAEGGIGN DATDAATLVL VVFLIGFCMI ICRMALLRQN 
ATDQHDQQVE SKQTQKKRIA ERKEYIASNM MVREWKSAAI SDDSMTSASD DLTLEYGDET
DNVPTGLRPS STSGDSVNRK EQLENSDSHS RIKGDDFPSY RECTPTAALS DYDSFCEDTG
CAICLSNYEP CDRVCESVSC KHIFHEACMS AWLIKHDRCP ICREPYLVET A
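
The sequences above are printed in space-separated blocks of 10 residues, which most tools will not accept directly. A minimal sketch of joining the blocks and computing a GC percentage follows. The helper names are hypothetical, and only the first line of the gene sequence is used as a demonstration, so the result is not the record's whole-gene GC content.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First line of the gene sequence, used only as a short demonstration.
first_line = "CTCGAAAGGC GGGATTTGGC GGGTTCGCTT CGTTCGGTTC GCATAGTCGA TGTTCCAGGA"
dna = join_blocks(first_line)
print(len(dna), round(gc_percent(dna), 1))
```

Passing the full gene sequence through the same two functions would give a figure comparable to the GC content field.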