Gene PHATRDRAFT_50220 details

Gene Information

Locus tag: PHATRDRAFT_50220
Symbol:
ID: 7199004
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011696
Strand:
Start bp: 27589
End bp: 29410
Gene Length: 1822 bp
Protein Length: 576 aa
Translation table:
GC content: 47%
IMG OID:
Product: predicted protein
Protein accession: XP_002185191
Protein GI: 219130058
COG category:
COG ID:
TIGRFAM ID:
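
The Start bp, End bp and Gene Length fields above are consistent with 1-based, inclusive coordinates (29410 - 27589 + 1 = 1822). As a minimal sketch under that assumption, the relation and a GC percentage calculation could be written as follows; the function names `gene_length` and `gc_percent` are hypothetical helpers and are not part of this record or of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring the whitespace used for display."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Coordinates taken from the Gene Information fields of this record.
print(gene_length(27589, 29410))  # 1822
```

Passing the gene sequence listed in the Sequence section to `gc_percent` gives a value that can be compared with the GC content field; the spaces between the 10-base blocks are stripped before counting.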


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.265161
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGAACA TTAAAAATCT GGACAAATCT GGTGGTATTC CAGATTACGA GGACTTAGAC 
CTGGCTTTCG ATGATCTTCT GGAATCTGAA AACGATCCTA CAATCTTTCG TAAACCGACT
CCGAGTTCGA TCGAAAACGA CGGTGAATTG ACTGGGAGCA AGCAGAGAAA AGACAGTCTG
CCTGACGAAT CATCGCATGA TTACATATTG GGAACTCTAG TTGTCCGCGT AGTAGCCGCA
CGAGACTTAG AGCCTGCTAG CAAGCATAGT TTGGGCAAGA TGATTTTTGG AGGAGCGCAC
CATTCGAGAA ACAAGGGATC AGCCAACCCG TATGCATCCG TGCGCTTCGG CAGTACGACT
CAGAGAACTT CGGAGGTTTT TGATACCGTG AACCCAATTT GGCCGAGATC GGAAACAATG
TACATGGATG TCTCTCATCC TAGAATTCCG GATCACGCTA CACAACAGCC AAAATCGGGT
ACAGTCCCGT CAATTGTCGC GACAACGGAA AGTTCAACGC AGATCTATGA ACAGCCCGGT
AACCCCCCTC AAAAGCCAAA AGCGTCCACA AGCTTAGCGA CGCATAGCAT TTTTGAAAAA
GAACAAGTCG AATCTATGCT AGAAATGAAT GAAGCAAAAG ATCTTTACAA ACCATCTCGG
CCTATTTTAA CGGTTGCTAT CTTCCACGCC AATGAAATAG GTACTTTGAA GAAGTACAAC
CCATCGAAAG GCGATAGCGA TGATCTCTTT TTAGGAATGG TTGCAATTGA TTTGACGCCA
GTATTGACTG GGAAAACAAC TATATTTGAC CAGTGGTTGC CTCTAACAGG CACTGAAAGT
ACCCGTACTA CGGTGCGAAT TGTCTGTGAG TATGAAGCGA GCGATACCGC GCCCCAGTCA
CTAGATTGGG TTCATTTTAC TAGATTTTGT GATCCCGCCG ATTTTTACCC GGCTCATGGT
GATCGCTTGT ATAAAGTGGA AAGTTGTGAC GGTGACAATG TCACATTGTC ATGGACGAGT
TCGGAAGGTT GGGTGTCATC GTTTGTGGTA CATCGCAACA TGTTGGTGTG CGCCGCACGG
CATCAAGGGC CTATAGAATT TTATCAAGAC GAGATTCAGT CAATTGCAGA GAGATTGGGT
CACTCTCCGA TGGTTGACAC TGTTCAAGAG ACTCTCCGTA CGCTTCCCGA CGAAGGATTA
GTATCAGTTA GTGTCGACAT ATTCCGAGGT GGAACGTCGC TCCTTAACCG ATGGCTCGAT
CAAGGTGTTC GTACGATCAT TGACGACATC AAATTCGCGA CAAATATCGA TGGACGACAC
AATCCTAATT TTGAAGACAG CTTAGCTACC GATACGCTTG ATGAAGCCAG CGATTCCATA
TCCCAGGCAG CATTTGCTAG ACACTCGGTG GAAAAGTCCG AAGTCGAGTC TGCTATGGGA
ACAAACTTAC AGCCCCTTCC CAATATGCCA GCGTGTCCCA TCACAGGGGA GCCCATGATT
GATCCCGTCG TTGCCGCCGA TGGCCACACA TATGAGCGCT TTGCCATAGC ACGCTGGCTT
CATGAGAGCG ACAAAAGTCC GCTGACGGGC TCAATCTTAC CACACAAAAG TCTTGTTCCG
AACTACATGC TGGTGTCAAG CCTTCAAGAA TGTGCCGTAA TTTCTGTCGA GGAGGATCTA
CCAGTTGGTA ACGACGACGG ACCACTGGTA GAAGTAGTAC GGGATGTGTA GGGATTCAGC
ACCTCCGTCC TTTCGTTTAT CTCTTGCCCT TGAGTCTTTC GAAAGTATAA GTTGTATAGC
TGAGGTCTGC ATCTTGGTGA TT
 
Protein sequence
MANIKNLDKS GGIPDYEDLD LAFDDLLESE NDPTIFRKPT PSSIENDGEL TGSKQRKDSL 
PDESSHDYIL GTLVVRVVAA RDLEPASKHS LGKMIFGGAH HSRNKGSANP YASVRFGSTT
QRTSEVFDTV NPIWPRSETM YMDVSHPRIP DHATQQPKSG TVPSIVATTE SSTQIYEQPG
NPPQKPKAST SLATHSIFEK EQVESMLEMN EAKDLYKPSR PILTVAIFHA NEIGTLKKYN
PSKGDSDDLF LGMVAIDLTP VLTGKTTIFD QWLPLTGTES TRTTVRIVCE YEASDTAPQS
LDWVHFTRFC DPADFYPAHG DRLYKVESCD GDNVTLSWTS SEGWVSSFVV HRNMLVCAAR
HQGPIEFYQD EIQSIAERLG HSPMVDTVQE TLRTLPDEGL VSVSVDIFRG GTSLLNRWLD
QGVRTIIDDI KFATNIDGRH NPNFEDSLAT DTLDEASDSI SQAAFARHSV EKSEVESAMG
TNLQPLPNMP ACPITGEPMI DPVVAADGHT YERFAIARWL HESDKSPLTG SILPHKSLVP
NYMLVSSLQE CAVISVEEDL PVGNDDGPLV EVVRDV