Gene PHATRDRAFT_50135 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_50135 
Symbol 
ID7198933 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011695 
Strand
Start bp155229 
End bp157421 
Gene Length2193 bp 
Protein Length626 aa 
Translation table 
GC content53% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184976 
Protein GI219129608 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATCGATAGAC CCACCAGAGG GCCAAACCTC GGCACCCTCA CGGTATCGAA TCACATCACA 
CCGTGTGTGG GAGTGAACGT ATTTTATTTT CGGCAGTGGG ACTGGTCATG GCGACAATGG
CCGCCGCAAA CGTGATAACG GGACAGTTCG ACGATGCCGA CAGTGACGAA GAAGACTTTG
TCGCCCGCAT GCAAGCACAA CGGCAGCCAG CGACGTTGCC CGTCCAATCC TCAAAATTGG
CATGCGTTTC TTCGTTCACC CGACCACCCG TCGAGTCTTT GCAAACTGTT TTCCACTCGC
CGTACGCACG AGATGACGAC GACGACGGCA ACGACGAGGA TGATACCTAC GACGACGACG
AAGTATTACG ATTGGGACGA TGCACCGCTC ACGTCCGTTG CGGCACAGAG TGTGGCGCAA
GCGTCCGCCG CATCATCCAA GCAAATGAAT CTGTCGCACG CCGTATCCAA CCGTGTTACG
CAGATGGAAC ACCTCGAAAC GCATAAGCGT ACCTTGACAC AGGGTAGGGA CGATCGGGCC
ACCTCGGAAC AGGTACTCGA TCCCCGAACG CGACTCATAC TCTTCAAGTT ACTCAGCCGT
GGATTCCTGG AAGCCATCGA TGGATGCCTG TCCACCGGCA AGGAAGCTAA CGTTTACTAC
GCCAAGGCTG GTAAACATCA TCTGCAACCG CAACGGTACG ACGCCACGCC TACGCCTACC
GACGAGGACA ATTGCGACAA CACAAAACCC CGGCACGTTA CCGAATACGC CATTAAAATA
TACAAAACGA GCATTCTCGT ATTCAAAGAC CGCGATAAGT ACGTCGCGGG CGAACATCGC
TGGCGCAAGG GTTACTGCAA ATCAAATCCG CGAAAAATGG TCAAGGTCTG GGCAGAAAAA
GAGATGCGCA ACTACCGTCG CATATACGCC GCCGGTATTC CCTGTCCGGA ACCCATCTTG
CTCAAAGCGC ACGTACTCGT TATGGAATTC CTCGGTGTCG GCGGTTGGCC CTCCCCTCGG
CTCAAGGACG CCGCGCTTTC CGACAAGCGC CTCCGGGAAG CCTACGTACA ATGCGTACTC
ATCATGCGAC ACTTGTACCA ACGGTGTAAA CTCGTACACG GCGATCTGTC CGAATACAAC
CTCCTCTGGC ACGAAAACCA AATTTACGTT ATTGACGTAA GTCAGTCCGT GGAAACAGAC
CACCCCTCGG CGTTGGATTT CTTGCGCAAG GACGCCTCCA ACGTCAACGA TTACTTTCGC
AAAGTCGGAC GACTCAACGT CATGACGGTC CGTCAGTTGT TTGAGTTTGT CACGGCATCG
GTTTTGCCGT GTGACCAGCC GAACGTCGCG ACGGAGCAAG CCGAATTGGC GAGTCTCGAC
GCCATTATGC AACACGTGGA CCAAACCGCA CTGCAGTTGG CGGAAACATC GGAACAGGGA
CAGCGCAAAG TAGAACAGCA AGAGGCCGTG GACGAAGCCG TCTTTATGAG CAGCTTTTTG
CCGCGGAGCC TCAACCAAGT AGCCGAGCAC GAAATTCAAA AACTGGCAAC CGGGGAGGTG
GAAGACACCT ACGCGCAAGC CGTCGCGTCC TTGACAGGCA ATCGCGATGT GGTCGAAGCG
GTCGCGAAAA AGTTGGGCCG GAACGACTTC ATGACAAAGT CGGTCCAATC CATTCTGACT
AACGCATCGT CGAGGGAATC TCCTCAAGAA GAGGAGCGAA AAGCCGGAGG GAAGAATGGA
GTGCATTTTT CGGCGGTATT GGACGAGAAA GGCGACAAAT CGCCAAGTAG GCAGAGTGGG
GAAGACGAGT TCATGTCCGA ATGTGACGAT GACAGTTCTG TAGATAGCGA GGGTGAGTCC
GTGGACAAAG AAATTGGTTT CGTCAAGACT CCTATGACCC CGGAAGAGTT TACTGCCATG
AAGGAAGCTG TAAGAACGGA GCGCCGAGCA AACAAAAAAG CCGTCAAAGA CGCGAAAAGC
GAACAGCGTC AGAACAAGGT GAAAAAGAAG GATAAGAAAC GAGCCATTAA AAAATCGAAG
GCCGGAAATC GAAAGAATAA ATAGATCCCG GTACAAAATC GGAATTCCTA CGTGCCAACT
CTAGAGTCCT CCTACCGAGC GATTCTGGGA TTATGCGGTT TTAGGAACAC TGGTTTGCAT
TTGCGACTCC ATAGTAAATG AAAGTTGCCA GAC
 
Protein sequence
MATMAAANVI TGQFDDADSD EEDFVARMQA QRQPATLPVQ SSKLACVSSF TRPPVESLQT 
YYDWDDAPLT SVAAQSVAQA SAASSKQMNL SHAVSNRVTQ MEHLETHKRT LTQGRDDRAT
SEQVLDPRTR LILFKLLSRG FLEAIDGCLS TGKEANVYYA KAGKHHLQPQ RYDATPTPTD
EDNCDNTKPR HVTEYAIKIY KTSILVFKDR DKYVAGEHRW RKGYCKSNPR KMVKVWAEKE
MRNYRRIYAA GIPCPEPILL KAHVLVMEFL GVGGWPSPRL KDAALSDKRL REAYVQCVLI
MRHLYQRCKL VHGDLSEYNL LWHENQIYVI DVSQSVETDH PSALDFLRKD ASNVNDYFRK
VGRLNVMTVR QLFEFVTASV LPCDQPNVAT EQAELASLDA IMQHVDQTAL QLAETSEQGQ
RKVEQQEAVD EAVFMSSFLP RSLNQVAEHE IQKLATGEVE DTYAQAVASL TGNRDVVEAV
AKKLGRNDFM TKSVQSILTN ASSRESPQEE ERKAGGKNGV HFSAVLDEKG DKSPSRQSGE
DEFMSECDDD SSVDSEGESV DKEIGFVKTP MTPEEFTAMK EAVRTERRAN KKAVKDAKSE
QRQNKVKKKD KKRAIKKSKA GNRKNK