Gene PHATRDRAFT_50569 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_50569 
Symbol 
ID7199394 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011699 
Strand
Start bp150316 
End bp151617 
Gene Length1302 bp 
Protein Length354 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185530 
Protein GI219130769 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GGAAAGAGCA TGAGTTTACT AGGCACAATG AATGTTGGCT TCTGGCTAGT TCCGTTCCGC 
GCGCTGTTCC AGCTCGATCA AGCAGAGACC GTCTCCAAGT ATCCTTCGAT TGTTTTACAG
GCCATTGTTG CATGTCGACT GCGAGCAGTG GCAGAAATGC ACTGCCGAGC GAGGACCTCG
GTGTTTTCGC AGCCGATTCG GTGGCAACGC CGTCAGGGGA CGAAGAACGC ACGATAACGA
CCAAAGGTTA GTTGCTCGTT TCCGTCACAG CAGCGTGAAG CGTTTCCACC ACATAGGCTC
GATCTGGACA GACGGCTGAA GAAGGAGCGA GTAAAAACGA AACGGCGCTT TTGCAAGCCG
ATGCGAACCA CCAATAATCG TAGAGGAGGT TGTCACAAGC AAAAGTCGCA ACCGACGACG
GCAGGAAAGG AGCGAAATCA GCGGATGGAA AAAACGGCGC GACTTCCATG CCAAAGCCGT
TTTACAGCTC CAGTAGGTAT TCAAGTGAGT CGCAACTGCA GGCGTTTCCA AAACCGGTAG
GGGAGACATT CGACGATCGC TTCCATGTGA CGAGCGTCAC ACCGATAAGA TTCACCAGGC
GAGGGCTCCG GCATGTTGCC GTCTTCTAAC AACCACCCGC AGTAACGTGG GCGCTCTAGG
CGACCTTGGC TCTGACAAAG AGGCGTGTCC TGTGGATTAC AAAGCTGACG TAGCGACTGA
TCCGGAATAC ATCAACGAAA CCGACAAAGA AATTACAGCG CAAGGAGCCA CTGCCGTTCA
AGTCGCGGAG GATTCCAAAG AAGCAAAAAT CAACATCGAA GGGCTTTTAC GGATTGTCTT
TCAAACCTTG CTGGAGGAGT GCCATATACC AAATTTTACC ACATACACCC TAGTCGCCGA
CAAACAGTTG ATTCTTGCCC AAGGCAAGCC CGTGGAGTTC ACCTTGTACA CGTGGAAGTT
GTGGCGCGAC CGGCATCATT GGGCATCATT TTGGAAAGAT CGGTACGTAT TGGTGTTAGA
TTCAATGTTG GAACCTTGGC TGGTTTTACG ACCGAACAAT GTAGGACAGC CTCCACGTAC
GCGCAACGAC CAATGGGCGT GACAACCGAC AAAGTGAAAG AGGAAGATGA TGCCGCTGCA
CCCGAGTCAG CAGACAAATC CCAATCAAAA GCTATCTCGG ATGGCGCAAC GGAACCTCCA
ACAAAACAGG AGCGGTCACA GGAACTCCTT GGGAATATAA TGGGATTGCT CGCAGGGATC
GCCATTCTAT TTCTTTTTGT GTCCATCTTG AAGTGGGAGT AG
 
Protein sequence
MSLLGTMNVG FWLVPFRALF QLDQAETVSK PLLHVDCEQW QKCTAERGPR CFRSRFGGNA 
VRGRRTHDND QRLDLDRRLK KERVKTKRRF CKPMRTTNNR RGGCHKQKSQ PTTAGKERNQ
RMEKTARLPC QSRFTAPVGI QARAPACCRL LTTTRSNVGA LGDLGSDKEA CPVDYKADVA
TDPEYINETD KEITAQGATA VQVAEDSKEA KINIEGLLRI VFQTLLEECH IPNFTTYTLV
ADKQLILAQG KPVEFTLYTW KLWRDRHHWA SFWKDRTAST YAQRPMGVTT DKVKEEDDAA
APESADKSQS KAISDGATEP PTKQERSQEL LGNIMGLLAG IAILFLFVSI LKWE