Gene PHATRDRAFT_47992 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47992 
Symbol 
ID7203233 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011683 
Strand
Start bp644230 
End bp645600 
Gene Length1371 bp 
Protein Length432 aa 
Translation table 
GC content46% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182267 
Protein GI219123928 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00155367 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTTCGTTTCA TAATAACAGT AAACAATACA CACCAAGTCA GCAGATAATA TTCTAATCAA 
TTGGTGGTGA CCATGGCTTA CTCTGAAGCC GAGCGTGATA TATCCGCCAA AGATGCCGCG
GTTCGGAGGC TTCTTTCCCT GGCTAAAATA AAAGTCGCCC ACCCCGCGCC CTTTTCAAAC
ATTACCAAGC TGGATATCTT TGACTGTGGC ATTTCGTCCC TCCCAGATAG CTTCGCCGAA
GCATTCCCTG AACTTTCTAT TTTATTCTTG TCCAAAAACA AATTCAAGGA GATGCCAAAA
ATGATTGGTG ATTGCCCCAA GCTACAAATG GTTTCATTCA AGGATAATAT GCTTGCGACG
ATTCATCCTG ATGCATTGCA ACCACAAATG CGATGGCTCA TCTTAACAAA CAATCGTCTA
TCTAACCTTC CGGAATCAAT TGGTCGATGT CAAAAGCTGC AGAAATTCAT GTTGAGTGGA
AATCAAGTGG AATCTCTTCC AGATACTATT CGAAACTGTA TTAGCCTGGA ACTAATCCGA
TTAGCTTCCA ACAAATTGAA GGAGCCACCC ACAGCTCTTC TTGATATTCC AAGTCTTCGC
TGGGTAGCTT TATCGGGAAA TCCATTTCTG CAGCATCTGC AACCATCATC GGAAGCATTG
GACATTCTGG AAGAGGTGGA AGAAAGCATT GGCGAAGTCT TGGGACAGGG TGCTGGAGGT
GTTACTCGCA AGGTACTCTG GCGGGATCGT GTTGTTGCTG TGAAAGAATA CAACGGTGCC
ATGACCTCCG ACGGTCTTCC TGAGGAGGAA CGTCGCATAT CGTGTGCAGC ATCGGCACTC
AACTCCGCTT GTTTTATTGA AGTTCTAGGC GAAACGCAGG CAGGTTCCTT GGTGATGGAG
TATTTGGATC AGTATTCAGC TTTAGCCGGC CCGCCAAGCT TTGAAACCTG TTCGCGAGAT
GTATATACGG ACTCCGTATG TATTGTTCAT GATGAGCAAG CCGAGAAAAT CTTGTCTCAT
TTGTTGGAAG CCCTGGCTAA GCTCCACAGC GTTGGTATAT GCCATGGAGA CTTTTACGGA
CACAACATTT TGGTCTCTCA AGATGGATCG GACGTACGAT TGAGCGATTT TGGAGCGGCA
TTCTTTTATG ACAGAGAGCA TGAATACGGC ACTTCTATAG AAGCGATTGA GCTACGGTCA
TTTGCAGTTT TGGTTGAAGA AGTTAACTCT TTGCTCAAGC AGCAGAGTGA GCGATTAGAC
AAGCTCGTAA GAAAATGCCG AGAGCAGGGT TGTTCGTTTG CGAAACTTCA CATCTGGTGG
AAACAACTAC AACTGGCTGG GCTCGCATCT GCCTTCGCTG TTGACGCCTA A
 
Protein sequence
MAYSEAERDI SAKDAAVRRL LSLAKIKVAH PAPFSNITKL DIFDCGISSL PDSFAEAFPE 
LSILFLSKNK FKEMPKMIGD CPKLQMVSFK DNMLATIHPD ALQPQMRWLI LTNNRLSNLP
ESIGRCQKLQ KFMLSGNQVE SLPDTIRNCI SLELIRLASN KLKEPPTALL DIPSLRWVAL
SGNPFLQHLQ PSSEALDILE EVEESIGEVL GQGAGGVTRK VLWRDRVVAV KEYNGAMTSD
GLPEEERRIS CAASALNSAC FIEVLGETQA GSLVMEYLDQ YSALAGPPSF ETCSRDVYTD
SVCIVHDEQA EKILSHLLEA LAKLHSVGIC HGDFYGHNIL VSQDGSDVRL SDFGAAFFYD
REHEYGTSIE AIELRSFAVL VEEVNSLLKQ QSERLDKLVR KCREQGCSFA KLHIWWKQLQ
LAGLASAFAV DA