Gene PHATRDRAFT_47829 details


Gene Information

Locus tag: PHATRDRAFT_47829
Symbol:
ID: 7203066
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011683
Strand:
Start bp: 180912
End bp: 182417
Gene Length: 1506 bp
Protein Length: 479 aa
Translation table:
GC content: 54%
IMG OID:
Product: predicted protein
Protein accession: XP_002182341
Protein GI: 219124082
COG category:
COG ID:
TIGRFAM ID:
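
The length fields in this record can be cross-checked against the coordinates with simple arithmetic. The following is a minimal sketch in Python, assuming that Start bp and End bp are 1-based and inclusive (an assumption, although it agrees with the listed Gene Length). The function name gene_length is hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


# Values copied from the Gene Information fields of this record.
length = gene_length(180912, 182417)

# A 479-residue protein needs 479 sense codons plus one stop codon.
codons_in_protein = 479 + 1
coding_nt = 3 * codons_in_protein

# Nucleotides in the listed span that are not accounted for by the protein.
extra_nt = length - coding_nt

print(length, coding_nt, extra_nt)
```

Under these assumptions the span is 1506 bp, of which 1440 nt are needed for the 479 aa protein and its stop codon. The remaining 66 nt appear to correspond to the stretch of the listed gene sequence that precedes the first ATG codon.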


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GACAAGCCAT TGCTTGATAT AGTCCAAAAG ACTCTCGGAA TTGGTTTCCT CTCTCTCTAT 
AAATATATGC GTGTGTTCGT TCTCTATATC GTCGTATCTA CCATGGTAGC GGACGCTTGG
TTGACTGGTA GTCCGCAGAC TTTCCGTAGA CAATCTGTGC CACGGTCACT GGCGCCACCG
GATTGTCCCC CCATACGACA ACGACACTGG TCCGAGACCC CCACTACGCC AACCTTTTAC
AATATTGGCG GCGTGCGGAC ACGATTCACG ACACGATCCA TTCGTCGGCG TCCACTGACT
CGCTGTGACG CGAAAGATCC GTCCCGCAAA CGACGGCGAA GGTCGGAAAT CGACGACGAC
GACAGCCGTC GCAGTAACGA CGAGGACTCG TCCGAGCGAC TCGGGAGTCG AGTCAAACGC
TTGTTTACAC GAGAACCAGC ACCGATACCG GAACCAGTAC AGCCCGAGAA GTCTTCCGGA
GGACTGTTTC GTAATTTGTT TCCCAAGAGC GGGAACGACG TGGTTGAAAA GGAAGTCGCC
CGACAAAACA GCAAGCGCCA GAAAACTGCC CCAAAGAAGA CAATAAGTGT TTCCAAAAGG
GTCAGTAAAA GTCAAAGCTT GACCAAAGCT TCCAACGTGC GGCAGCGGGA ATCCAAAGAA
TCGCAATCGA GTGTCGATGG TTTCCTAGCC GGCACTGCGG GTCGATGGCA AAGTCTCTTT
AACTACACCG ACACGAAGAA AGCGACACCG GGGGCGGATG AAGATTCAGA CGACTCAAAA
AAGAGCGCAA TGACTCGGAT CTTGGGCGTC TTTTCCTCTC GTAACAACAC ATCGTCATCC
GACGAAAACG TTGTGGCGTT GGGAGGTAAG AACTCCACCA ATCCATTATC GGTTCTGCAA
AACTACATAC AGTCCTTCAG TTTCGGTGGA GACGGAAGCG ACGGTACCGG TGGAAAATCG
AAAGGTGCTG ACGAAGAATG GTTCGATGTT TTTCCGAAAA CCCGCATTTC CCCTGGTGAA
ATGGTACCCG TCACCGTGGC GGGCTTGGAT TTACTCGTTA TCGCGGCCGC GGACGGACGG
ACCTTGTACT GCCTGGCCAA TTCGTGTCCC CATTTGGGGA CGCCACTCGA GACGGGCAAA
CTCGTGCGAT TACCCGTGGA AGAGTCCACG ACAAGTTTTA TAGAGTCGTA CTCCGAAACG
GATGTTTCCA ACAGTAAAGG CCCCGACAGT GGCTTTTTTA CCGAACTCGA AGTCAGCTCG
ATACTTCAGA AGGATGGTTG CGAAGATTGC ATTGTTTGCC CGTTGCACAA GACAGCATTT
GCCCTCGGGT CGGGCCAGGT CCGGGGAGAG TGGTGTCCCT ATCCTCCCAT TCTAGGCAAG
ATCGTAGGGG CCGTCAAGCC CCCCACCGCG GCGGCAGTCT TTGACGTCCG AACCCGCGGC
AAAAATGTAC AAGTCCGTCT CAATACGCCG CTTCTGCAGC TCGGTCGCCC GGACCGTCAA
CAATAA
 
Protein sequence
MRVFVLYIVV STMVADAWLT GSPQTFRRQS VPRSLAPPDC PPIRQRHWSE TPTTPTFYNI 
GGVRTRFTTR SIRRRPLTRC DAKDPSRKRR RRSEIDDDDS RRSNDEDSSE RLGSRVKRLF
TREPAPIPEP VQPEKSSGGL FRNLFPKSGN DVVEKEVARQ NSKRQKTAPK KTISVSKRVS
KSQSLTKASN VRQRESKESQ SSVDGFLAGT AGRWQSLFNY TDTKKATPGA DEDSDDSKKS
AMTRILGVFS SRNNTSSSDE NVVALGGKNS TNPLSVLQNY IQSFSFGGDG SDGTGGKSKG
ADEEWFDVFP KTRISPGEMV PVTVAGLDLL VIAAADGRTL YCLANSCPHL GTPLETGKLV
RLPVEESTTS FIESYSETDV SNSKGPDSGF FTELEVSSIL QKDGCEDCIV CPLHKTAFAL
GSGQVRGEWC PYPPILGKIV GAVKPPTAAA VFDVRTRGKN VQVRLNTPLL QLGRPDRQQ
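
Both sequences above are displayed in blocks of 10 characters, so the spaces and line breaks have to be removed before any length or composition check. The following is a minimal sketch in Python of that cleanup and of a GC fraction calculation. The helper names clean_sequence and gc_fraction are hypothetical, and the sketch assumes that GC content is simply the fraction of G and C bases over the full nucleotide string, which may differ from how the database computed its value.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for 10-character display blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide string (0.0 for empty input)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed line of the gene sequence, copied from this record.
first_line = "GACAAGCCAT TGCTTGATAT AGTCCAAAAG ACTCTCGGAA TTGGTTTCCT CTCTCTCTAT"
seq = clean_sequence(first_line)

print(len(seq))                    # six blocks of 10 bases
print(round(gc_fraction(seq), 2))  # GC fraction of this line only
```

Passing all lines of the gene sequence to clean_sequence should give a string whose length matches the Gene Length field (1506 bp), and gc_fraction on that string can be compared with the listed GC content of 54%, if the record is internally consistent.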