Gene PHATRDRAFT_50022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_50022 
Symbol 
ID7198721 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011694 
Strand
Start bp170792 
End bp172357 
Gene Length1566 bp 
Protein Length362 aa 
Translation table 
GC content55% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184907 
Protein GI219129461 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.144811 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAGCGT TCGGGGAATT CAAACGTCTT TTTGCGATTT GCGGTAAGTA ATTTTCGTTG 
CTCCGTAGGA AAATGGCGAG ACGAATTCGC CTTTCGCGTT GGCATTGGCG AGGGCGCGAG
TCCGGCACAA CGTCCGTCCT TCTATCGTGT CGAAACGCTG GAAGCGGATC ACAGGAACGA
CTCGTTTTTC GTCTATGTAA GTTTCCGTGC TCGATACACT ATCACCGGTA CACGACGCCA
ACTGCAGTGA CGCCAATCGG ATGCGTTCTA CGTAGTACGC ACTGCGACAA GAAATGCTCC
AATCCAAACC TCCCTACCTC CTCCACCCGA ATTTCCATTC GAGAATTCTG GTGCTTCTTT
CCGACGCTCG GAGGCGGACC AAACGGCCGT AAGACTCACC ACAACAAAAC AACGCACAAT
CGAATCTCTT GGTCCTTCCA CGAGGTTTCT TTGTGCACAA CAGTGAGTTC ATCAACGTAC
CAGTAAATCG CAGTCCTTAT CCCCAAAGCC CGTGCCACTG CTGCCGTGCT TTTATCTCTA
CGACTGTGGA GGATCTAACT AGAGGCAGTA GCTGACTGTG AGAACGCCAG CAATCCCCCC
TTGCAGTAGT CCGCCATTGA CCTACGATTC GCTCGGTACT GCGTACATAC ATTCATCCAT
CCAGACGTAC TTTTATACCA AAGCATTCTC TCGCACAACA CCGTTCCAAT GCCCGCCTTT
CGACCCTTGG CCTCCACCCG CATGTTGCTT ACGCACGTGG GTGTTGGAAT GGGAGCAGCG
TCCTTTTGGA GAGGCGCATG GTACGTGTTG GACGATCACC TTTTCCCAGA AAACGCCACA
CACTCGGCGG CAGCCTCACT CGTGCTCGGC GTTGTGGGCA TGGGAGCTTC GCAGGGACTC
GTAGCCCGTG CCGAAGCCTT GTCACAAAAG ACACCGAAAC GGAAACTGCC CGTGGCGGCG
GCGCGTTTCG GGGCACTCTA TACCGTGGCC GTCTCGTGTG TGTTGGTCTG GCGGGGAACC
TGGGTGGGTT GGGATTGCCT TTACGAACGC TTGCATCCCC ATCCCGATAC CAAGTCGACC
GATCCCGGAC ACGCGACTCA CTCCGGAATG CTGTCGCACG TGGTGAGTGT CACGCTACTC
CTCGCTACAG GTTTGTTTGC CTCTGTCTTG GCTCCGCCCG CAGCCGTAAG TGTCATTCGC
GACTGGTCGA TCCACTCGGG GAGTCGAGCC TACTCCGGAC CGGCACAATC GGTTTTCAAC
AAGCTTTTCC CATCGTCGTC GTCCTCATCA TCGGTTCAAA CGGCTGGAGG AAACGGCTTT
AGTCCAAGCC GAGCATTCCT GTCGACAACC TCGAACCGAT TGTCCGCACG GGGTCAGCAT
CCCCATCCGT CGAGTCTCCT GCGGACAGAG GGTTCACGAA CAAGCAAAGT GCACCGAACA
ACTTACACAT CGAGTACGCG GTGAATGTAT GCGTTACCGC CGTCGCGGCC TAACTTTCTT
TGTATCCCGA TCCACACGCC AACGCTACCA GCTAGTCGAA TGTAAACTAG AAAATACAAT
TTTGTG
 
Protein sequence
MEAFGEFKRL FAICGKWRDE FAFRVGIGEG ASPAQRPSFY RVETLEADHR NDSFFVYKCS 
NPNLPTSSTR ISIREFWCFF PTLGGGPNGR KTHHNKTTHN RISWSFHEVS LCTTTYFYTK
AFSRTTPFQC PPFDPWPPPA CCLRTWVLEW EQRPFGEAHA SLVLGVVGMG ASQGLVARAE
ALSQKTPKRK LPVAAARFGA LYTVAVSCVL VWRGTWVGWD CLYERLHPHP DTKSTDPGHA
THSGMLSHVV SVTLLLATGL FASVLAPPAA VSVIRDWSIH SGSRAYSGPA QSVFNKLFPS
SSSSSSVQTA GGNGFSPSRA FLSTTSNRLS ARGQHPHPSS LLRTEGSRTS KVHRTTYTSS
TR