Gene PHATRDRAFT_39363 details


Gene Information

Locus tag: PHATRDRAFT_39363
Symbol:
ID: 7195113
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011687
Strand:
Start bp: 289876
End bp: 291263
Gene Length: 1388 bp
Protein Length: 443 aa
Translation table:
GC content: 52%
IMG OID:
Product: predicted protein
Protein accession: XP_002183337
Protein GI: 219126172
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000140732
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTGCTG ACGGTGCTAG TGCGTTTCGT GGTGCGCTCG GGCGGATCGG ATGGTCTGTT 
CCTGCGGCGA ACGCGTTTAC GAACGAAGGT TTTGATGCGA TGGACTCCCT TGGCTTGGTT
ACTTGTGACC GTCTCAAGGA TATCTGCAAG ATCATTCATC GCGGTACCGA TGGTGTGGCC
GCAGTGCCAG CTGCTGGTGG AAACGCTGCG GTGGCGGTGG CGGCGGCGCC TGGCATCCCT
GGGATAGTGA TCCCCATGAT GTGGGAGTAC AAGCTAAGCG GAATGCATCT CTGGGTGTCT
GAGCGTCTCC GACAGGGAAC TCCGGTTGTT GCGGAGGACT TTACTGCGGC TATCGGAAAC
CTGTACACCA GGAAGGTGCG TGAACTAGAA GAAGCGAAGG ATGAGGAGGG TGTTCAAGTC
AAGCCTCCGG CTCCGTTCTC GAAGGAAACG AAGTGGATTC CGTTCTTCAA GTTGTTGGTC
AACTATTTGA GTTCTGTGAC GGGTGTTAAC AAAGTGCCAT TGGATTATGT CATCCGGAAA
GATGACGACA TTGCTCCACC TGATACCGAG TTTGAAACAG AGCACGAGAA GTTGGTGTTG
TCCACTCCTC ATACGGGGAC GGCTTTGACA AAGATAACGG AAAAGTTTAG ATTCAAGTGA
AGCAGTTGAC TGTAAACGGT CCGGCGTGGA CTTATGTGGC GCCTTTCGAG AAGAAACGCG
ATGGTTGCGG CGCAGTCAAG GCTTTGAAGA GCCATTATGA AGGTGATGCG GTAATGTCCA
AGTCCAAGGC GGTTGCGTTT GACGTGCTCA AGCACACTAT CTACACTGGT GAGCGTGGTA
ACTTCGGGAT GGAAAAGTAC ACAAATGCTT TGTCAACGGC GTTCCAGACT CTCGACGAGT
ATGGAGAGAC CTTGACGGAG TCCAGGAAGG TGGATGTCTT CTTGCGCAAC AATCATTGCA
CGGATCCTAA GATGCTCTCG GGAATTGCTA TAATTCAAGG CGACGCGGAC CGCATGTCTA
ACTTTGCAAA GGCGGCCGAT TATTTGGCCT TGTTTACTAA CACTGATACC TCTCAGAAGA
CAGGCCGTTC AATCTCGAGT GCTCAAAGGT CTACTAACAA GAAGAAGCCG GCTATTCGAG
CGGGTAATTA TACTCCAAAT GAATGGCATC AGCTCTCGGA CAAAGAAAAG GACAAAGTTC
GTGCAAAGCG AGCGGCTGCC AAGGCCTCTC GCAACAAGAG CAAGCGCTCG GCCGCAGCGG
TTAGTCGTTC GAGCGAGAAA CCTGACAAGG GGAGCACGGA TAAGGCAACC AATGCGGGTG
ATCAGTTTGC TCTCTTGAAC AAGAAAAAGA AACGAAAGGT TGGCTTCGAA GGTGAAACGA
GTGATTGA
 
Protein sequence
MAADGASAFR GALGRIGWSV PAANAFTNEG FDAMDSLGLV TCDRLKDICK IIHRGTDGVA 
AVPAAGGNAA VAVAAAPGIP GIVIPMMWEY KLSGMHLWVS ERLRQGTPVV AEDFTAAIGN
LYTRKVRELE EAKDEEGVQV KPPAPFSKET KWIPFFKLLV NYLSSVTGVN KVPLDYVIRK
DDDIAPPDTE FETEHEKLIQ VKQLTVNGPA WTYVAPFEKK RDGCGAVKAL KSHYEGDAVM
SKSKAVAFDV LKHTIYTGER GNFGMEKYTN ALSTAFQTLD EYGETLTESR KVDVFLRNNH
CTDPKMLSGI AIIQGDADRM SNFAKAADYL ALFTNTDTSQ KTGRSISSAQ RSTNKKKPAI
RAGNYTPNEW HQLSDKEKDK VRAKRAAAKA SRNKSKRSAA AVSRSSEKPD KGSTDKATNA
GDQFALLNKK KKRKVGFEGE TSD
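
The sequence blocks above are printed in space-separated groups of 10 characters, 60 per line, so they need to be joined before any downstream use. The following is a minimal Python sketch of one way to do that and to sanity-check the record's length and GC fields. The helper names (`clean_sequence`, `gc_percent`, `span_length`) are hypothetical and not part of any database tool, and the length helper assumes the start and end coordinates are 1-based and inclusive, which is consistent with the reported gene length of 1388 bp.

```python
def clean_sequence(block: str) -> str:
    """Join a sequence printed in whitespace-separated groups into one uppercase string."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C characters in seq; returns 0.0 for an empty string."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate range, ASSUMING 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


# First line of the gene sequence in this record, used here as a short excerpt.
excerpt = "ATGGCTGCTG ACGGTGCTAG TGCGTTTCGT GGTGCGCTCG GGCGGATCGG ATGGTCTGTT"
seq = clean_sequence(excerpt)

print(len(seq))                     # 60
print(span_length(289876, 291263))  # 1388
```

Passing the full gene sequence block to `clean_sequence` and then to `gc_percent` gives a value that can be compared against the GC content field, and the length of the cleaned protein block can be compared against the protein length field in the same way.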