Gene PHATRDRAFT_41170 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_41170 
Symbol 
ID7199027 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011696 
Strand
Start bp126615 
End bp128295 
Gene Length1681 bp 
Protein Length531 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185130 
Protein GI219129931 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0989203 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGTTGC TTGATATGAT GGTAGCTTGT CAGCTGTTCT TTCAAATAGC CACTATCTCG 
GCATTCTCTT TTCGCAGGGT ATCTTTCGGA ACTATTTCAT CGCGCCGTAC CCAACACCAA
TCTCAGCTCT TCTCAAAGAC CGAGAGCAGT CGCAGCGCTC GTACTGAAGC AGCCACACGA
TCTCTTGAAA ATTTGCGAGA GAGGCAAATG GAAGAACTGG CAGAAACCGA TCGTCTTTTG
CAGCAAATCC GGCAAGTCGA GGTTAGTAGT CACAGCCCTA CCAATATATC GAGCACAAAC
AAAGCGGCTG CCTCCATTCT AGCCGGAGTC GATTACGGAT TTCAAAGTCG GAGTGAAGGC
GCAAGTTTTT CCGACCTCAA TGGCGGATCG CCTGCATTTG AAGGCTACGG ACCACCCTCG
AATTTGTGGA AACTGGGGAC GCAGCAGTTT ATGCGTAATC TCAACGCCAT GAAAGGTGAG
TACGCGGACG AGACCGACTT CGCTCTAACG GACTCGCAGA AAGAATTGCA CGCTCAGCTA
GACGCCTTGA CGCTCAACGC CACCGGAATT TGGGACAGAG AAATGCAGAA TGGGCCAATT
GAGGCCCCAT TCGTGATCAA GATACCGTAC TTTGGGCTCT GCTATATGCT TGATGAGGTA
TTTGATGGCA AGTACATTCC ATCGCGTTTC TTTTTATTGG AAACAGTTGC GCGCATGCCT
TATTTTTCGT ACATCACAAT GTTACACTTG TACGAAACTC TAGGTTTCTG GAGACGATCA
GCGGGCATGA AGAGGATACA TTTTGCGGAA GAACTCAACG AATTTCACCA CCTGCTTATA
ATGGAAAGTT TGGGGGGCGA CCAAGCTTGG TGGGTACGGT TTTTGGCTCA GCATTCAGCA
ATCGTATATT ACGTCGCACT ATGCCTTTTA TGGGGTATAT CACCATCACT TTCATATCGC
TTTTCGGAGC TGCTCGAAAC CCATGCCGTG AGCACCTACG GACAATTCTT GGACGAAAAC
GAGGAAGCTC TCAAAAAGCT GCCACCGCCA CTTCCCGCTA TCGAATACTA CGCATTTGGT
TCCTCCGATC CATTCTACGC GGAATTCCAA ACTACCGCCA TGTCCCAGGG TCAACCGGTA
AGGTCCGACT TTCATGACAA TCGCCTGCCA GGATTATACT TGCGTACTAG GGCTAACTGA
GCTTCGTCTT GTGCTTTCAT AGCTGCGGCG GCCTGGTGAG TCCATGCTGA GCTTGTACGA
GGTGTTCCAA GCCATTAAGG CAGATGAGCT GGATCACGTC AGCACCATGG AAGCATGTCT
CGATCCCGAA GCCAACACCC GATCCCCCTC CGTTGAAAAA CGCATCTTGC TCGGCCTAGC
GTTGATTTCA ATAGTTGGAT TTACGGCATC AAATCTAGGC GGGGAGGCTT CCTTGATAGA
TTCATTACCA GCAGATGTTG TCGGCGAAAC TTCAACGGGG GGAGCGGTTG ATGCCGTCGT
GGCTAGTATT AGTGCGGCAG CGGCGAAATT TTCACTCGAC GAAACTTCCC AAGGTGGCTT
GGGCAAAGCA GCGGTAGAGT TGGAAGAGCT AGGAGCCACC GGAGCTTTGC TTGAGGGGAG
TCGCCGTGCC GTAATTGGGG CGTTGCAAGC TGTCCTTCGG TTCATAGGTA TTCTTCTGTA
A
 
Protein sequence
MKLLDMMVAC QLFFQIATIS AFSFRRVSFG TISSRRTQHQ SQLFSKTESS RSARTEAATR 
SLENLRERQM EELAETDRLL QQIRQVEVSS HSPTNISSTN KAAASILAGV DYGFQSRSEG
ASFSDLNGGS PAFEGYGPPS NLWKLGTQQF MRNLNAMKGE YADETDFALT DSQKELHAQL
DALTLNATGI WDREMQNGPI EAPFVIKIPY FGLCYMLDEV FDGKYIPSRF FLLETVARMP
YFSYITMLHL YETLGFWRRS AGMKRIHFAE ELNEFHHLLI MESLGGDQAW WVRFLAQHSA
IVYYVALCLL WGISPSLSYR FSELLETHAV STYGQFLDEN EEALKKLPPP LPAIEYYAFG
SSDPFYAEFQ TTAMSQGQPL RRPGESMLSL YEVFQAIKAD ELDHVSTMEA CLDPEANTRS
PSVEKRILLG LALISIVGFT ASNLGGEASL IDSLPADVVG ETSTGGAVDA VVASISAAAA
KFSLDETSQG GLGKAAVELE ELGATGALLE GSRRAVIGAL QAVLRFIGIL L