Gene PHATRDRAFT_14032 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_14032 
Symbol 
ID7202536 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011681 
Strand
Start bp512673 
End bp514073 
Gene Length1401 bp 
Protein Length467 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181572 
Protein GI219122480 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTCAACCAGC TGTCATTGTC TCGTCCGCTG CTACGTGGTG TTGCCGCTAT GGGATTTGTA 
AAGCCGACAC CAATCCAAGC CGCCGTTATT CCGCTGGCTT TGGCGGGAAG GGATATCTGT
GCTTCCGCCG TCACAGGTTC AGGGAAAACA GCTGCTTTTC TCTTGCCGAT TCTGGAGCGC
CTGCTGCATC GTTATTCTGG CCGCACTAAG GCTATCATTT TGACACCTAC ACGCGAATTG
GCGGCTCAGT GTTTGGGCAT GCTGACGTCG TTTGCCCAGT TTACCAATCT GAGGGCGTCG
CTCATCGTTG GTGGTGCCAA AAACGTGAAC GCACAAGCGG CCGAGCTGCG GTCTCGTCCG
GACGTTATTG TTGCGACACC TGGTCGCCTT TTGGACCACA TCACTAACTC GGCTGGTGTA
ACGTTGGAAG ATATTGAGAT CTTGGTACTT GACGAAGCAG ATCGTTTGTT AGATTTAGGC
TTTCAGGATG AAGTACATGA ATTGGTCAAG GCATGTCCCG TACAGAGACA GACTCTTCTA
TTCTCGGCAA CTATGAATAC TAAGGTAGAT GACCTAATTC AGCTCAGTAT GAAACGCCCA
GTTCGAGTTC GAATAAGCGA CAAGGCGAAC AGTATGGACA TTGAAGTGGC GCCGCGCTTG
GAGCAAGAAT TCGTACGCGT TCGCGCTGGA AACGAAGGAG CTAACCGCGA GGGAATGCTG
TTGGCGTTGC TTACACGAAC ATTCAAGAAA CAAACAATTG TTTTCTTTGA TACGAAAGCC
GCCGCGCACC GTTTAATGAT CCTTTGTGGT TTGTGCGGGA TTAAATGCGC GGAGTTGCAC
GGCAATTTAT CCCAGCAACA GCGACTAACA GCGCTTGAAG AGTTTCGGAA AGGCGACGTC
GATGTTTTGT TAGCCACCGA CTTGGCGGCG CGAGGTTTAG ATATTGATCG CGTGAAGACC
GTAATCAATT TTGAAATGCC GTCACAGGTT GCTACCTACG TCCACCGCAT TGGACGTACC
GCTCGTGCAG GTCGAGGAGG TCGGAGTTGC ACGCTTATTG GCGAGGGGCG TCGACACTTG
ATGAAAGAAC TAATCAAGGA CGCGGAAGTC AAGAATAAAC GGCACACCAC AGGTGACACA
GCAAAAAGTT CGTTTGAATC AGGTGTGATT CGATCACGAA CAATTCCTCC CGCCGTCATG
GGTCACTTTG TTGCTAAAAT ACAGTCTTTG GAAACACATG TAGATGAGGT CTTCCAAGCC
GAAGCCATCG CAAAGATGGA CCGACTAGCA GAGATGGAAG TAATCAAAGC GCAAAATATC
ATTCAGCATT CTGACGAGAT TAAGGCACGC CCTCAACGTG AATGGTTCGC ATCCGAGAAG
CAGAAGAAAC TTACAAAGGA A
 
Protein sequence
FNQLSLSRPL LRGVAAMGFV KPTPIQAAVI PLALAGRDIC ASAVTGSGKT AAFLLPILER 
LLHRYSGRTK AIILTPTREL AAQCLGMLTS FAQFTNLRAS LIVGGAKNVN AQAAELRSRP
DVIVATPGRL LDHITNSAGV TLEDIEILVL DEADRLLDLG FQDEVHELVK ACPVQRQTLL
FSATMNTKVD DLIQLSMKRP VRVRISDKAN SMDIEVAPRL EQEFVRVRAG NEGANREGML
LALLTRTFKK QTIVFFDTKA AAHRLMILCG LCGIKCAELH GNLSQQQRLT ALEEFRKGDV
DVLLATDLAA RGLDIDRVKT VINFEMPSQV ATYVHRIGRT ARAGRGGRSC TLIGEGRRHL
MKELIKDAEV KNKRHTTGDT AKSSFESGVI RSRTIPPAVM GHFVAKIQSL ETHVDEVFQA
EAIAKMDRLA EMEVIKAQNI IQHSDEIKAR PQREWFASEK QKKLTKE