Gene PHATRDRAFT_16891 details


Gene Information

Locus tag: PHATRDRAFT_16891
Symbol:
ID: 7199152
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011697
Strand:
Start bp: 186197
End bp: 187546
Gene Length: 1350 bp
Protein Length: 450 aa
Translation table:
GC content: 52%
IMG OID:
Product: predicted protein
Protein accession: XP_002185337
Protein GI: 219130364
COG category:
COG ID:
TIGRFAM ID:
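
The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, which the record does not state; the constant and function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 186197
END_BP = 187546
GENE_LENGTH_BP = 1350
PROTEIN_LENGTH_AA = 450


def span_length(start: int, end: int) -> int:
    """Length of an interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def codon_count(length_bp: int) -> int:
    """Number of whole codons in a coding sequence of the given length."""
    if length_bp % 3 != 0:
        raise ValueError("length is not a multiple of 3")
    return length_bp // 3


computed_length = span_length(START_BP, END_BP)
computed_codons = codon_count(computed_length)

print(computed_length == GENE_LENGTH_BP)      # coordinates agree with Gene Length
print(computed_codons == PROTEIN_LENGTH_AA)   # codon count agrees with Protein Length
```

Under that assumption, the 1350 bp span yields exactly 450 codons, matching the 450 aa protein, which suggests the listed coordinates do not include a stop codon.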


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGACGG GGGGTATGAG TCTCTCCAAA GAATTCTTCG AACTCCTCAA GGCCATCGGC 
GAGTCCAAAT CCAAGCAGGA AGAAGACCGG ATCGTTCAGA AAGAAGTGAC GCGCTTGAAG
AGCAAACTCG AAAACACACC GGGGAATCCT TACCACTCCA ATACGTTGCT CACCAGCAAG
AAGCGCGCCA AGGAGTTCCT GGTGCGACTT TTGTACGTGG AAATGCTCGG TCACGACGGA
TCCTTTGGAT ACATCAAGGC CGTCGAAATG GCCGCCTCGG CCTCGCTTTT TCACAAGCGT
ACCGGCTATT TGGTCTGTGG CGCCTGTCTC CCGCCCTCGC ACGAATTCCG TTTCATGCTC
GTCAACCAAA TGCAACGCGA TCTACAGTCC ACCAACGTAC TTGAATGCAG CGGTGGTCTC
CTCGCCTGTA CCAACCTTAT TACGGCTGAT ATGGTCCCCG CCGTCGCCAA CGAAGTTAGT
AAACTGCTGC AGCACGATTC AGCGACCATT CGCAAAAAGG CGATTCTCTG TCTGCATCGA
TGTCACCAAC TCGCGGATGA CGTTGTTACC AGCGAATCTC TGCACGAATC GCTACGGAAA
CTTGTTTGTG ATAAGGACCC TTCCGTGATG GGGAGTTCGC TGAATGTCAT TGAGGCCTTG
TCTCTCACGA ATACCGCGCC TTTCAAAGAC CTGGTCCCCT CCCTCGTTTC CATTCTCAAA
CAGATTTGTG AACACCGGTT GCCTTCCGAG TTTGACTACC ACCGTGTCCC GGCGCCGTGG
ATGCAACTTA AACTCGTACG CATTCTGGGT CTCCTCGGCA AGGCCGACAT GCCCGCGAGC
AAGGGGATGT ACGAAATTCT ACACGAAACG CTGCGCAAGG CCGATACCGG GATCAATGCG
GGATACGCGA TTGTTTACGA ATGCGTTATT ACCATTATTG CCATTTATCC CAACGCCAAC
CTGTTGGACG CCGCAGCCGA AGCCATTGCT CGCTTCATGC AGTCTCGATC GCACAATCTC
AAGTACCTAG GAGTTACCGG ATTAGCCATG ATTGTGGAAC AGCATCCACA GTACGCGGCG
CAGCATCAGT TGGCCGTGAT GGATTGCTTG GAAGATGACG ACGAAACGCT ACAGCGAAAG
ACGCTCGATC TATTGTACCG CATGACGAAC GTAGTTAATG TGGAATTTAT CGCCGAAAAG
CTGGTGGAAT TCTTACGCCA CACGACCGAT TTATTCCTCA AACAGACCTT GACGACCCGT
GTTTGTTCCA TTGCCGAGCG CTACGCCCCC AACAACGCCT GGTATATTCG TACCATTACC
TCTCTGTTGG AAGTATCTGG AGACATGGTT
 
Protein sequence
MATGGMSLSK EFFELLKAIG ESKSKQEEDR IVQKEVTRLK SKLENTPGNP YHSNTLLTSK 
KRAKEFLVRL LYVEMLGHDG SFGYIKAVEM AASASLFHKR TGYLVCGACL PPSHEFRFML
VNQMQRDLQS TNVLECSGGL LACTNLITAD MVPAVANEVS KLLQHDSATI RKKAILCLHR
CHQLADDVVT SESLHESLRK LVCDKDPSVM GSSLNVIEAL SLTNTAPFKD LVPSLVSILK
QICEHRLPSE FDYHRVPAPW MQLKLVRILG LLGKADMPAS KGMYEILHET LRKADTGINA
GYAIVYECVI TIIAIYPNAN LLDAAAEAIA RFMQSRSHNL KYLGVTGLAM IVEQHPQYAA
QHQLAVMDCL EDDDETLQRK TLDLLYRMTN VVNVEFIAEK LVEFLRHTTD LFLKQTLTTR
VCSIAERYAP NNAWYIRTIT SLLEVSGDMV
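
The gene and protein sequences above can be compared by translating the DNA in frame. The following is a minimal sketch that assumes the standard genetic code, since the Translation table field in the record is blank; the helper names (`clean`, `translate`, `gc_fraction`) are illustrative and not part of any database tool. It checks only the first and last 30-base rows of the gene sequence against the corresponding ends of the protein.

```python
from itertools import product

# Standard genetic code in TCAG order (an ASSUMPTION: the record does not
# state which translation table applies).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame DNA string codon by codon."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last rows of the gene sequence, copied from the listing above.
first_row = "ATGGCGACGG GGGGTATGAG TCTCTCCAAA"
last_row = "TCTCTGTTGG AAGTATCTGG AGACATGGTT"

print(translate(first_row))  # MATGGMSLSK
print(translate(last_row))   # SLLEVSGDMV
```

Both fragments agree with the start (`MATGGMSLSK`) and end (`SLLEVSGDMV`) of the listed protein sequence. Passing the full cleaned gene sequence to `translate` and `gc_fraction` would extend the same check to the whole record, including the reported GC content.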