Gene PHATRDRAFT_33840 details

Gene Information

Locus tag: PHATRDRAFT_33840
Symbol:
ID: 7197874
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011672
Strand:
Start bp: 471180
End bp: 472382
Gene Length: 1203 bp
Protein Length: 400 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002178231
Protein GI: 219114871
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAGACAT CGACTATAAG CGCAACGTCG ACCTTAGTCA AGGGTGCAAA TACTCAAGTC 
TCTAACCTAC GAAAGATTGC TCCGCAAGGG ACCGCTCGTA ATTCCTCCGC CATGAAGGCC
CCGGCTGCGA TGAGAAAGCG TACGAAACGA CAAAGAAGAG GTGCGAGAAA GCCCACAGAT
ATGCCTCGAC GGCCACTCAG CGCCTACAAT CTCTTCTTCA AAGAGCATCG ATCAGTCATT
CTTGCTGAGC TGGAGAGCAG GGAAGATAAA GATAATTCTG GGCAAGGGAA GAAGGCCTCG
ACAGCTAGTC TTTTCTCGAC TATGGGAAAG GCGATTGCGA AGCGATGGAA AGAGCTTCCG
GAGGAAAATT TGACTCGATT GAAGAATTTG GCCAACGAAG ATATGAACCG ATACCGCAAG
GAAATGAACG AATATCACCG GAAACTCGCG CAAAAAGCCC GTCTCGAAAC AAAGCCTTTG
GACGATAAGA CTGAAATAGG AAAAGAAAGT GACAAGCTGC CCAATCCCGA GCAAGTGCAA
GCTAAGAATG CTAACTCTAC TATGGAAGGA GCAGTCCGTC CTGTTCCCGC ACACACCATG
CAGTATCTCT CTTCGTTTGG TGACACGATC CCTTGGCTGA ATACACGGCA GCTGCTATTG
ATGCAACGGA ATCAGTTTTT CCGGAACCTT CCGCAAGTCA GTATCGGACA AGCGGGGATA
GCCATGGGCG ACAGAGACCC TTTCGAGCAA CTACTGTGTG AACAAATAAT TCGAGCTCAA
CTTCACAGTC AACAGCAGAC TCAGACAACT AGACTAGCCC ATTTTCGCTT TTTAGATCAT
GAGGAAGCGC TACTTCAAGG CAGTGTCTCC GGTTCGGGCC GATTTCATGA AGCGAATGAC
TACGCAGTCG GAAACAATGC AATGTTAGGA GTCGGCTATC CGGGTGCAAT CACCTCTCGT
ATTTATGGAG CGGATGGTTT CCTACATCAA GGCTATGTGA CTCTTGGTCA GCATAAGACT
GGCAGACAGT TCTTAAGTTA TACTGGACGA AGACAAGGAC AATACCAGCA ACTTTTAGCT
CAGCAGCAAG TCGAGCGAAA CCTCGAGCAA TATTTGGCTA CCGGATCAAG CCCAACGTCC
TTTGGATTGG TTGATTCGAA TAGATCAAGA GGAAATCATC CCTACAGGTC ATCATTACCT
TAA
 
Protein sequence
METSTISATS TLVKGANTQV SNLRKIAPQG TARNSSAMKA PAAMRKRTKR QRRGARKPTD 
MPRRPLSAYN LFFKEHRSVI LAELESREDK DNSGQGKKAS TASLFSTMGK AIAKRWKELP
EENLTRLKNL ANEDMNRYRK EMNEYHRKLA QKARLETKPL DDKTEIGKES DKLPNPEQVQ
AKNANSTMEG AVRPVPAHTM QYLSSFGDTI PWLNTRQLLL MQRNQFFRNL PQVSIGQAGI
AMGDRDPFEQ LLCEQIIRAQ LHSQQQTQTT RLAHFRFLDH EEALLQGSVS GSGRFHEAND
YAVGNNAMLG VGYPGAITSR IYGADGFLHQ GYVTLGQHKT GRQFLSYTGR RQGQYQQLLA
QQQVERNLEQ YLATGSSPTS FGLVDSNRSR GNHPYRSSLP
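
Consistency check

The coordinates, gene length, and protein length in this record can be cross-checked with a few lines of Python. This is only a sketch: the helper names (clean_sequence, gene_length, expected_protein_length) are hypothetical and not part of any database tool, and the protein-length check assumes an unspliced CDS whose final codon is a stop codon, as the record indicates (not spliced, gene sequence ending in TAA).

```python
def clean_sequence(block: str) -> str:
    """Remove the spaces and line breaks used to lay out a sequence in 10-character groups."""
    return "".join(block.split())


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive, 1-based start and end coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# First line of the gene sequence block, copied from the record.
first_line = "ATGGAGACAT CGACTATAAG CGCAACGTCG ACCTTAGTCA AGGGTGCAAA TACTCAAGTC"

print(len(clean_sequence(first_line)))    # 60
print(gene_length(471180, 472382))        # 1203
print(expected_protein_length(1203))      # 400
```

Both computed values agree with the Gene Length (1203 bp) and Protein Length (400 aa) fields listed under Gene Information.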