Gene PHATRDRAFT_31657 details

Gene Information

Locus tag: PHATRDRAFT_31657
Symbol:
ID: 7196299
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011669
Strand:
Start bp: 635791
End bp: 636990
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002177128
Protein GI: 219110753
COG category:
COG ID:
TIGRFAM ID:
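
The listed lengths follow from the coordinates. The sketch below is a minimal check, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon of the unspliced CDS is a stop codon (the gene sequence in this record ends in TAA). The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length, has_stop_codon=True):
    """Protein length for an unspliced CDS; the stop codon codes for no residue."""
    codons, remainder = divmod(gene_length, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(635791, 636990)
print(length)                     # 1200
print(protein_length_aa(length))  # 399
```

Both values agree with the Gene Length and Protein Length fields above.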


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACACCAC CGAGAAGCAA CAGGTCAAGG AAAAAAGAGA CTCGACGTAA AGCATTTCTG 
CATACAGTTG TTGCTACAGC TACTGTCGGA ATCGTTCTAT CGACGTTGAT TCTGGTTTGT
ATAACCAACA GGGCTACCGA GGGAGCTTCC ACGCCTACTC GTTTAAGACA TGACATGCTT
AATATTCAGC TATCTCGGCA AAATGTCGTT TATCCAAACG CTTCAGCAAC TCATTTCGGG
TACAAGATCT CCCCTCGGAT GCAATTCGAC GTAAGAGTCA TTTCCAACCG GGTAGAAGCC
ATTTCCGCTC AAGGAGAAAC CCAACGAATA CCGACGTTAC AGTCCCCGGT CGGTGGTGCC
TTTGTGCATT TAGGCAAAAC TGGCGGGAGC GCTTTGTCAT CGCTGTTGCG AAATGGGTGT
CACTCGTGGT CACCTCACCC CTGTCGAAAC GTAACAGACG AAACAATGGC ATCTCGACTA
ATCACAAGCT ACTACCATGT TCCTGATTTC GGCTTACTAC GTCAGTCGAA TCACGATTTC
TATTTCATTA CACTTCGGGA TCCTTTTGAC CGTGTTATAT CCGCTTTTGT TTTCGAACAT
ATTATCAACA GAAGGGCCCG AGGTGAACCT GTGGAAAACC CAGTTCTGCG AGACAAGCTA
GAGCGAGCAT ACCAATGCTT CCCAAGTTTG GAAGCTTATG TTGGCTTTCT AAAGGGGGAC
TCACTCGATT TCTCGTATCC GTATCACCAA GCAGTCATCG TCGATAAATC TTGCAAAAAT
CTCGCTCGGG CCGCCTTGCA CGGCAAAGTT CGGCCGTTCA ACCACTTCTT TTTCAACTAC
CGTCGGATAT TTTCGTTTTT GCCAAAACCA GAGTCACAAA TCTTTCTAGT CACTCGGCAA
GAGAATTTGT GGGAAGATTG GGAGCGGGCC AATGTACTAC TGGGTCAGGA GGAACCAGTA
ATAATTCCGG CGGACTTGGA TTTCCGGGCT GTCCGGAATA CCACGACCTT GACGCTGCCT
GTTACGAGAG GTTTGAGTGC TGATGGAAGG CAGACCCTGT GCACTGCATT GGAGCATGAA
TATTACGTCT TTTTCTGGAT TCTCAGGCGC GCCAGAAATA TCCAGTCCGA GCATTTGCAG
CAATCTATCG AGAAGGCCAA GGTGAATTGC CCCAAACTGC CATATACTAA CTTCGTCTAA
 
Protein sequence
MTPPRSNRSR KKETRRKAFL HTVVATATVG IVLSTLILVC ITNRATEGAS TPTRLRHDML 
NIQLSRQNVV YPNASATHFG YKISPRMQFD VRVISNRVEA ISAQGETQRI PTLQSPVGGA
FVHLGKTGGS ALSSLLRNGC HSWSPHPCRN VTDETMASRL ITSYYHVPDF GLLRQSNHDF
YFITLRDPFD RVISAFVFEH IINRRARGEP VENPVLRDKL ERAYQCFPSL EAYVGFLKGD
SLDFSYPYHQ AVIVDKSCKN LARAALHGKV RPFNHFFFNY RRIFSFLPKP ESQIFLVTRQ
ENLWEDWERA NVLLGQEEPV IIPADLDFRA VRNTTTLTLP VTRGLSADGR QTLCTALEHE
YYVFFWILRR ARNIQSEHLQ QSIEKAKVNC PKLPYTNFV
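
The gene and protein sequences above are printed in blocks of ten characters. The following is a minimal sketch of how the gene sequence could be cleaned, translated, and checked against the protein sequence. It assumes the standard genetic code (NCBI translation table 1), since the Translation table field in this record is empty. The helper names are hypothetical.

```python
# Standard genetic code (NCBI table 1), an assumption for this record.
# Amino acids are listed in codon order with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in a cleaned sequence."""
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence shown above.
first_codons = clean_sequence("ATGACACCAC CGAGAAGCAA CAGGTCAAGG")
print(translate(first_codons))  # MTPPRSNRSR
```

The output matches the first ten residues of the protein sequence. Passing the full 1200 bp sequence through `clean_sequence` and `gc_percent` is one way to compare against the GC content field.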