Gene PHATRDRAFT_49053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49053 
Symbol 
ID7195302 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011688 
Strand
Start bp438331 
End bp440705 
Gene Length2375 bp 
Protein Length688 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183609 
Protein GI219126742 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.377912 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATGGGA GGGGCTCTCC GCAAGAGATT GGTGTTGGCC TTGTTGGTAA AATTCTTTTC 
ATCACGGCCA TTGTCGTGGT CGTGTGTGTA TCTCTTACCC AACCTTGGTC GAAAATGAGT
GCATCTGCTG CATCCCGCAA GGTGGTGCAA TCAGTCGCCC ATCGGGTGGA AATTGATTGG
TAGGTAAAAC CAAAGGTTCC TTTCCTTGGC CGCGCCGCGG CGGACACTTC CGCCGCAATC
CTCCAATTCC GTAGCGTACG CACTTACCGC ATACCCTCGC TCACACCGTT CCCCATCTCT
CTACATATCA TCGTGCTGGT GTCCACATTA GGTCCCGCCA TGTCTGGAGT TGCTCTCCTC
AAGTAGCCGC CTCCTTCAAC AAGCTCAAAT CGTGGGTGGC GCTGTCGGAT TCCATGGCGG
AAAAGTACGC CACCTTGCCG ACGCCAATTG ATTTCGGCAA GGCCAAGAGT GCCGTTCGCG
ATAAAAGTCT GGTGGACTCG CTCGAAGCCT TCTACAAGTC CAATCAAGCG CCGGCGGAAA
CGCACGAGTG GGCGCCGGAA GAACAGGAGT TGACGGCCAA GAAAATCGCC TACCTACAGG
AACTCGACGC CATGCACCAA GAACTCTTGC CCGTACTGGA AGCCGAGTTG GCTTTTCAAA
AGAACAACCG AACAACCAGC GATACTACCG TCTTTGATAT GAAGGTGAAT TACCCCGTTA
TTCACGAAGA AATCGAGGAC GAAATTGAAC GTCGTGAATG GTTCAAGGAT ACCGGAATCG
GTCCCAACAA GTAGGCAGTG TAGCGTTACC TATAGTCGAA AACTAATTCT AGTTGGACTC
GTCCGAGACT TTTTCTAGTT GTCCGATCAC GATTGGGCCT GTGAGTTGTC CGCAACGGCG
CTCTCGCCGT TTGGAAACGC TGGGTACGAC TGGCTTCGCC CTGGAGGTGA ATCCGGAGCT
TGTTCATTAT TTATTTTTTT CCAACCGGTC TCAGAGTCAC ACCATTTCCA GTGATGGAAA
ATGGTTGGGA GCAGACACAA ACGCAATTCC AAAACCCCCA ATGTTGGGTG GTGCAGAGTT
TGCGGAAACT AAAAGGGCTC ATTTTATGCC AGAGTGGAAC CAATCGTAGC GTTCAAAACC
ACCATAATTT CACCATGGAT CCTTTACTTC TCGCTCGGAA ATTACGCCGA CGCCGCGGAC
GCTCCATTCT TGCCTCGCAG CTTTCCAATG CAGCCGTTAC CTTGTGTACG GGAGCCGTGG
TGGCAATGAT TTTAGCTGCC TTGCACGTGT CCCGACTAAA TACTGATCGT GGCGTTCCCA
ATGCTCCCGA AAGCTTGCCT CGGAGTCGTC GTCCCGGTGC ACCCATGGTG TTTGGTCAGC
GACAGCTCCG GGAAGCAAGG GTTGAGCATG CGCGGTCGGG AAACTCAGAC GCGTCTACAC
CGCTGGATTT TGTCGTGGCG GGCTTTCCCA AAACCGGTAC CACGACACTA TTATACGCCT
TTCGGGATCA CGAAGAAATG GACATTGCCA ATTCGGAACG CTGTAGTGTC GCGCACGTTC
CACTATCCGA GAGTCGGGCG CAGCAAGACT TGAACGCGGC CGTTGCCGAA CTCTCTCCGT
TGAGGAGCGT CAAACGCGGT ATCAAATGCC CCACGATGCT GAGCAACTCT GTATCGCTAT
CGCGGCTGCA GAGACATTCA CCGTCGGCCA AACTCATTGT TGGATTGCGT CACCCGACGG
AACTGTTGCA AAGCTTTTAT AACTACCGTG TAACGGAAGT TTACGACAAG CGGTCCCACC
AGTCAATACC CAGCTTTGAT GAGATCGTTC GGACCGGCAA GGCTTGGAAA GGCGTCTCCT
TGGAAGCGGT CCGTTTTGAC TTGTTTTTGA TGCAGCTCGG AAAGACCAAT ATGTCAACCA
TTGACCTTCA GCAATACGCC GGTCGCAAAT ACATGGGGGT GAAGCCTAGC GAAATGCAAG
TCTTTCTATA CACCCTTGAT CAAATTGAGG ACAGGGATCC GACGAACAGT TTGGTATTTC
GGAAAGGCCT TGAGAGCTAC CTTGGTCTGG AGACTCCGTT CCAGCCCTTT GGCCGTGAGA
ACACCAACCA CTTTGTCGGA AACAAAGCGT ATCCCGAAAC GCTCGATATT TGCCAACCCA
AGTACAACAA ACTGAGAAAA AAGTTAGCGG ACCAGGGACG TAAAACCGCT CGTTGGATTC
ATGAGAGGCT TTTGGAGAGT CCTGACGTCT TTGTCGGAAA CAAAGATGGC TTTCTTCAGT
CGCTCGAATC GTGGGGGACT GATATTTGTC ATCTCTCATC CATAGCGGAC GTAAAAGTTT
TGCCTCAGAG ACGGACTCTC AAGAAACCTA TTTAG
 
Protein sequence
MYGRGSPQEI GVGLVGKILF ITAIVVVVCV SLTQPWSKMS ASAASRKVVQ SVAHRVEIDW 
SRHVWSCSPQ VAASFNKLKS WVALSDSMAE KYATLPTPID FGKAKSAVRD KSLVDSLEAF
YKSNQAPAET HEWAPEEQEL TAKKIAYLQE LDAMHQELLP VLEAELAFQK NNRTTSDTTV
FDMKVNYPVI HEEIEDEIER REWFKDTGIG PNNCPITIGP VSCPQRRSRR LETLVMENGW
EQTQTQFQNP QCWVVQSLRK LKGLILCQSG TNRSVQNHHN FTMDPLLLAR KLRRRRGRSI
LASQLSNAAV TLCTGAVVAM ILAALHVSRL NTDRGVPNAP ESLPRSRRPG APMVFGQRQL
REARVEHARS GNSDASTPLD FVVAGFPKTG TTTLLYAFRD HEEMDIANSE RCSVAHVPLS
ESRAQQDLNA AVAELSPLRS VKRGIKCPTM LSNSVSLSRL QRHSPSAKLI VGLRHPTELL
QSFYNYRVTE VYDKRSHQSI PSFDEIVRTG KAWKGVSLEA VRFDLFLMQL GKTNMSTIDL
QQYAGRKYMG VKPSEMQVFL YTLDQIEDRD PTNSLVFRKG LESYLGLETP FQPFGRENTN
HFVGNKAYPE TLDICQPKYN KLRKKLADQG RKTARWIHER LLESPDVFVG NKDGFLQSLE
SWGTDICHLS SIADVKVLPQ RRTLKKPI