Gene PHATRDRAFT_47726 details


Gene Information

Locus tag: PHATRDRAFT_47726
Symbol:
ID: 7202905
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011682
Strand:
Start bp: 652068
End bp: 654205
Gene Length: 2138 bp
Protein Length: 571 aa
Translation table:
GC content: 51%
IMG OID:
Product: predicted protein
Protein accession: XP_002181947
Protein GI: 219123262
COG category:
COG ID:
TIGRFAM ID:
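
The Gene Length value agrees with the Start bp and End bp fields if the coordinates are read as 1-based and inclusive. That convention is an assumption here, since the record does not state it. A minimal sketch of the arithmetic, using a hypothetical helper name `span_length`:

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    # Both end points are counted, hence the +1.
    return end_bp - start_bp + 1


# Start bp and End bp copied from the record above.
gene_length = span_length(652068, 654205)
print(gene_length)  # 2138
```

Under a 0-based, half-open convention the same coordinates would give 2137, so the reported 2138 bp is what suggests the inclusive reading.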


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.347863
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TAAAATCCAT CTTTCCTTTG AGCGACTGTG ACGACCTGTG GAGGGAGAGT GATTTTTGTC 
ACTGTGAAGT TGTCGGTACT CCGGCACTCG GCAGGTAAAA TACTCGCGGT GGGTGCTTTG
GTGCGTTCGG TGTAGGACGC ATCCTTCACT GTCACTGGGG GTATCGGCAA GAACCATACA
CCGTAGAGAG AGCGAGACGA AAAGGACACA CCGTAGCGTA GAACAGTCGG ACGAAACGTA
ACTCAGAAGA GTCGGTATTG GTACACACTC ACCATTGTTG GGAATTATGA GTGCGACGAC
TCCGCCAGTC GCTTCTTCGA AAGTGGACCT GTTCGAGTAC ACAGAAGAGC AACTATTGGA
AGAAAAGGTA CTCACACACT ACGAGATTCT CGGTATATCG ACCTTTTGTT CGCAAGATGG
TACGTTTTAC CGAATCACAC TTTGGATTGG CGTCACGCTC GGCGTACCTC TGAAGGATTT
ACTGGTCGTT CGCCTCGTCT CACCTCTTCC GTACCAACCC CGTGCTAACC TTCTTGATTC
CAATCGCCGC GACAGATGTC AAAAAGGCCT TTCGGCGCTC TTCACTCAAG TACCATCCCG
ACAAGCACGG CCACGATAAG GACTACGCAT TCCTGGCGCT CAAGCAGGCC CACGATACGC
TCTACGATCA CGAAAAACGC CAAGCCTACG ACTCCACAAC CTTACCCTTT GACGACGCCA
TCCCCCCACC GCGGGATAAG CTCCTGCAGG ACGATCTACT CCTCTACAAG GACAACGACT
TTTACGAGCT CTATCGACCC GTCTTTGAAC GCAATCTGCG GTTCGACGCA AACTTGCGAC
CGGACGCCGT CGGCAACGCC AAGAACGGGA ATCACAACGG CAAGAAGAAA AAAGCCGGCA
AGGCCAAGGC GCCCCCGACT TTGGGCGACG CCGACACCCC CATTGCCCAA GTACACGCCT
TTTACGAATA CTGGATTCAT TTCGAATCGT GGCGCGATTT TTCGGCCCAA GCCACGGACG
AACTTCAGGT GGAGAACGAA CTGGAAAATG CCGAATCGCG CTTTGAAAAA CGCTGGATCC
AGAAAGAAAT CGATAAGCGC GCCAAGCAGC TGAAAAAGAC AGAAATGAGT CGGATTCAAT
TGCTGGTGGA ACGGGCGATG GAAGCCGATC CGCGATTGCG GAAGTTTCGA CAGGAACAGT
TGGCCGCCAA GGAACAGGCC AAGCGCGAGC GACAGGAAAA GGCGGAACAA CAAAAGATAC
AAGCGCAATT AGAGCATGAA CGGCAACAAC AACAAGAAGT AGTGGACCGG CAGCGGCGAG
CGGAAGAAAA GGTGACGCGC GAACAACAAA AGAAACATAT ACGGAAAGCG CGACAGAGTC
TGCGAAAAAT GGCGTCGGCC TCTTTTGAAT CTCTCGAATC GGAACAGAAA TCGTCCATTG
TATGGGCGGA TACTTACGAC ATGAATCTGG ATGTCGAAGT GCTGTGTACA AATTTGGATT
TGACGGGGTT GCAATCGTTG GCTCAAGAAT TGGAAAATAT CACTTGTCCG AAAGAATCGT
TGACGATGAT TCACCAAGAA GTACTAGTGG CTAAGCAGCG GGAGACAGAC GGGGACTTTA
GCAATGGCGA ACAATCGTCT CCATCTCACA ACGGAACTTT TTCGACGAAA GAAACGACAA
CGTCGCCTGT TGTAACACCG GCGTTGAAGC CGAACCTCTG GACCAAGGAG GAATTGTCAG
CCCTGGCTAA GGCAGTCAAG AAGTATCCAC CCGGTGGTTC CTCACGATGG GAACAGATTG
CGTTGTTTGT GAATAATTTG TGCAAACAGG ACGAACCCCG ATCCAAGGAG GAATGTATCG
AGAAATACAA CAACGTGGCG AAGACGCACA GCAAACCAAC CGAAAGCACG AACGGCGTCG
CGGCAGCATC AGAACCCGAA AACTCTTCGC AATCCAACGA AGACGTGTGG ACGGCCGAAC
AGGATCAGCA GCTGCAAGAT GGACTAGCTG CGAATCCAGC GAGCATGGAC AAAAATGAAC
GGTGGACCGC AATTACAGAG TGTGTCCCGG GAAAATCCAA GAAGCAGTGC GTACAACGGT
TCAAAGTGAT TCGGGATGCC TTGAAAAAGA AGAAATAG
 
Protein sequence
MSATTPPVAS SKVDLFEYTE EQLLEEKVLT HYEILGISTF CSQDDVKKAF RRSSLKYHPD 
KHGHDKDYAF LALKQAHDTL YDHEKRQAYD STTLPFDDAI PPPRDKLLQD DLLLYKDNDF
YELYRPVFER NLRFDANLRP DAVGNAKNGN HNGKKKKAGK AKAPPTLGDA DTPIAQVHAF
YEYWIHFESW RDFSAQATDE LQVENELENA ESRFEKRWIQ KEIDKRAKQL KKTEMSRIQL
LVERAMEADP RLRKFRQEQL AAKEQAKRER QEKAEQQKIQ AQLEHERQQQ QEVVDRQRRA
EEKVTREQQK KHIRKARQSL RKMASASFES LESEQKSSIV WADTYDMNLD VEVLCTNLDL
TGLQSLAQEL ENITCPKESL TMIHQEVLVA KQRETDGDFS NGEQSSPSHN GTFSTKETTT
SPVVTPALKP NLWTKEELSA LAKAVKKYPP GGSSRWEQIA LFVNNLCKQD EPRSKEECIE
KYNNVAKTHS KPTESTNGVA AASEPENSSQ SNEDVWTAEQ DQQLQDGLAA NPASMDKNER
WTAITECVPG KSKKQCVQRF KVIRDALKKK K
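
Both sequences above are printed in space-separated blocks of ten characters, up to sixty characters per line. The following is a minimal sketch, assuming that layout, of joining such blocks into one continuous string and computing its length and GC fraction. The function names `join_blocks` and `gc_fraction` are hypothetical, and the example input is only the first line of the gene sequence, not the full record.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace between sequence blocks and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C characters in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence shown above (six blocks of ten bases).
first_line = "TAAAATCCAT CTTTCCTTTG AGCGACTGTG ACGACCTGTG GAGGGAGAGT GATTTTTGTC "
seq = join_blocks(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 3))
```

Pasting the full gene sequence into `join_blocks` should give a string whose length matches the Gene Length field, which is a quick way to check that no block was lost when copying.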