Gene PHATRDRAFT_37131 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_37131 
Symbol 
ID7202117 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011680 
Strand
Start bp231085 
End bp232951 
Gene Length1867 bp 
Protein Length577 aa 
Translation table 
GC content53% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181332 
Protein GI219121977 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTCCCG GCATTATCCT TCTGCGGGAA GGAACCGATA CATCCCAGGT AAGAAGTTTG 
TCGGTTTCGC TGTCGGATTC ACAATACGTC ATATCACAGC ACACCATCCC GTACTCGTGG
ACTTTTCAAC CTACTCACAT TTCCTCTGCA TTTACATTGC TTGAATATAT ATATCTAACA
GGGAACACCA CAGCTCATTT CGAACATTAA TGCCTGCCAG GCAGTGGCGG ATACGGTGCG
GACGACGTTG GGCCCTTCGG GGCGGGACAA GCTCATCGCG ACGGGTCGTC ACGTGACCAT
CAGCAACGAC GGCGCCACGA TTATGAAACT CCTCGAGATT GAACATCCCG CCGCCAAGAC
ACTCGTGGAC ATTTCCATGA GTCAGGACGC CGAAGTCGGC GACGGTACCA CCAGCGTCGT
CCTCCTCGCC GCCGAAATAC TCGCCAAAAT GAAACCCTTC GTCGAAGAAG GCGTCCATCC
GCAAATCCTC CAACGCAATA TACGCAACGC GGGGAAAATG GCCGTGGAGA AAGTACAGGA
ACTCGCCGTA CCGTTTCAGG GAGACGAGCT GGAGGATATG CTGCTCAAAA CGGCACGTAC
CGCTCTCAAT TCCAAGCTAA TTGCCAACCA TAAAGATCTC TTTGCACCAA TGGTAGTGGA
AGCGGTACAA GCATTGCATC AAGGAGGCTC TCTGGACGAT CTATCAAGCT TGGTCGCTAT
CAAACAAATA TCCGGTGGGG ATGTCCGGCA ATCCTTTCTC GTCAACGGCG TGGCTTTCAA
AAAAACCTTT TCCTACGCTG GTTTTGAACA AATGACCAAA CAATTCACTA ATCCAGGGAT
TCTATTGCTC AACGTCGAAT TGGAACTCAA GTCCGAAAAA GAAAACGCCG AGGTACGCAT
CACAGACCCA TCCCAGTATC AGAGCATCGT CGACGCCGAA TGGAAGGTAA TCTACGACAA
GTTGGACGCC TGTGTAGATT CCGGTGCCCA GATCGTCCTC AGTAAATTGC CCATTGGTGA
TTTAGCGACG CAATACTTTG CCGATCGGGG ACTCTTTTGT GCGGGCCGTG TGACAGATGG
AGACCTGAAG CGTGTGGCCA AGGCCACGGG TGGTAGCGTC CAAACCAGCA CTCACGGTAT
CACCAAGGAC ATGTTGGGGA CGTGTGGGGT CTTTGAGGAG CGCCAGGTTG GTGACGAGCG
CTTCAACGTC TTTACAGACT GCCCCCAAAA GCTAACGTCC ACCATTGTTT TGCGCGGAGG
AACCGAACAA TTCATTGCCG AGTCCGAACG GAGTGTGCAC GACGCCTTGA TGGTCGTCAA
GCGATCGCTC CAGTCGGGAT CGGTCGTAGC CGGTGGCGGT GCCGTCGAAA TGGAAGTCTC
GCGTTGTCTG CGCGAGCACG CTCTGACCAT TGAAGGAAAA GGACAGCTTA TCATTACAGC
CTACGCCAAG GCGTTGGAAG TCATTCCTCG TCAATTGTGC GAGAATGCGG GGTACGACTC
AACCGATATT CTGGCTGCAT TGCGAAGAAA ACATGCCGTC GACGCGGACG GAAAGTGGTA
CGGAGTCGAT GTCATTAACG GTCATATTTG CGATACTTTT GATTTGGGCG TATGGGAACC
GAGCGACAAC AAGGTAAATT CGTTCGACGC TGCCACGGAA GCAGCGTGTG TGATTCTGTC
CATTGACGAA ACTGTCATGG CGCCCAAGTC ACAGGACCCC AACGCTCACC ATACGGGTCA
AATGGACCAG GGTAATAAAC CAATGAGTAA TATGATGGGA GGCGCCATGC AGGCCGCCCA
AGGAGGCGCT CGGTCGGGTC AACTTGGGCC CGGAGTCAGC TACATGAAAG GCCGGGGAGG
CGGTTGA
 
Protein sequence
MRPGIILLRE GTDTSQGTPQ LISNINACQA VADTVRTTLG PSGRDKLIAT GRHVTISNDG 
ATIMKLLEIE HPAAKTLVDI SMSQDAEVGD GTTSVVLLAA EILAKMKPFV EEGVHPQILQ
RNIRNAGKMA VEKVQELAVP FQGDELEDML LKTARTALNS KLIANHKDLF APMVVEAVQA
LHQGGSLDDL SSLVAIKQIS GGDVRQSFLV NGVAFKKTFS YAGFEQMTKQ FTNPGILLLN
VELELKSEKE NAEVRITDPS QYQSIVDAEW KVIYDKLDAC VDSGAQIVLS KLPIGDLATQ
YFADRGLFCA GRVTDGDLKR VAKATGGSVQ TSTHGITKDM LGTCGVFEER QVGDERFNVF
TDCPQKLTST IVLRGGTEQF IAESERSVHD ALMVVKRSLQ SGSVVAGGGA VEMEVSRCLR
EHALTIEGKG QLIITAYAKA LEVIPRQLCE NAGYDSTDIL AALRRKHAVD ADGKWYGVDV
INGHICDTFD LGVWEPSDNK VNSFDAATEA ACVILSIDET VMAPKSQDPN AHHTGQMDQG
NKPMSNMMGG AMQAAQGGAR SGQLGPGVSY MKGRGGG