Gene PHATRDRAFT_45071 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_45071 
Symbol 
ID7200081 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011674 
Strand
Start bp112983 
End bp115972 
Gene Length2990 bp 
Protein Length853 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179363 
Protein GI219117137 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.706986 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAACCAT ATTATGAAGA AGAGGTCACG GAGGCTATAC TGCCAGGTTT GGAGTGGCGA 
GCGGAAAGCC GTGTCCGCCG TCCAATATTG GCCAGAGGAG GCATTATCGC TGACGAGGTA
AGCTCGACCG TTCTTCTTCT TCTTTCTCCA TTAGCGCACT CTTGAGCTTC TCTTACATGT
GCTTTTGTTC AGGTCGGCTA CGGAAAGACC GCTATCACTC TGGGACTTAT TGATTCTTCG
CAGAGGATAA ATGGCGATTG TCCGGAACCG CCGCCCGCTT ACCGTGACCG CTTGATTCAG
ACGAAAGCTA CTCTAATCAT CGTTCCGGAG CATCTCATGG GTAAGGACTT ATATCTGTTT
GATTAATTTG TATTTCTGGC AATTAGCTCA TTTTGCTAAA CCTTTCAACT AAACAGGACA
ATGGCCGGAA GAAGTACAAA AGTTCTTGGG TAGAAGCAAG AGAGTCATTG AGATCAAAAG
TATGGCTTCA ATGAACAAGA TTACAGTGGA AGACATTCAG AAAGCCGACA TTGTCGTTGC
GAGCTTTCAG ATACTGAGCA ACGAGACCTA CTTCTTGAAC CTTGCCGAAT TTAGTGGTGT
GAACGCGTCG GCGTTGCCGT CGAACAAGAA TGGCGGCCGG CATTTTGATT CCTTGTATAA
TGACTGCCTC GAGGGTCTCT TGGTGAGAGT GCCACATTTA ATTGGTGACA CGCGCAAGGT
TTTTTCAGAA ATACGTCAGG ATGCTGAATG TCACGAAAGT GCGAATGCGG AAGAAAACTT
TCGTGTGGAC GGAAAGAAGA GCGCCTATAA AAATGGCTCC AAATCGTCCA AGATGAAGGC
GCCCGAGACC AGCAAAATTG GCCGTGAACG TGATCCATGG GGGCTGTCCA CTTCAAAGGT
AAAGGCTTCC TACCGTAAGA TGTCCTGTCC ACCCTTGGAA TTATTCTTTT GGAACCGACT
TGTCGTTGAC GAGTAAGTCT TCAGCAGTGC GGTTTATTTT TCTAGCTACT GTCATTCTGA
CGTTTTCTTT CATTGCAGAT TCACGTACCT TGAAGACAAC AAGCGCCATC GGGCTTTGTC
GTTCATACTT GGCGTCAAGT CGTCGTATCG TTGGTTGCTT TCGGGAACAC CCAAGCATTC
TAACTTTGAC GACATTCAGA GCCTTTCTTC TCTTCTGGGC ATTCACCTGG GCATTGACGA
GAGCCTTCCC GGGGTCAAGA TGAATAGAAG TAGAGGTGTT GTCGCCGAAG AAACGACCGG
TCTCGAAAGT TTGTCTTTGT ACCTGGAAAT GCACTCCATG CAATGGCACG AACGTCGTCA
CGTGCAGGCG CAATCATTTC TGGACCAGTT CGTTCGACAA AACAAGGCCG AGCACGACGA
GATTCCGTGG GAGGAGCACT TAATTTTTGT AGATCTACCC CCTGTGGAGC GCGCTATTTA
TTTGGAACTC GAGACTCATT TGAAGAGTTT GGATATGAAC AAGAACGCAC AAAAGACAAA
GAGGAAGAGC ACTGGCGACC GCGACAATCG CATGCAACGG ATTCTTCAAG ATTCGGCGAG
TGCGGAAGAG GCCTTGCTAA AGTGCTGCAG CCACTTTAAC ATGTCGTCAG AGGCCGCCAC
CGCGCTGGAG ACGATTACCG ACATTATAAA GCTTCGAGAC ACGCAAAAGA AGGAATTGGA
ACGGGACATT GTCGTCTACC TTGCATCTGC TTTTCGCCAG CAGCATCGGA TTCTCCAGCA
TCAGTCGGAT TGGCTCCTCG TGTCTCGATC AGAAAAGGGT GAGGTCGCCA GTGCTCTGCA
ACAGTATCTG CGAGAGGTTG AGAAGCGTGA CAGTGTAACG CACGGGGCCG ACGACGAAGT
TCACGACTGT ATCTTGCAAC TTGTCCGACA AGCGGAAGAA GCCTTTCACG CAGATCCCTC
TCGGATCGAC TCTTTTTTCG ACGTGGACGA GGGTGACGAT CCACAGGAAG GATCCAGCCC
CAAAAGGCGT CGGGGTGCCT CCCCGCAGAA ATCGAAGAAA GAAGCGGCCG AAAAATTTAC
CGAGCGCCTG TTTGCAATGA AGATCCAACT CCGCGATCAT CTGCACCTTG TGCGATCTAT
GGGTAAGGAA CTGTGTGGCC GCGTACGCAG TCTCCGGTAC GTGCAATGGA TTCGCAAATT
CCAGGACGCC AGCATCAGGT TCACGTGCGG ACACTGTCGC GCCACGGGTC TCGAGAGTGA
CCAGGTCGGG GTGTTGAGTT CGTGCGGTCA CGTTGGTTGT TTGGGATGTT TACGGGTCGA
GTCGGCGGCG GAAAAGTGCG TCGAGTACCC GTCGTGCCGC GCGCGCGTGA GCAGCGCCCA
CGTTGTGTCG TCGCGTCATT TGGGGTTGCA CCGGGCCGAC TCGAGTGGCG GACGGTACGG
CCGCAAGTTG ACCGTCCTGG TGGAGAAAGT CCGGGAGATC ATTGCGATGG GCGACCGCAT
GATTGTCTTT TGTCAATTCG ACGACCTCAA GGAGAAGATC CGGCAAGTCC TGTTGGAGAA
CGGTGTGCCG TCGTTGGAAG TGGCGGGTTC GGTGCACCGC CAGATTGCGT CGTTACGGGT
CTTCCAGAAA GAAATCCCGG GCCCGACCGA TCCACGGGTG TTGGTATTGA AGATGGACGA
CGAGCAGAGC GCGGGCTTGA ACTTGACGCA TCTGAACCAC GCACTTTTCG TCCACCCGCT
ACTGGCGTTG TCGCGTGCGG AGTACGACGC GTACGAAACG CAAGCGATCG GACGCATCCG
GCGATTCGGT CAAACCAAGA CGGTGCACCT GCACCGGTTT CTCGCGCGGA ATACCATGGA
TATGGAAATT TGGGAGGAAC GGACCAAGCC TCGGGCGTAG TGTGAATCGT CGGTTGTACC
TGTATGCATG TGCGTGTGCG TGTGTGTATG TGGTGCGGAT AGAGGGAGGA ATGGATAAGG
AATTGGATCG GGAACGTAAA ATCCAATAAT GTGAGGGGTT TTGACTATGA
 
Protein sequence
MEPYYEEEVT EAILPGLEWR AESRVRRPIL ARGGIIADEV GYGKTAITLG LIDSSQRING 
DCPEPPPAYR DRLIQTKATL IIVPEHLMGQ WPEEVQKFLG RSKRVIEIKS MASMNKITVE
DIQKADIVVA SFQILSNETY FLNLAEFSGV NASALPSNKN GGRHFDSLYN DCLEGLLVRV
PHLIGDTRKV FSEIRQDAEC HESANAEENF RVDGKKSAYK NGSKSSKMKA PETSKIGRER
DPWGLSTSKV KASYRKMSCP PLELFFWNRL VVDEFTYLED NKRHRALSFI LGVKSSYRWL
LSGTPKHSNF DDIQSLSSLL GIHLGIDESL PGVKMNRSRG VVAEETTGLE SLSLYLEMHS
MQWHERRHVQ AQSFLDQFVR QNKAEHDEIP WEEHLIFVDL PPVERAIYLE LETHLKSLDM
NKNAQKTKRK STGDRDNRMQ RILQDSASAE EALLKCCSHF NMSSEAATAL ETITDIIKLR
DTQKKELERD IVVYLASAFR QQHRILQHQS DWLLVSRSEK GEVASALQQY LREVEKRDSV
THGADDEVHD CILQLVRQAE EAFHADPSRI DSFFDVDEGD DPQEGSSPKR RRGASPQKSK
KEAAEKFTER LFAMKIQLRD HLHLVRSMGK ELCGRVRSLR DQVGVLSSCG HVGCLGCLRV
ESAAEKCVEY PSCRARVSSA HVVSSRHLGL HRADSSGGRY GRKLTVLVEK VREIIAMGDR
MIVFCQFDDL KEKIRQVLLE NGVPSLEVAG SVHRQIASLR VFQKEIPGPT DPRVLVLKMD
DEQSAGLNLT HLNHALFVHP LLALSRAEYD AYETQAIGRI RRFGQTKTVH LHRFLARNTM
DMEIWEERTK PRA