Gene PHATRDRAFT_47293 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47293 
Symbol 
ID7202328 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011681 
Strand
Start bp210541 
End bp212283 
Gene Length1743 bp 
Protein Length580 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181508 
Protein GI219122347 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGACT TGGTAACCTT TTCGCTTTTT ATCTTGCTCT TTTGCACTTG CTCCGACTCG 
CTCGCGGTCT ACGGACGGTC CGCTGCGACG ACTTCGCGCA CAGCTCCCCC AATCGTTTAC
ACGATTGCCG GATCAGATTC CGGCGGTGGC GCAGGAATTC AGGCGGACCT ACACGCCATT
CATTCCTTTG GGTGTCACGG ATGTTCCGCT ATCACCTGCT TGACTGCGCA AAACTCGGTG
GGCGTCATTG GTGTGCACGC CCCTCCACCG GATTTTCTCC GAGCCCAGCT GGAAACTTTG
TTGGAAGACT TGCCACCGCA GGCTATCAAA ATCGGAATGC TCGGAACGAA AGAGCTTGCA
ATCGAAGTGG GAGCCTTTCT GAAGAAGTTG AAGGCTTTAG ATCGGAAAGT TTGGGTTGTT
TTAGATCCGG TCATGATCAC GACGTCGGGA CACCGGTTAA TTGAAGAAGA CGCACAGGAG
GCCATGGTCA AGCACGTTTT CCCGCATATT GACGTTTTGA CGCCAAACAA GTTTGAGGCT
GAAGCTTTGT TGAATCGCAC CCTTGAGACA ATCAGCGATG TAGAAGAGGG CGCAAAGGAC
TTGATTGCAC TCGGGGCCCC ATCCGTTTTG ATCAAAGGAG GCCACACGCT GTACGAAGGG
GGCAAAGCAA GCAACCACAT AGCCTACGCC CAAGATTACT TTTTATCGTC TGTGCAAAGA
AATACCGGTG AGCCACGATT GTGCGATGGG GACCTTGGCG TCTGGTTGCG ATCGCCACGT
TACGAGACAG AGCACACGCA CGGGACCGGT TGTACCCTGT CGGCATCCTT GGCGGCTTCT
TTGGCGCTGG GGGAACAAGA ACGCCAAAAA CCGGACGGAA AGCGACGAGG AGCAACTAGT
GCCATAGATA CAGTGGATGC ATGCTGCTTA GCCAAGGCAT ACGTCACGGC CGGTATTTTT
CATGGAATTC AGCTGGGACA AGGTCCAGGT CCGGTTGCAC AGACTGGGTT CCCGTCTTCC
CACCAATACT TCCCCATGGT GGTTGCAGAC GCAGCGGAAG ATCATCAAAG ATTTCCACGA
ATGAAAGCCT ATGATGACAA GACTACCTAT GATGATAATC GACCAACGCT TGGTCGGATA
TTGCCTGTTG TCAACGATGA GGTTTGGGTC CAGCGCCTGT GTCAAGATCC CGGTGTCCAC
GACATTCAAC TTCGGGTTAA AGGTATTGAC GACAACAAGA AAATCTTGGA AATTATTAAG
AAGTGCCAGA AGCTATGCCA GGCATCGGGC AAGCGCCTTT GGGTTAACGA CTACTGGCAA
GAGGCAATAG AATCGGGATG CTTTGGTGTC CATGTTGGCC AGGAAGATCT TTACACATGC
ATACGAGCAG GTGGTCTGCA ACTTTTGCGA GAGAAAAGGA TGGCGTTTGG AATTTCCACA
CACTCTTACG GTGAGCTTGC CACAGCATTG GGGGTCAAAC CGTCCTACAT CAGCCTTGGT
CCCGTATTCG CAACAAGTAG CAAAACCGTT CAGTTTGACC CGCAGGGCTT GTCACTAGTA
CGGAAATGGA GAGAGCTTAT ACAAAAAGAG GTCCCGCTGG TTGTCATTGG CGGATTCTCA
GATGCTGAAC GAGCAAAGGC TGTGCGAGGA TTGGGGGCCA ATTGTGTGGC AGTCATTGGA
GCCGTAACTC AGGCAAATGA CACAGTTGAG GCTGTGTCAC AGATGAACGA AGCCATGCGA
TGA
 
Protein sequence
MKDLVTFSLF ILLFCTCSDS LAVYGRSAAT TSRTAPPIVY TIAGSDSGGG AGIQADLHAI 
HSFGCHGCSA ITCLTAQNSV GVIGVHAPPP DFLRAQLETL LEDLPPQAIK IGMLGTKELA
IEVGAFLKKL KALDRKVWVV LDPVMITTSG HRLIEEDAQE AMVKHVFPHI DVLTPNKFEA
EALLNRTLET ISDVEEGAKD LIALGAPSVL IKGGHTLYEG GKASNHIAYA QDYFLSSVQR
NTGEPRLCDG DLGVWLRSPR YETEHTHGTG CTLSASLAAS LALGEQERQK PDGKRRGATS
AIDTVDACCL AKAYVTAGIF HGIQLGQGPG PVAQTGFPSS HQYFPMVVAD AAEDHQRFPR
MKAYDDKTTY DDNRPTLGRI LPVVNDEVWV QRLCQDPGVH DIQLRVKGID DNKKILEIIK
KCQKLCQASG KRLWVNDYWQ EAIESGCFGV HVGQEDLYTC IRAGGLQLLR EKRMAFGIST
HSYGELATAL GVKPSYISLG PVFATSSKTV QFDPQGLSLV RKWRELIQKE VPLVVIGGFS
DAERAKAVRG LGANCVAVIG AVTQANDTVE AVSQMNEAMR