Gene PHATRDRAFT_47172 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47172 
Symbol 
ID7201953 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011680 
Strand
Start bp708876 
End bp710771 
Gene Length1896 bp 
Protein Length629 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181252 
Protein GI219121810 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGCACCATGG CGACGACCAA CGATACCGAT GACGCCTGGG GCGAAGTGTT TGCCTTGGCG 
GAAGGAAAGG ATACGAAAGT CTCCTACGGT AAAATGGAGG CGGAAGCGGA CCCGGAGCGG
TCGGCGTTGG CGTCGCGGGC CCGACACAAA CGTAAGCGAG ACGTCGTGTC GAAGTCGGCG
TCGTCGCCAC TGAACGAGGA GGAAGCGTAC ACGGCATTCT TGGAATCGCG CACTGTCATA
GGATCATCGA TTCCGCAATG GATTCGCTTG GGCGAGACGT TCACTTCGCA CGCGGTGTGC
AAGGGATGGG CCGTGTCCCG AAAAGAATAT AAGCAGGGCA CCTGCCGTCG TTGCCGACGT
TCGGCCGTGC ATCACCTCGC CGAAATAGAT CCAAAATTTC CATGGTGGTG TATTCTTTTC
GTTGGAGTCC GAAATCTGCG TTGTATAGCT GTCGGTAGTA TTCTACGGCA AGCGAAAGAG
AAAGAAGGGT CTCTGGAGGG TCCAGATACC ATTCTTCGAT ACGTCGGAAG GCGAATCTGG
AAAGAATTGG ATACGGGCAT GATACACGGC AGTATCATTC GCTTTCCCAC ACTACAAGCA
AAGTGGAAGA TTTTCCGATC TTCTCTTCAT TCTGTGCTAC ATTCCAAGGC CGACAAAGAG
TTCTTGGATA AGGAAACGGC CTTTCGATGG ATTATCCACT CGGACGCCAT TTACTACCAG
CTGTACTACT TGCAACTTAC ACGACAAATA CCTCTACTTC CACATGATGC TGATACCGTT
AGAAGGATTC CTCATCCCAG CGAATACTTT GGGCAATCGC AATTTGCTAC GGATCATCAA
CAGGCCAAAA TAGCTTTGCG TCTCTTTCTG CAACGGACCG AGACCAATAC TCCTCGAGCA
TCCGATTGGA CGGATCGCTT TGGTTGGACC CACCGCAGCA GTGCCAAGCA TGGGCACATT
TTGGAGACTT TGCACGAATA TCGTATGCTG GAAACGGTGC TGCTGTTCGA CTTCTCGGGT
CTAGTCTCCA CAACCGAAAC CATCACCGCC TGGTTCGCCC AAGTCTCGCC GTCGAACCAG
CTAGACCAGC ACGACACTCC GGCACCACCA CTTTGGATGG CCTGGCGAGA CTCGTGCCGC
GACTTTCTTT GCCACTTGTA CGCGTATGCT ACAATTTCGC AATCAGTCAT TTCTCAGCTT
CCATCTCTGT TATCCAAGTA CGGTATGCAT CAAGGAATCA TTGAGGCGGG AGCAGGGACC
GGATATATTG CAAATCTTTT CATCCGAGCG GGCATTTCGA CCGAAGCGTT CGACGTGCAC
CCAACCAACA GTGGTTCCAA TTCATCATCC GTGCACAATG GGTATCACGG TGCAACCCCT
TCATATGTCT CGGTACGCCA AGGCAAGTCC AGTGCGCTCC ACAAATATTT CTCGCACACA
TTTAGCAAGG CTTTGCTACT GTGCTATCCA CCCCCGGACT CAACCATGGC CTACGACGCG
TTGCGCTCCT TTGTGCAACA CGGAGGATCG CTTTTCGTGC ACGTTGGCGA ATTCCGGGGC
CTTACCGGCA ATTCAACTTT TGAGCAATTA TTAATGGATG ACTTTGCTTT GCTGCAGCGT
TTCCCTTGTC TACCGTGGGG TACTGACGCT GCGGAGCTCA CTATCTGGCG TCGTCGAAAG
GCCAGCGACA ACACGTCCAA AAGTCGCCTA CTTCCTTGTT CGTCTTGTGG AACTAGGGAA
TCGGTACAGC GCCTGCGACT AGTCCGCTAC CTTACCTACT GCAGCGCTGA ATGCGCACAG
CAGCATCAAC CTTCAATAAG CGAGCATCTT CGATGGGCCT TCCTCCCACC GATGCGGATT
GATTGGAAAG ACGACCGTTT TTTTGCGATT TTGTAA
 
Protein sequence
MATTNDTDDA WGEVFALAEG KDTKVSYGKM EAEADPERSA LASRARHKRK RDVVSKSASS 
PLNEEEAYTA FLESRTVIGS SIPQWIRLGE TFTSHAVCKG WAVSRKEYKQ GTCRRCRRSA
VHHLAEIDPK FPWWCILFVG VRNLRCIAVG SILRQAKEKE GSLEGPDTIL RYVGRRIWKE
LDTGMIHGSI IRFPTLQAKW KIFRSSLHSV LHSKADKEFL DKETAFRWII HSDAIYYQLY
YLQLTRQIPL LPHDADTVRR IPHPSEYFGQ SQFATDHQQA KIALRLFLQR TETNTPRASD
WTDRFGWTHR SSAKHGHILE TLHEYRMLET VLLFDFSGLV STTETITAWF AQVSPSNQLD
QHDTPAPPLW MAWRDSCRDF LCHLYAYATI SQSVISQLPS LLSKYGMHQG IIEAGAGTGY
IANLFIRAGI STEAFDVHPT NSGSNSSSVH NGYHGATPSY VSVRQGKSSA LHKYFSHTFS
KALLLCYPPP DSTMAYDALR SFVQHGGSLF VHVGEFRGLT GNSTFEQLLM DDFALLQRFP
CLPWGTDAAE LTIWRRRKAS DNTSKSRLLP CSSCGTRESV QRLRLVRYLT YCSAECAQQH
QPSISEHLRW AFLPPMRIDW KDDRFFAIL