Gene PHATRDRAFT_43490 details


Gene Information

Locus tag: PHATRDRAFT_43490
Symbol:
ID: 7197542
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011670
Strand:
Start bp: 599190
End bp: 601260
Gene Length: 2071 bp
Protein Length: 630 aa
Translation table:
GC content: 49%
IMG OID:
Product: predicted protein
Protein accession: XP_002177967
Protein GI: 219112431
COG category:
COG ID:
TIGRFAM ID:
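
The gene length listed above matches the start and end coordinates if both are read as 1-based and inclusive. That reading is an assumption, since the record does not state its coordinate convention. A minimal sketch of the arithmetic, using a hypothetical helper named span_length:

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    # Both endpoints count as part of the feature, hence the +1.
    return end_bp - start_bp + 1


# Start bp and End bp as listed in the record.
gene_length = span_length(599190, 601260)
print(gene_length)
```

Under a 0-based, half-open convention the same coordinates would give a length one base shorter, so the +1 should be dropped if the source turns out to use that scheme.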


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.416955
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CTTCCGGCGA TATTTGGTTT ACTTTTTTAT TAGCGTGACT TGCCTTCTAA GCTTGTCGTA 
GCTCATCTGC TATTCCTTCC ACCAATCGGT TTTCATTACA GTCGCTCTAG ATAGGAACAG
AGCGCCCCGG GATGTTCGAA TCTCGTACGG GCAGGACGAG GTTCCTCACT GTTCTACTAG
TGAGCGCTGG AATTGGCTCG GAAGGGTTCG AAGCATCCGG CCGTCCTCGA CTACGGAATG
GGAAATTGAA GACGCCCCGA CGTTTCTTCG TGCGACACGC TACTATCGCC GAGAATGAGG
CACCGGAGAG CTCTCATCTA GGTGCGACAC CCCCGGATAT GGTTGCCTTT GCTTCCGGAT
ACACAACCGT ATTTGAGGAG CTAGTATGTA AATCCTGCCA AGCATCGCAG GGAGAGGTAC
CAGACGATCT ATTCGGAACC TATTTCCGTT CTGGACCCGC CATGTTCAGT GCTGGCTCAA
TTTTACCACC CAAAAAGTCA TTGATCCAAC CAAAGCAACC GCCAGTTTTA GACGGGGAAG
ACAAGACGCG GATGGTACCA CACCCCTTCG AAGGCGACGG CGCTGTTCTG GGAGTAACAT
TCTCTAGAGA AGGAGATGTC ACCGCCCGTT TCCGATTCAT TCGAGGCACT CCTTTTACGT
ACGAACGCAA GAAAGGCAAA CGTGTTTATA CTGGTATGGA TTCTACTCGG ATGGAGGGCC
CGTCAGCCGG TGGCGGCTTA GCCAATGACC TCCCTCTTCC ACTTTATCGT CACAACTTGA
TGCCAGGGCT GAACAAACTT CGGAAAAATA CCGCAAATTC TCGCGCAATC TATTGGGGGA
AGCGTCTCTT TTCGCTATGG GAAGGAGGAC AACCGCACAA ACTGGATGCG CTTGCACTGT
CAACAGATGG GAGATCGATG TTGGGAGGTG CTATAAAGAA GGAAGCAGAT CCGTTCGGAG
GAAAGATGAT TTACGATCCA TCCAAAAACC GCGCTTTGTT TTATGCGGTA TCTCACGAAT
CAAAGGACTC CAGCATTGTT TGTTACGAAT TCGACGACAA GTTCCGCCTG ATAGAAAACG
GACGGATCGA AACGACAGTA CCAGGGTTTG CTTTGATAAC AGACTTCGCC GCGACCGAAA
ACTACGCCGT GTTTGTACAG CCTCCCATCG CTACGAATGG GATGAAGTTC CTTATGGACA
AAGGTCCCGG TAGAGCGTTG AAAGTGGAAG ACCGACCGTC TATCGTTCAC CTCATTCCAC
GTCCCGAGTC TTCAAAGCAA CAGATGTCTT TACCTCTCCC GATCGATTCT CTTTCGGATT
CAAACTTACA CTTTATAAAC GCATATGAGG ACGGTGGTTT GATCATTTTT GATGCATTCG
CTCGGACGGA TCCAAAATAG GCGACAAAGT GCTATCTTGG CCTTGGGGAT CGTCCTTGGA
AGAATACCAG GCGTGCGCCT CCAAAAAATC CCTTTGGCGG TACACGATAG ACACGCAGAG
GGGATCCGTT TCTAAAAAGC TCATGTTCAA CGACCACTGC TTTTTTGGTG GGATTAACCC
TGCTGTTAGT ATGAAAGAGC ACCGGTATAT CTACATGAAT GTCGGGGCTT TGGGAGCGGA
TGTGGCTCCA CCTCAAGGGA TTGCACGATT TGACTGTGAA ACAGCGGAAA GTCAAGTCTG
GATGCCCGAA AATTTTGAGT TCTGCGGAGA ACCAATGTAT GCTAGACGAG CGACAGAGGA
TGGATCAAAC GACCCTGGGT ACATTTTGTC GGTTCTCTAC AATGGAAAGA AAAACGAGAG
TGAATTGCTA ATCTTGCAAG CCAATAAGAT TCCGTCGGGG CCAATTGCTC GCCTTCCCTT
AGATATTGCC ATTCCACACG GACTTTTCGG ATGCTTCAGT ACAGCTGAGG AAGCTACGTC
CTGGTCGACG GAAGAAATCG AAAGGCGGGC CAAACTTGCT GACAAGATGG AATCCAAGGG
AAACATGTGG AACGAGGTCC GCAGCGAATT TTCAGGTCTA GGTTTGCGGT TTTCGGACAT
GGACGAGTAT GGGTTTGATT TTTTGTTTTA G
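
The gene sequence above is laid out in space-separated blocks of 10 bases, so the whitespace has to be removed before the sequence is measured or passed to other tools. The sketch below is one possible way to do that and to compute a GC fraction. The helper names clean_sequence and gc_fraction are hypothetical, and only the last line of the listing is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    # str.split() with no arguments drops spaces and newlines alike.
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 to 1.0)."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Sample input: the final line of the gene sequence listing.
last_line = "GGACGAGTAT GGGTTTGATT TTTTGTTTTA G"
seq = clean_sequence(last_line)
print(len(seq), round(gc_fraction(seq), 3))
```

Passing the full listing through clean_sequence gives a string whose length can be compared with the Gene Length field, and gc_fraction on that string can be compared with the GC content field.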
 
Protein sequence
MFESRTGRTR FLTVLLVSAG IGSEGFEASG RPRLRNGKLK TPRRFFVRHA TIAENEAPES 
SHLGATPPDM VAFASGYTTV FEELVCKSCQ ASQGEVPDDL FGTYFRSGPA MFSAGSILPP
KKSLIQPKQP PVLDGEDKTR MVPHPFEGDG AVLGVTFSRE GDVTARFRFI RGTPFTYERK
KGKRVYTGMD STRMEGPSAG GGLANDLPLP LYRHNLMPGL NKLRKNTANS RAIYWGKRLF
SLWEGGQPHK LDALALSTDG RSMLGGAIKK EADPFGGKMI YDPSKNRALF YAVSHESKDS
SIVCYEFDDK FRLIENGRIE TTVPGFALIT DFAATENYAV FVQPPIATNG MKFLMDKGPG
RALKVEDRPS IVHLIPRPES SKQQMSLPLP IDSLSDSNLH FINAYEDGDK VLSWPWGSSL
EEYQACASKK SLWRYTIDTQ RGSVSKKLMF NDHCFFGGIN PAVSMKEHRY IYMNVGALGA
DVAPPQGIAR FDCETAESQV WMPENFEFCG EPMYARRATE DGSNDPGYIL SVLYNGKKNE
SELLILQANK IPSGPIARLP LDIAIPHGLF GCFSTAEEAT SWSTEEIERR AKLADKMESK
GNMWNEVRSE FSGLGLRFSD MDEYGFDFLF