Gene PHATRDRAFT_50485 details


Gene Information

Locus tag: PHATRDRAFT_50485
Symbol:
ID: 7199324
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011698
Strand:
Start bp: 191030
End bp: 193289
Gene Length: 2260 bp
Protein Length: 530 aa
Translation table:
GC content: 49%
IMG OID:
Product: predicted protein
Protein accession: XP_002185395
Protein GI: 219130486
COG category:
COG ID:
TIGRFAM ID:
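
The gene length listed above is consistent with the start and end positions if the coordinates are read as 1-based and inclusive. That reading is an assumption, since the record does not state its coordinate convention. A minimal sketch of the arithmetic follows; the function name `gene_length` is hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values taken from the record above.
length = gene_length(191030, 193289)
print(length)  # 2260

# Assumption: the coding sequence ends with one stop codon, so a 530 aa
# protein needs (530 + 1) * 3 coding nucleotides. That is fewer than the
# gene length, which fits a spliced gene with non-coding sequence.
coding_nt = (530 + 1) * 3
print(coding_nt, coding_nt < length)  # 1593 True
```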


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.517757
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
AGCTGGCAGT GAGTGTTCAC TGCATGCACC TCCTGACTTC TCCGGGGGAA GCAGGAAACA 
TATTTCGTCA GCAGCTTATA TTTACAAAAC TGCTTCGCTC TACCTCTTTG TGGTTCATTG
AGTAGTGAGA GTTGCTACGG CAGAAAACTG TTCCGCACAG CACCGTTCCT TGTGCTAGTC
CGTTCCCTAG CGCGGTTCAA CTTTCCTGAC AACCGTATAT TTCTTGGAAA GAACGGTCAC
ATTTCTGAAC CCTTCGACTG TTGGAAGGCC GGAGTTTCCG GCTGCAAATG CATTGAGCAT
CTCGTCAATG GATTCTGACA ACGTATTAAC AGCTGATGCT ATTGGGGACG GAGAGGAAGA
TTTCCTGTTG CGGAACCGCC TGCTTTCGAG TAGCAACACC AGTATGGCAA GCCCACCATT
GCACTCGGGA GGTGGAACTG ACCATGAGAG TTGCGGGGCT ACTTGCAGCA CTCCAATCAT
GCCAACACCG ACTAGTGTAT CACCCTCAAC GCCGTCAATA GCAGCGACAC GCCGGAGCCA
AGGATTCGGT AGAATCCTTT CCTTTCCAGG TCCAATCCTT TACAATATTC AAAGCAGATG
TACTCGACTT ACTGCCGGAA TCCTTCAGAA CCCAATATCT GGAAAGGGTG GACCTAGCGG
GTGGTTTCTC CTGCTGACGA TAAGTGCCTG GTTCGGCTTG GGCGTTGTGG CGATCGTAAC
CACAAAACTC CTTTTAACGA GTTGGAAGGT TCCTCCTCTG TTGTTGACAT TCCAACAACT
GACTGCAGCA TCAACATTGC TACGAGTCGT GTTGGGTTTG CAACAGAACC TTCAGCCGCT
GCCATGGGAA AACTATTGTC GCGCCACCAT AGCACCTGAT GCTCCATCTG GTACGGGAGC
ATCGACTCTG GATGCGACGG GTACCGAGGA ACACTCGATT GTTGAGCTGG GAGCCGATCA
AAATCACTCT GCCGTGGAGG ACCGAATCCA GACGCATATT TCTAAATTTC ACAGCCCCAA
TAACTCACCG TGGAATGTGG AAAATACGGA ATTTTTTCTG ATTGGACTGT TCAACGCATT
GGACTTCTTG GCTTCGAACA CCGCATTCTC TTCATCAGCT GCTTCATTCG TGGAGACAAT
CAAAGCGAGC GACCCCATTA CTACTACGGC TGTTGCCTTG ATTTGGAAAA TTGATCAGGT
CAAACGGCCG GAAGCGATTT CCTTAATGGT ACTCATAATC GGTGTCCTGC TTTCTACAAT
AGGAAATGCC ACAAGCTCCA ACACTACTGG CGAGGATCCA CTTTCTAGTT CGGAGCTGTC
CGTAGACGAG ACTGACGATG ACAGTGCTGC CGAAGCGCAA GAAGCTTTGT ACCTGTCCAT
CCGCACCGCC ATTACTGTGG TCACAGCCAA TTTGTGTTTC GCCTTTCGGG CCATGAATCA
GAAATTGTAC CGCAGGCATA CAAGCACGGG AGACCAGCTG GACGATGCGA ATTTGTTGTG
TCGGTTACAA CAGACTGGAG CCTTGAGTTT GCTGTTCCCC ACAATGCTCC TGTACGCAGG
ATTTGTGTTT GACGCTCTTT GGCAAACGCC GAGAGAAATT GTGTTGCAGT ACGTTGGGTT
GGCGGCAGTC AATGCGGGTG CATTTGTTGC CTACAAGTAA GTTTGCATCT GTAATGGAGG
GGATTATGTA CGTCGTGACG TAAGCTCATG CGCGCTTTCT GGTTTTCATT TTCATTTTAG
CCTTGCAGCA TGTTATGTTT TGAGCAACCT TACGGTTTTA CACTATTCAG GACTTGGTTG
CATGCGCCGC ATGTTTGCTA TTTTGTCCAC TAGCATTTTT TTCGGCGTCC CCATTTCAAT
TTTGGGAGCT GCGGGCATCG TGTTGTGCCT CGCTGGTTTC CTATCCTTTA CGTACACACG
TTCCCAACGT ACAGCAAACA AAGCTATCTT GAAAAGTTTC GATCACAAGG ATTCCAATGT
ATAAAGCAAT GCGACTTGAG TTTTCTATAC CGGACAGTCC ATTCCAAGTT CCTGCCTTGA
ACCCCGCGTC ACAGTTTTGG CTTGCAGTGC CGAACCGAAA GCAAGCGTTA AACCAAAATT
GGAGTGCTCT GTTGGAAAAC AAAGTTGCAC GGGTCAAGTT ATGTCCGTTC TTGGCCAAAA
CGTAGGTATT CCGCTATGTG ATTTACTGTA AGGCAGTTGA GTACTGCAGC GGTCTGTATA
GACATGCAGG GTTCAACATA ACTTTTAAAC TTATGCTAGG
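
The gene sequence above is laid out in space-separated groups of ten bases, so it has to be joined before its length or GC content can be checked against the Gene Information fields. The sketch below shows one way to do that. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the short `example` string is only the first three groups of the sequence, not the full record.

```python
def clean_sequence(block: str) -> str:
    """Join a whitespace-grouped sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three groups of the gene sequence, split over two lines as in the page.
example = "AGCTGGCAGT GAGTGTTCAC\nTGCATGCACC"
seq = clean_sequence(example)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full sequence block to `clean_sequence` and then to `gc_percent` would give a value comparable to the GC content field, which the record reports rounded to a whole percent.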
 
Protein sequence
MDSDNVLTAD AIGDGEEDFL LRNRLLSSSN TSMASPPLHS GGGTDHESCG ATCSTPIMPT 
PTSVSPSTPS IAATRRSQGF GRILSFPGPI LYNIQSRCTR LTAGILQNPI SGKGGPSGWF
LLLTISAWFG LGVVAIVTTK LLLTSWKVPP LLLTFQQLTA ASTLLRVVLG LQQNLQPLPW
ENYCRATIAP DAPSGTGAST LDATGTEEHS IVELGADQNH SAVEDRIQTH ISKFHSPNNS
PWNVENTEFF LIGLFNALDF LASNTAFSSS AASFVETIKA SDPITTTAVA LIWKIDQVKR
PEAISLMVLI IGVLLSTIGN ATSSNTTGED PLSSSELSVD ETDDDSAAEA QEALYLSIRT
AITVVTANLC FAFRAMNQKL YRRHTSTGDQ LDDANLLCRL QQTGALSLLF PTMLLYAGFV
FDALWQTPRE IVLQYVGLAA VNAGAFVAYN LAACYVLSNL TVLHYSGLGC MRRMFAILST
SIFFGVPISI LGAAGIVLCL AGFLSFTYTR SQRTANKAIL KSFDHKDSNV
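
The protein sequence can be tied back to the gene sequence by translating from the first ATG that matches the protein start. The sketch below assumes the standard genetic code, since the Translation table field in this record is empty. The names `CODON_TABLE` and `translate` are hypothetical. The 30-base fragment is copied from the gene sequence, beginning at the ATG in the group `CTCGTCAATG`.

```python
from itertools import product

# Standard genetic code (assumed), with codons ordered by base T, C, A, G.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(seq: str) -> str:
    """Translate a nucleotide string codon by codon, stopping at a stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Ten codons from the gene sequence, starting at the putative start codon.
fragment = "ATGGATTCTGACAACGTATTAACAGCTGAT"
print(translate(fragment))  # MDSDNVLTAD
```

The result matches the first ten residues of the protein sequence. Because the gene is marked as spliced, translating the whole genomic sequence in one pass would not reproduce the full protein; the introns would have to be removed first, and their positions are not given in this record.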