Gene PHATRDRAFT_49978 details


Gene Information

Locus tag: PHATRDRAFT_49978
Symbol:
ID: 7198561
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011693
Strand:
Start bp: 486539
End bp: 488639
Gene Length: 2101 bp
Protein Length: 526 aa
Translation table:
GC content: 46%
IMG OID:
Product: predicted protein
Protein accession: XP_002184715
Protein GI: 219129058
COG category:
COG ID:
TIGRFAM ID:
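
The length fields above follow from the coordinates by simple arithmetic. The sketch below assumes 1-based inclusive coordinates (the convention under which 486539 to 488639 gives 2101 bp) and assumes a single stop codon after the 526 codons. The function names are illustrative and not part of this record.

```python
def span_length(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa, include_stop=True):
    """Bases needed to encode a protein: 3 per residue, plus one stop codon (assumed)."""
    return 3 * protein_aa + (3 if include_stop else 0)


gene_len = span_length(486539, 488639)   # Start bp / End bp from the record
cds_len = coding_length(526)             # Protein Length from the record
print(gene_len, cds_len, gene_len - cds_len)  # prints "2101 1581 520"
```

Under these assumptions, 520 bp of the gene span do not code for protein, which fits a record marked as spliced; how those bases divide between introns and untranslated sequence is not stated here.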


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CAACATTTGA CTGGATTTAC ATCTCAAATT ATAATTCGGA TTTCTTCATC TTCCCGAACC 
GTTTGCCGAG AGGAAGATTG GTGTTCGCAG CTGCTCAGAA AGTCAAGCAG CTGATTGCTC
CGTCTTGTGT CCGGAGTGAT TGACTGCGAG TCTTAATCCG CTATACACGT TCAGGCAAGA
GCTCTTGAGC GCAATTTCTC GGTCCCGTTA CATTGTGTTT TCAGCATGAA CGATCTACGC
TGGCTGCAAG ACACGGCAGG AGAGGCAAAT CAGCAAGAGA ACGAGAGCAC TGACACTTCT
GCTGAGTCGA GCAAAAGCAA TCAACTCGTC GGAGTCGTGT TATCAAATGT TTTTCTGTTC
TTTCTAATTT TTGGACTGTC CGCGACTGTC GATGTCAAGA ATATGAAGCG ACAACTCACC
AATAGATTCG CTATTGGCTG CGGTGTTGCA ATGCAGTTTA TTGTCATGCC ACTGCTAGGG
TTTGTCGCTG TCGTTTCCCT TCGAAACCAA GGTCTTTCCG AAGCTATGGG AGTTGCGTTG
TTGGTTGTTA CATCATCTCC TGGAGGATCC TACAGCAACT GGTGGTGTTC GACTTTTAAT
GCGGATCTGG CTTTGAGCGT TGCCATGACA ACTGTTTCAA GCATACTCAG TATATGCTTG
TTGGTACGTT CTTTTCCGAC TGGTTTTGTG GGGACCACGT AATTGCTACT CTGTTGTAAA
AATTACAAGA TCCACCTTCT TACCCTAAAA CTGGTCTTCT AATGTTCCAT TTTTAGCCCC
TCAATCTCTT TCTATACACC TATCTGGCCT TTGGTATCAC GGATAAGGAC CAAGAGTCTG
TAGTTGAAGC TTTGGATTTT GGAACTCTCT TCATAACACT TGGAATTGTA CTCGGCGCCA
TTCTATCTGG TCTCGCAGCT GGCTATCGTT GGGACAACGC CACCTTTCAC GTCTATGCCA
ATCGGTTTGG TACTATTTCG GGCATGTTAC TCATCTTGTT CTCGGTTTTC TTTTCTTCGG
GGGCCGATGG GGCTGAGTCT AATTTTTGGA GTCAACCTTG GGCCTTTTAT TGGGAGTTGC
CTTTCCTTGT TTGCTCGGTA TCGCCCTCGC CAACATTATT GCTCGATCCG TGCGTTTAAG
TCCACCGGAG ACGGTTGCCA TTTCGATCGA ATGCTGCTAT CAAAATACAG GCATTGCAAC
ATCAGTGGCT ATAACAATGT TTGATAATGT CGAAGAGAGG GCTCAGGCAG TTGCAGTCCC
GTTATTTTAC GGTATTATCG AAGCTGTGGT AATCGGCATT TACTGCATTT GGGCCTGGAA
AGTTGGCTGG ACGAAGGCCC CCAAGGATGA GAATTTATGC CTTGTCATTG CGAGGACGTA
CGAAATCGAT GAAATTGCTG CAAATGACGA ACCTCAGGAT GAAGAATTCA ACGGCAAAGA
ACAACTTGTA GAAGCCTCTA GCCCCATGGG TGCTCAAAAG GATAGTTGTT GCATGGAAGG
ATCAGGAGAA AGGGTGATCG AGCTGCATCA AGAGAACACT GAGAGGGACC GCCTTTACAA
TGAAGGTTCA GGCTTTTGGG CGAGAATATT TCCACCTATT CTGCTTCGAA AGCTCTCGTC
TCTGTTGATG AATGGAGTTG AAGTTGACGA GAACTTGGAA GAGCAAGGAG ACGGCGTTGA
TATCAAAGTC GAAGGAAATT TGTTGGCTCG AAGTCGATTA GGAACAGCTG AAACATCACT
CTCATCGTCA GTATCCAGTC CGCCACATCG CTCTAGAACG GGCTCAAGCA TGTCCATTGA
GCAAGGTTGC CTAAACACGG ACGACCCCTC AATACCATGT TTAACCGTTC CCGAAGACTC
TGAACACGCC TTACCCGACC TACTACCCAT GATCTCCTCG TCAACTGAGC ACTATCTGGC
TTCAGAAGAC CTTCCTTATG CAAACAGAGC CGCTAAAGAG GAATAGATGT TGTCGCGATA
GCAACTTGTC ACCGAAACGG ACTCCCGGAC AAAACTACCT GAAGGTAGAC CATACTACCT
AATGATACTA GAAACCATTC TAAGCAGTGA CCATAAAAAT AAAGCATCAA CAAACACGAT
C
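
The GC content reported under Gene Information is the share of G and C bases in the gene sequence. The following is a minimal sketch of that calculation, assuming the sequence is pasted as plain text with the spaces and line breaks shown above. The helper names clean_sequence and gc_content are hypothetical. The example runs on the first 10-base block only; no figure for the full sequence is asserted here.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a pasted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_content(text):
    """Fraction of bases that are G or C."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


first_block = "CAACATTTGA"  # first block of the gene sequence above
print(f"{gc_content(first_block):.0%}")  # prints "30%" (3 of 10 bases are G or C)
```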
 
Protein sequence
MNDLRWLQDT AGEANQQENE STDTSAESSK SNQLVGVVLS NVFLFFLIFG LSATVDVKNM 
KRQLTNRFAI GCGVAMQFIV MPLLGFVAVV SLRNQGLSEA MGVALLVVTS SPGGSYSNWW
CSTFNADLAL SVAMTTVSSI LSICLLPLNL FLYTYLAFGI TDKDQESVVE ALDFGTLFIT
LGIVLGAILS GLAAGYRWDN ATFHVYANRF GTISGMLLIL FSSTLGLLLG VAFPCLLGIA
LANIIARSVR LSPPETVAIS IECCYQNTGI ATSVAITMFD NVEERAQAVA VPLFYGIIEA
VVIGIYCIWA WKVGWTKAPK DENLCLVIAR TYEIDEIAAN DEPQDEEFNG KEQLVEASSP
MGAQKDSCCM EGSGERVIEL HQENTERDRL YNEGSGFWAR IFPPILLRKL SSLLMNGVEV
DENLEEQGDG VDIKVEGNLL ARSRLGTAET SLSSSVSSPP HRSRTGSSMS IEQGCLNTDD
PSIPCLTVPE DSEHALPDLL PMISSSTEHY LASEDLPYAN RAAKEE
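
The start of the protein sequence can be checked against the gene sequence by translating the bases that follow the first ATG. The sketch below assumes the standard genetic code (NCBI translation table 1); the record leaves its Translation table field blank, so that choice is an assumption. The translate function and the 30-base excerpt are illustrative.

```python
# Standard genetic code (NCBI table 1), listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate DNA in frame from its first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# 30 bases copied from the gene sequence, starting at its first ATG.
excerpt = "ATGAA CGATCTACGC TGGCTGCAAG ACACG"
print(translate(excerpt))  # prints "MNDLRWLQDT", the first ten residues listed above
```

Because the record marks the gene as spliced, translating the full gene sequence straight through is not expected to reproduce the whole protein; the intron positions are not given in this record.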