Gene PHATRDRAFT_35518 details


Gene Information

Locus tag: PHATRDRAFT_35518
Symbol:
ID: 7200771
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011676
Strand:
Start bp: 94902
End bp: 98267
Gene Length: 3366 bp
Protein Length: 982 aa
Translation table:
GC content: 56%
IMG OID:
Product: predicted protein
Protein accession: XP_002179975
Protein GI: 219118402
COG category:
COG ID:
TIGRFAM ID:
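
The Start bp, End bp, Gene Length and Protein Length entries above are related by simple arithmetic, which the following minimal sketch works through. It assumes the coordinates are 1-based and inclusive, and that the gene span runs from the start codon through the stop codon; neither is stated in the record. The function names `span_length` and `coding_length` are hypothetical helpers introduced for illustration only.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values copied from the record above.
gene_len = span_length(94902, 98267)   # matches the listed Gene Length of 3366 bp
cds_len = coding_length(982)           # 982 aa plus one stop codon
non_coding = gene_len - cds_len        # bases in the span not accounted for by codons

print(gene_len, cds_len, non_coding)
```

Under these assumptions the span is longer than the coding length, which is consistent with the record marking the gene as spliced: the remaining bases would correspond to intronic sequence.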


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCAACGG GAGTCACGCG CGGTGACCAA GCCACCGGGA TTTCTCTGCC AGCCGTACTG 
CGGTACCGTA CCGTACAATA CGTTATGCGT TGTTCGTTGT TCCCTAGTGG CCATCCCACA
ATTCCATCCG GACTCTCCCT TGGGTTGGTC GTTTGACAAA CCTACCAAAA CACCACTCTA
GCCCGACCTT GTTACCCAAC ATCTACTCAC GGTCCGAGTC TATCCGAGTG ATTCGTCGTC
GTGGATATCG GTGAGAAAAC GAGACGCCCA GAGACTTTTT GAAGAGGCAA CGATTTCGCT
TCCTACCCGA CACCATGATA GTACTGCGTG ATTCACACCC CGACGGGTCG AAGGATCCGT
ATTGCAACGA CTACACTCGC AACAGGCACG GGTATGGATG CTGCTAGTGA GCTTGTTGTG
TGCGAGTCTC GTCGTCCCCG CACAAAGTCA GAGTGCCCGC ATTACGGGCT TTTCCTTGCT
GAACGCGGAC ACCCATCAAG TGATTCAACC ATTACGCAAC GGAGACGACA TTGATTTGTT
CCGCGCCGGA ACGACGTTGC TCTCGATTCG TGCGGAAGTG TCCGGATCAC TCGAGGGAGG
TTCCGTCCAG ATGATCCTGA ATGGACGAGT GCGCAACGTG GACGTCAGTC CTCCGTACAG
TCTCGGAGGC GACAACAACA ACAACAACAA CACAACGCAC AATACGAGAG TATCCCGGAA
TTGGCCCAGT GGGGTGGACA TTCCGTACAG GCTCGTATTA TGGAATTCCC GGACGGCACC
GGAAACGTGC AAGACTCCCG TTCGATTGGC TTTGCCATCC GCAATTCGGA TCCCAACGCT
CCCACGGCGG CACCCGTCAC ACCCGAACCT ACGACCCCTT GGACCGGCGA AAGTATCTCC
CCCAGCACTG GTGACGGAGA TGACTTTGCC ACGGCGGCGC CTACGGCTTC CAACGTGCAG
ACTCCTCCCA CCACACCCAC GTCGACGGTG GCCTCGCCTT TTCCTACGGC TCCACCCGTC
CCCGTACGGC CACTCGAACC AACCGACGTA CACGCCTATC CCGCCAGTGT TCGGGGAACA
CTCTCCGGTA CCTTGGAACC GTGGAGTAAG CTTACCTTGT GTTTCCTAGC CACTACTACT
GATGACACCA GCGCGACCGC AACGACAACC TCCGCATTCA CCCACGAACG CAACGAAACC
GTCAATCCCT TTACGGACAT TCGCCTCGAC GTCACCTTTA CCGCGCTCGA AGAACCCGTG
GAACTCGTCG TTCCGGGATA CTACGCGGCC GATGGCCACG CCGCCCATAC ACACGCTACC
GCCGGGGCCG TCTGGTGCGT ACACGCCACC CTGCCCTCGG AAGGCTCCTG GATGTGGCGC
GCCAACTTTT GGCACGGTGC CAACGTCGCA CTCTTTGACG TCAACCACGG AGGCGTCGTC
AAAACACCGC TCTTTCCCGT ACACGGGTCC ACCGGACAAT TCATCCTCAC GCCAACCACC
GCCAACGGCG ACGACGAAGA CGACCTGGCC ACGGGCCGGG CTGTTACCAA CGCCACGACA
CCGCGGCGGA CGCGGGGACG ACTCCAGTAC GTGGGAGAAC ACGCGTACAA GTACCCCAGT
GGCAACGATT GGTGGTTGAG TTTCGGTGCC GCGAGTCCGT CCAACGGTCT CGCGTACGAT
CGCTTCGACG GAACCACCAA TGCCGGGGAA CGCCGCAAAT CCTGGACACC CCACGCTGAC
GATTACGTAT CCGGCAATCC GACCTGGGCC GGTGGACAAG GCCGAGAACT CGTGGGTGGT
ACGTGCGTGT GTGTGTGTAC ATGTCGAACC CCCCAAAGTG ATGGTTGTTG TAGTGGTAGT
GTTTGGTGTG AAGGTTGTGT TTGTGATTGT ATAAATGTGT ATACTGACGC TGTCGATTGT
TCTCGTTTTC GTTCGTCTTC CATCTTTGTT ACAGCACTCA ACTACTTGGC GAGTCAGAGT
TTGAATCTCG TGACATTTTC GACCTTGACC TTGGGTGGAC CGGACGGTAA CGTTTTCCCG
TTTGTGTCAC CGCAACCATC GGATCGATTC CGTATGGACG TTTCCAAACT GGCGCAATGG
GAAGTAGTGT TCCAGCATGC CGATGAACTC GGGTTGCTGC TGAATTTGCG ACTCGAGTCC
GAGTCCGCAG CGGACGTCCT GGATGGGAAA GCTGGCGTCT TGGGACTTCG ACGACGCTTG
TACTATCGTG AAATGATTGC TCGATTTGGT CATCATCTTT CCCTGATTTG GAATTTGGGC
ACCGCAACAG CTACGGCTAG CTTCAGTACC GCAAACCAGC AATCGCTGAC CAACTACATT
CGGAGTGTGG ACCCGTACGA ACATCCTGTG GTTTTACAGA CGCCATCGAA CCAGCAAGCC
GAAGTTTACG AAGCCTTGTT GTCGAGTTCG AATGTAGCCG TGGAAGGAAC CTCGCTAGCT
TCCGATCTAT ACGATACGTT CAACGATACG CTGATCTGGA GATCGTTGTC CGCCGAACAA
GGTCACAAAT GGGTCGTCAC CAGTGAATAT CAAGGTTCGC AAGGCGCAAC CGCGGATAGG
GATGATCCCA CGCACGATGA ATTCCGCGTT GAAGTCCTGT GGGGTAATCT TTTAGCGGGT
GGTACCGGAG TTGCGTACCA TTTTGGTGAC GAAAGGGGCG ACAGCAGCGG GTGTTCCGAC
TTGGCCTGTC AAGACTGGCG CAGTCGGGAG GCTTTATGGG GTCAATCACG CTATGCTCTG
GAATTCTTTC GTGAAAACAG TATCCCGTTT TGGAACATGG GCAATTCGAA CGAGCGCTGC
ACGGACGGCA ATCGATGCTT TTCTAACGAC GAATTTGTTG TGGTGCAGGT CCTACGAACC
GACACACCCA GTCTCGTCGA CTTGACGACG CCGTCTCCCG TCGTCGCAAC GTACAGCTTA
AAGTGGTTTG ACCCACTCCT CGGCGGACCC CTCCAGGATG GCAGTGTTGC CTCCGTGTTT
TCCGGTCCTG CACAGGATCT TGGCACTCCA CCAACTTCAA CTGGCCAGGA GTGGATTGCT
TTGCTCACAC GCAACCGGTT GCCACCCACG ACAGCCCCAA CAATTTCGTT GGCCCCAACA
CAAAGTCCTC TTCTGGTTGT GGTGCCTCCG ACCCACGCAC CTCACGTACC AGGGACACCC
ACTGGTACAC CCATAGAGAT GCCCTCGTTC AGAGAGTCTG ATTTCCTTTC CAGAACCATT
GAACCGACCT CTGGACCCCC TAGTGAAGGC GTGTCGAGTG CCGTGGCTCC TACGGCGAAT
ATCAGTGCCG TTATTCAATG GATTCTTTTA TTCTTGATCT TGGGGCTGGT ACGAGTGAAC
CCATAA
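
The gene sequence above is laid out in space-separated blocks of ten bases, so it has to be cleaned before its length or GC content can be checked against the Gene Length and GC content entries. The sketch below shows one way to do that. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the `excerpt` string holds only the first two blocks and the final line of the sequence as a stand-in; paste the full sequence in its place to check the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Short excerpt only (first two blocks and last line of the gene sequence),
# used here as a placeholder for the full block-formatted text.
excerpt = """ATGTCAACGG GAGTCACGCG
CCATAA"""

seq = clean_sequence(excerpt)
print(len(seq), f"{100 * gc_fraction(seq):.1f}% GC")
```

Splitting on any whitespace handles both the spaces between blocks and the line breaks, so the same function works on the protein sequence layout as well.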
 
Protein sequence
MSTGVTRGDQ ATGISLPAVL RYRTVQYVMR CSLFPSGHPT IPSGLSLGLA RVWMLLVSLL 
CASLVVPAQS QSARITGFSL LNADTHQVIQ PLRNGDDIDL FRAGTTLLSI RAEVSGSLEG
GSVQMILNGR VRNSRRRQQQ QQQHNAQYES IPELAQWGGH SVQARIMEFP DGTGNVQDSR
SIGFAIRNSD PNAPTAAPVT PEPTTPWTGE SISPSTGDGD DFATAAPTAS NVQTPPTTPT
STVASPFPTA PPVPVRPLEP TDVHAYPASV RGTLSGTLEP WSKLTLCFLA TTTDDTSATA
TTTSAFTHER NETVNPFTDI RLDVTFTALE EPVELVVPGY YAADGHAAHT HATAGAVWCV
HATLPSEGSW MWRANFWHGA NVALFDVNHG GVVKTPLFPV HGSTGQFILT PTTANGDDED
DLATGRAVTN ATTPRRTRGR LQYVGEHAYK YPSGNDWWLS FGAASPSNGL AYDRFDGTTN
AGERRKSWTP HADDYVSGNP TWAGGQGREL VGALNYLASQ SLNLVTFSTL TLGGPDGNVF
PFVSPQPSDR FRMDVSKLAQ WEVVFQHADE LGLLLNLRLE SESAADVLDG KAGVLGLRRR
LYYREMIARF GHHLSLIWNL GTATATASFS TANQQSLTNY IRSVDPYEHP VVLQTPSNQQ
AEVYEALLSS SNVAVEGTSL ASDLYDTFND TLIWRSLSAE QGHKWVVTSE YQGSQGATAD
RDDPTHDEFR VEVLWGNLLA GGTGVAYHFG DERGDSSGCS DLACQDWRSR EALWGQSRYA
LEFFRENSIP FWNMGNSNER CTDGNRCFSN DEFVVVQVLR TDTPSLVDLT TPSPVVATYS
LKWFDPLLGG PLQDGSVASV FSGPAQDLGT PPTSTGQEWI ALLTRNRLPP TTAPTISLAP
TQSPLLVVVP PTHAPHVPGT PTGTPIEMPS FRESDFLSRT IEPTSGPPSE GVSSAVAPTA
NISAVIQWIL LFLILGLVRV NP