Gene PHATRDRAFT_47120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47120 
Symbol 
ID7201919 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011680 
Strand
Start bp512439 
End bp514685 
Gene Length2247 bp 
Protein Length748 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181391 
Protein GI219122100 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAGCTT CGGTGCGAAT TTTGACGACA TCGTCTGTTG ATTCCAATCC TGCCATTCTT 
CTCGTGGAGC CTGATGGATC CAAAATTTTG ATCAACTGTG GAGAAGGAAG CCAACGGTCT
TTTCTAGACT CCCAGCAGCG GGTTTCGACA GTGAAGGCCG TCTGCTTGAC GCATCTGTCT
TATGAATCCA TCGGCGGCCT ACCAGGGATG ATTCTCACAG CTAGCGATGT TCAGAACGCG
ACCATCGAAA ACGCAAAAGC TGCCGCCGCA GCCAAAGTTC GCAAAAGCAA TAATTCTACA
CAACTTTTGC CCCCCTTTCC AACGGATACG GCACAGGGCC TCAACGTTTT TGGACCAGAA
GGGACTAACT CTTTTCTCAA GTCTCTGCGG CACTTTATGA GACGAGATTC CTTTCGGCTG
AACGTCCACG AAGGGCTCGT CGAGGGCATT CGTGTTTGTC TCCCAAAAAC TCGTAAACGC
AAAAGTGGTC AACCAGTTTC TGAAGGAGCT TTTTTTTCTG TAAAGAGCTT TCCTTTTGTG
GAAAGACGTA TCGGCGATCG CAAGCGTTCT AGGACTTTGC CGGAGCGCAA AACTCTTTCC
TACCTCTTCT GGACACCACG TTTTCCAGGA AAGTTCATGG CCAACGAAGC GAAAAGGCTT
GGGGTACCGA AAGGACCTAT GTATGGGATG TTGAAAAGTG GGAACAGTGT GACCTTTTCC
GACGCTTCTG GCGAACAACG TACAGTGACA AGCAATCAGA CTGTGCAACC AGATAGTCCA
GGAATAGGTG TCGCTGTATT GCGGTATCCC GAAGATTTCT TTGAAGAGCA GCTTTTAGTA
TTTTTCAAGC AAATGACAAT GAAGAGAGTA ATAAGCTCGG TGGGAGTTGA ATTGGAAATT
GCGATCCATA TTGCTAGTCG GAGCTCGTTT GGTGACAAAA TTGCGCGGCA ATGGAGAGAC
GAATTTCCGT CCACCGTACA GCATTTACTG TTGGACACCG ATATTAGCGC GGACTCACAT
GGCACCCCGT TTCGATCAGC GGCGCACGGC GCGTTATGTC GATCTCTCGT TTGCCCGGAC
CTGTATGTAC AGGTTAGAGA GCCAAATACG TTGAGACGAC CATATGGACC TGAGCTGGCT
CGTGCGGGCT CAGAATTCGT TCTACTACCT CGGGGTAAGG TTGGCTTTTC AGACTTCGTT
GATTATAACA TAGATGATGG CAAAGAGAAA GCGAGAACTT TAGTGAAGGA CTCGGGAGCT
TCCACGTTGG CGAAGGAGCT TTTGGCTGAA TGCGCTCTAT GCGTGAATGA ATCATTTTCA
GGGGAGCTCT TTTTCACAGG TACCGGGTCC GCAATACCAT GCAAGCATCG GAATGTGTCC
GGAATTTGTC TCACCTCACC GAATGGAAAC TCTATTCTTC TTGACGTTGG AGAAGGAACA
GTTGGACAAC TCCTCCGCGC AAACAGTGGT CCAACATCAA GTACACTTGC ACACATCAAA
GCTGTGTGGA TCTCGCATCC ACATGCTGAT CATCACTTGG GGATTCTACG ATTACTCCAC
GATCGGAAGG CGCCCGACCC CTTGTTACTA ATGTGTCCAT CACCCATTAT TTCGTTCCTG
ACGGAGTATT GTTCCATGGA TTCTGACCTG TCGAGCGCAT ACGTTGCCGT TAATTGCAAT
GATTTGATCC GAGAAAATGC AAAGGCGAGC TTTCTACTGA AAGAGGCTCT CGGAATTGAT
AGTAGCTTTG CGGTTCCGGT GACTCACTGT CCATACTCTT TTGGCTTGAT TTTAGAGGGC
ACTTGTTTTG GTAAGCTTGT CTACAGTGGC GACTGTCGTC CTTCCAGCCA GCTCGCCAAG
TGTGCTTTAG GCGCTGACTT GCTAATCCAT GAAGCTACTT TTGAAGACGG AATGGAAGTT
GAAGCGGCCT TGAAAAGGCA CTCTACCATT GGAGAGGCTC TTTCGGTTGG AATGGAAATG
AAAGCCAAAT GCGTCGTGCT TACGCATTTT TCGCAGCGAT ATCCAAAGGT TCCACCAACT
CCAGTCAACC ACGAAGGATC AATCCCGGTC ATCTTTGCGT TCGATTTCAT GCGTCTGTCA
CCCAGCAACT TAGTGATGGC CTCCAAGGTG ACTCCGGCAA TTCGTCTTTT GTATCCCGAG
GAAAGCGAAG GAAGGCAAGG CGCGGAAACT GAAGCAGAGT CTATAATGGC AATTCCTGGA
CTGTTCGCAC AGAGCGAACT CCTGTAG
 
Protein sequence
MTASVRILTT SSVDSNPAIL LVEPDGSKIL INCGEGSQRS FLDSQQRVST VKAVCLTHLS 
YESIGGLPGM ILTASDVQNA TIENAKAAAA AKVRKSNNST QLLPPFPTDT AQGLNVFGPE
GTNSFLKSLR HFMRRDSFRL NVHEGLVEGI RVCLPKTRKR KSGQPVSEGA FFSVKSFPFV
ERRIGDRKRS RTLPERKTLS YLFWTPRFPG KFMANEAKRL GVPKGPMYGM LKSGNSVTFS
DASGEQRTVT SNQTVQPDSP GIGVAVLRYP EDFFEEQLLV FFKQMTMKRV ISSVGVELEI
AIHIASRSSF GDKIARQWRD EFPSTVQHLL LDTDISADSH GTPFRSAAHG ALCRSLVCPD
LYVQVREPNT LRRPYGPELA RAGSEFVLLP RGKVGFSDFV DYNIDDGKEK ARTLVKDSGA
STLAKELLAE CALCVNESFS GELFFTGTGS AIPCKHRNVS GICLTSPNGN SILLDVGEGT
VGQLLRANSG PTSSTLAHIK AVWISHPHAD HHLGILRLLH DRKAPDPLLL MCPSPIISFL
TEYCSMDSDL SSAYVAVNCN DLIRENAKAS FLLKEALGID SSFAVPVTHC PYSFGLILEG
TCFGKLVYSG DCRPSSQLAK CALGADLLIH EATFEDGMEV EAALKRHSTI GEALSVGMEM
KAKCVVLTHF SQRYPKVPPT PVNHEGSIPV IFAFDFMRLS PSNLVMASKV TPAIRLLYPE
ESEGRQGAET EAESIMAIPG LFAQSELL