Gene PHATRDRAFT_44468 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_44468 
Symbol 
ID7197700 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011672 
Strand
Start bp655320 
End bp658190 
Gene Length2871 bp 
Protein Length932 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002178550 
Protein GI219115509 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00222146 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGTACG ATTACGATGC ACTCTGTCGA GAGTTTGCAC AGCACCGAAA GGCCTCAATG 
GTGACGGAGA ACGTGATAGA GCGCTCGGGC CATTTGCAAA AGGCAGTCCA GGCCTTGGAA
ACTCTTTCAT CTTGCTGTCC CATGACACCT GCGCTTTGGA TACAATATGC CAGCACTGCT
GCGGAATGGA TTTCACAAGC GTTGTTACGA CAGGAGGATT CGTGTGACGC AGATTCGAGA
ATCAATAAAG AATCTCTCCA AACGCGTTTA CAGACTCTGG AATTGGGCTT ACAAGAATTT
CCCGGATACG TTTTGTTGCA TTTACATTAT ATAGAGCTCT TGATGCACAA AAACGCATGT
ACTGATGCGC CCAAGATCGA ATCGGCGCTG CGGACGGCGA TTGCACAAGT AGGAGGGGGT
TCTCACCGGA ACGAGGGTAG CTGGGTCGTG CAGCTCTACA ATCATCTTGC GACCTTCTTG
GTCAAGCAAA ATCGAGTGAA AGAGGCGTTG CAGTGTTTTG TACAACGCGC CCGAATTCCC
ATGAAAGATG TGAACGATGA GATTGCCAGT GACTACAGAG GATTTTGCGA GAACCACGGC
CTTACTCCTT CTACCAAGCA CTTGGAACAA ATGGAGCAGG GCCGGCGACT AGAAGCCAAA
CTTTTTAATC GGTATATAAC GCTGGAGGAC GAAATTGATG CAGTCATGCA TAGCCAGGGG
ATTCTTCCGA GGTATGATGT TGGTGTAGAC AAACTCGACT GGAAGATTAT GCTTCACACG
GATCGCTACG GAATGGGATT GGGAGGGGCA GACGTGGCAA CCGCATTTGT CAAATATGCT
CTCGAGTGTT CAAATATCTT CAAAAGCGCC GCTCGGCAAG TAGACGAGGA CGACCAGGAC
CTAGAAATAG AAGAAATGCG TCGCCATATT TCTGGCCTGG CCTCGGCAGT ATTTGAGCGC
GGTGTGGCCG AATGTCCGAC CGTCGACTCT TTGTGGCTTT CCTATATTCG CTTCTTGACG
GAACAAGACA ACGGAGATTC TCTGTCCCTG CTACCAAGCG TTTCTCAACG GGCCGTACGC
AATTGTCCAT ACAGCCAGGC TTTGGCTTGT CAACAAATGG ACAACGTATT ACTCTTGGCT
GACAAGGGTC TTATTGTTTT TGATCCCGAT GCACTAATGG AACAGGTGCA GACAGCACTA
GACACAAAAT TTCTTCCAAA TCCAGTACAG TTTTTGGAGC TCTACCTTTG TGTTATTCGT
ATCGTCAAGC GGCGCATGCT AAGTATCTTG TCTGGTGTGG CTGTAACTGA ACAAAGTGGC
AAGGCGGTAC TACGGTATGA CGAGGCGGAG CCTATCCTAA AGTCAAGCAA TGCGAGATCT
CCAAAACGTG ACGGAAAGAT AGACGGGGTG CTACAAGAAG TCCAGGACTT ATGCGAGGAC
TTGACAGACA TGTACGAACA TATCGAACAA AAAATGAGGA AAGTTCTGGG AAAATGGTCA
GAAGGCCGAT CACTCTTGTG GTTGGAGCGA GCATATACGG AAAAATATTT CTTGAACCCG
CTGCGTCGCA TTTTTGAAGG CTCTGGTGAT TCCTCCAGAT CTACAGAAGA CCTAGAAATG
CTGCTATTGT GCGAGAAGCC AGTTCGGTCT CATAGTCCGC CCCATCCTGA CTTGTACTTG
CGCTACATCG AACAATACTT GTTGAGCTAT CCTGTTGTGA ACGCGACTGA TGTCCTTCAT
CGTTTGAGAC GTACTCGTTG GCTGTACCAA AAGGCCATTG TGGGGGTCGG AAGGAGCAAG
GAATCAAAGC CCGTTCCTTC CTTGGTGATA CCAGATTTCG ATAGCGCGTT CGCACATTTA
AGTCATCACT GGCTAGAGTT TGAACAGATG TTCGGGTCTC GAAGTTCGGT GGCTGAAGCG
CACAAGGCAA TTGCACGAAA GATGCACAAA CTTGGTGAGA ACGTTTCACA TCCTTCATCG
GACCCCTTGC GAGAAAGGAG GAGTGATGCT CCATCTTCTA TGAATGTGCA TATGGTGCAA
GGTCCCGATC GAAAACGAAA AGTACATATT GATTCTGATG ATTTTGAAGG GAAAAGAATT
CGTACAAAAA CAGATTGGGT AGATCGAGAC ATTACGACTG ATGATGACCG CTATGTCGAT
CCGGGGCAAT CAAAAAATGC TAAAAAGCCA AACCAGGATT TTAAATATCA TCCTTTTTCT
GTCCGTGTAT CTGGATTAAG TGAGAAAACA GACGATATGG ATCTGGTGGA TGTTTTCCGA
CCTAAGTGTG GTGAAGTAGT TCATGCGAGA ATCATCCGAG AGAAGGAAAT TCGTCACAGC
TTGAAGGGAA AATCAAAAGG ATGGGGATTG ATACAGTTTG AAGATAGAGA ATCGACCGAG
AAAGCCCTGG CATTGGACGG TATTATCGGT ATACACGAAA AACTTGTTGT GATTGAGCGA
TCTTACATGC CTGCGGTAAT GATTGTGCCG CCCGGAATGC ACAGAGTTCA ACCCAAGGGG
GAGGGAAAGA GCTCAAAAAT AAACGAAAGG CGCAAGGAAC GAGAGCAAAA AACGACGCGA
TCCACAAAGT CAGGATCTGG CCCTGTATTA GGCCCCTTAT CGGAAGATTC CTTCAATCCT
CTGCAATTTC GACCCCGCGG CATTCAGGCA AAACCACGGT CGAAAGTAAG TGTAGACTTG
CAATAGCAAG GCAAGTGCGC AATAGGTCAC TATCACTTCG ACTTGGGTAA AGCCTCTGCC
ACAGTGGCGT TATTTCCGAC ACACCACTGT CGCCAGAATA TTCTTTGAAG TGGATACTAC
CATTACATTG AACTTTTAAA CAAGCTTATC TGCAAAGTCT TTGCAATAGT C
 
Protein sequence
MSYDYDALCR EFAQHRKASM VTENVIERSG HLQKAVQALE TLSSCCPMTP ALWIQYASTA 
AEWISQALLR QEDSCDADSR INKESLQTRL QTLELGLQEF PGYVLLHLHY IELLMHKNAC
TDAPKIESAL RTAIAQVGGG SHRNEGSWVV QLYNHLATFL VKQNRVKEAL QCFVQRARIP
MKDVNDEIAS DYRGFCENHG LTPSTKHLEQ MEQGRRLEAK LFNRYITLED EIDAVMHSQG
ILPRYDVGVD KLDWKIMLHT DRYGMGLGGA DVATAFVKYA LECSNIFKSA ARQVDEDDQD
LEIEEMRRHI SGLASAVFER GVAECPTVDS LWLSYIRFLT EQDNGDSLSL LPSVSQRAVR
NCPYSQALAC QQMDNVLLLA DKGLIVFDPD ALMEQVQTAL DTKFLPNPVQ FLELYLCVIR
IVKRRMLSIL SGVAVTEQSG KAVLRYDEAE PILKSSNARS PKRDGKIDGV LQEVQDLCED
LTDMYEHIEQ KMRKVLGKWS EGRSLLWLER AYTEKYFLNP LRRIFEGSGD SSRSTEDLEM
LLLCEKPVRS HSPPHPDLYL RYIEQYLLSY PVVNATDVLH RLRRTRWLYQ KAIVGVGRSK
ESKPVPSLVI PDFDSAFAHL SHHWLEFEQM FGSRSSVAEA HKAIARKMHK LGENVSHPSS
DPLRERRSDA PSSMNVHMVQ GPDRKRKVHI DSDDFEGKRI RTKTDWVDRD ITTDDDRYVD
PGQSKNAKKP NQDFKYHPFS VRVSGLSEKT DDMDLVDVFR PKCGEVVHAR IIREKEIRHS
LKGKSKGWGL IQFEDRESTE KALALDGIIG IHEKLVVIER SYMPAVMIVP PGMHRVQPKG
EGKSSKINER RKEREQKTTR STKSGSGPVL GPLSEDSFNP LQFRPRGIQA KPRSKVTITS
TWVKPLPQWR YFRHTTVARI FFEVDTTITL NF