Gene PHATRDRAFT_43097 details


Gene Information

Locus tag: PHATRDRAFT_43097
Symbol:
ID: 7196876
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011669
Strand:
Start bp: 2026885
End bp: 2029163
Gene Length: 2279 bp
Protein Length: 671 aa
Translation table:
GC content: 51%
IMG OID:
Product: predicted protein
Protein accession: XP_002176892
Protein GI: 219110281
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TCAGACTCAC CTCATCGGAC GGACTGGACG GTACTTGCTC CTCGGAAACA GGATACACCG 
TCCTTTTGGC AAGTGACGGT TACTTGACGC CAATTGCTCA CATTTAGGTT AACTTAGAAG
CCATTGGAAT TTGAAATTAT AGGAAGGCTT TTCATTAACA TCGGAAGATT TTTCCTTTTC
CTCCCACTGA TCCAAGTGCC TTTGTCTTTT TGATTTCATG GCGGCAATCA TTGGCAATTC
ATTACTTGTA CCTCGGCGGC TGGTTCTTAT CGCAATGGCT GTCTCGGTTC TATTGTTCGT
TCTCCGACCC GTGAGTTCTT TTTCCTTTAG ACCCGTGGGA CGGTCTTATG CCGCGGCGAT
TACGCAAAGC AATGGTCTAC GACGGAGCAC ACGGGGGCAC GCTGTCGGTG TGTCGATTTC
CCGGAGGGAC CCCGATGCGA TGCCCTCTCG CCTCTTTTCG TCGTCAACGG ACAGCAACAA
GGAAACAAAG GCGTCAGTAG AAGAGCAGAT CAAAGTGAAA GGAGACGAGA TTCGAGCGCT
CAAGGAATCT GGAGCAGACA AATCAACCGT TGCCCCTTTA ATTGACGAAT TGCTCGCTCT
CAAAGCAAAG CTTGATCCTT CTATTCTCGA ACCCCCAAAA AAGGCACCGA AAGCCCAAAC
GCAGCCAAAA AAGCAACAGA ATCAATCCGG TAAGAGGGAG AACGATGATT CTGATTTTAT
CACTGCTCGT GAAGTGGATT ATTCGAAATG GTACAACGAT ATCGTTCGCG TCACTGGCCT
CGCTGAAACT TCACCAGTTC GTGGCTGCAT GGTAATCAAG CCGTGGGGCA TGTCACTCTG
GGACCGTGTT CGTACCGAGC TTGACGCCAA AATTCAAGCA CATGGTGCCG AAAATGCTTA
CTTCCCTTTG CTCATACCCC AATCCTTCCT TTCCAAAGAA GCCGAACATG TTGACGGTTT
CGCCAAGGAA TGCGCCGTAG TCACCCATCA CCGTCTCACG ACTAACCCAG ACGGCAGCGG
TTTAATGGTC GACCCCGAAG CGGCTCTCGA GGACCCCCTC ATTGTTCGTC CCACTTCCGA
GACCATGATT TGGTACATGT TCCGCAAATG GATCGTCTCC CACCGTGACT TGCCACTCAA
AATCAACCAG TGGGCCAACG TAATGCGTTG GGAAATGCGG ACCCGACCGT TTCTGCGGAC
TTCTGAATTT TTGTGGCAGG AGGGACACAC AGCCCACGCG ACACGGGATG GAGCCATTGC
GGATGCCCAA GCTATGCTTG ATAACTATGC CACATTGTGC GAAGATTTGC TGGCCATGCC
GGTAGTACGC GGTGTGAAGA GTCCATCGGA GCGCTTCGCA GGCGCCGAAG ATACGTATAC
AATTGAAGCC TTGATGCAGA ATGGTTGGGC CTTACAGTCC GGGACATCGC ACTTTCTGGG
GCAGTCTTTT GGTAAGGCCT TTAACGTGAC GTTCCAGGAC GAGAATGGTA CGCAGCAAGA
TGTGTGGGGG ACCAGCTGGG GTGCCTCCAC TCGATTAATT GGTGCTCTTA TCATGACGCA
TTCGGATGAC GCTGGTTTGG TCTTACCACC GAAAGTAGCT CCAGCTCAGG TTATAATTGT
CCCAATCCCT CCAAAAAAGG ACGACGCAGA GACGAAACAA GCCATGGATA TTGCTATGAA
TCAATTGACG GCAAGCTTGA AAGCTGAAGG TTTGCGCTTC AAGGTGGACG ATCGTGATTT
CGTCCGCAGT GGTGCAAAGT TCTTTGAATG GGAACGCAAA GGTGTGCCTC TGCGTATTGA
AATTGGACCA CGAGACGTTC GCAACAACGT CTGCGTCTTC AAGTACCGTG CGGGTGAGAA
TGCTGACGAA AAGCAAACGA TTCCGCTGTC CGAAGCGGCG GCATCCGCAA CGGCCGGCTT
GAAAAGTATG CAGCAAGACT TGTTGGAAGC GGCAAAAGCG AGATTGACCA ATGGAATTAC
GACGGACACG ACATACGAAG AAATGAGAAC GTTTTTGGAA GCCGACGAGG CGTCCGAGTA
TTCTGGGAAG GGTCTGTTCT TGGTGCCGTG GAAGTGTGAC GCCGAGAACG AAAACAAGAT
CAAAGAGGAA TGCAAGGCTA CTATTCGATG CTACCCGCTT GACGCAAACA AGCAAGGTTT
GCACCAAGGC AAGAAATGTT TCTATAGCGG CCATGACGCA ACGCACATGG CACTGTTTGG
AAGGGCGTTT TAGCGCTAGA AAGGAATTAT AACAAGACTG TAAATCGCAT CGTGCTAAC
 
Protein sequence
MAAIIGNSLL VPRRLVLIAM AVSVLLFVLR PVSSFSFRPV GRSYAAAITQ SNGLRRSTRG 
HAVGVSISRR DPDAMPSRLF SSSTDSNKET KASVEEQIKV KGDEIRALKE SGADKSTVAP
LIDELLALKA KLDPSILEPP KKAPKAQTQP KKQQNQSGKR ENDDSDFITA REVDYSKWYN
DIVRVTGLAE TSPVRGCMVI KPWGMSLWDR VRTELDAKIQ AHGAENAYFP LLIPQSFLSK
EAEHVDGFAK ECAVVTHHRL TTNPDGSGLM VDPEAALEDP LIVRPTSETM IWYMFRKWIV
SHRDLPLKIN QWANVMRWEM RTRPFLRTSE FLWQEGHTAH ATRDGAIADA QAMLDNYATL
CEDLLAMPVV RGVKSPSERF AGAEDTYTIE ALMQNGWALQ SGTSHFLGQS FGKAFNVTFQ
DENGTQQDVW GTSWGASTRL IGALIMTHSD DAGLVLPPKV APAQVIIVPI PPKKDDAETK
QAMDIAMNQL TASLKAEGLR FKVDDRDFVR SGAKFFEWER KGVPLRIEIG PRDVRNNVCV
FKYRAGENAD EKQTIPLSEA AASATAGLKS MQQDLLEAAK ARLTNGITTD TTYEEMRTFL
EADEASEYSG KGLFLVPWKC DAENENKIKE ECKATIRCYP LDANKQGLHQ GKKCFYSGHD
ATHMALFGRA F