Gene PHATRDRAFT_42856 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_42856 
Symbol 
ID7196446 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp1325975 
End bp1331557 
Gene Length5583 bp 
Protein Length1366 aa 
Translation table 
GC content46% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177270 
Protein GI219111037 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.875616 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATAAATGTAG ACAGGAAACG ATCTGTAAAG CTGATATAAA TGCGGTGTAA TCATATGCAT 
ATGCACGCTC GTATACTGAC TTTCTTCGCC GTTCCTATTC CAGATGAGCA TTGGGCCCAG
GAAATCAATC CGTACATACA ACGACACACT TACACTTAGG AAATAGTATG GCTGCAAGGA
TTTTGACCAA TATTTTCACA TATTGCCTCT TTCCAAACAC CACATCTGAA CTCTTTTTGT
AGAATTCGAC CATTGACTGT GGTGAACATT GCCGATCATC CTTTGAAAAA GCGTGATACA
GTCCCCGACT CATCTGCTCG ATAGCAGGAA GGCGATCTCC TCTGACGAAA GTGATCTACG
TACGTTTTGA TATTGTAGAA GAGAGAACAT TAATTGACTG CGATTGTACT CAGATCACTA
CTATTCGACA AAGAAAGACT CACTTTGAAG AGTCCTACCT TAGAATAAGA AAAGAAATGA
AGACTGAAAA TGGAAGAAAG CTGAACAGCT GGTATGTTCT GTTCTGTTTT GTGTCATCTA
CAATGGCTGA GTACACCTAC TATGATAACG AAGGATATCA AGAAGGATAC AACACCAGCC
ATACGATCAC CGGAGGTGAC GGCATGGAAT ATTACAGCAA TACCAGCAAT AGTGCTACCA
ACAGTTCGGG AGGTGACATT GATTATTGGA CCAACTACGC CATCTTCCCT AAGCGATGCA
TAGTTTAGTA AGTTTCTTCG ACCGGTGGCA ACTCCGTTCA CTTGTCCTCC TTCTTTTTGC
CGCTCTATCT CACCTTTCCC GCACTGCCTT GTCCTCATCC TTGTGTAGCA AAAAGACAGA
CTACATCATG TACGAAATGT TTGATCAGCA ATATTGCCAA GAAGAAAACC GATTGGGAAC
CTACATTTCG CGCGTTCCTG ACTACATGAC GGGCCATCTG CAACAATTGG CCGAACAGAT
GTCGGATCTA GGTGTCGACG ACTACACCAA ACCAGAGGTG GCGCAGTACA TTGAATGTAC
ACCTTTTCAA ATCCAAGGCG CCTATTACTA TTTCCAGATT GGGTGCGCCG ATGGCGTCAC
TCAAAAGCTT GCTGTAAATA TTTACTCCGA CAATACGTGC TCAAAGAAAT CAACTGTGGA
CGGCTTCGAT GATTCTGTCA TTGACGTTTC GGCTTTGAAT GTAAGCAGAT GCAGTTTTTT
TGTCCTGGCG ACGGATTGGC TTGTTGTGTT GCTTAATGGC AGTCTATTTA TTCCCAACAG
ATTCCTTTCA AACAATGCCA AACGTGTGTT AACTGGGTAA ACGTGGATGG GGCTGACGAT
CAATTCTACA TCAAGCGTCA ACAAAATGCT CCTCTTTGTA AGACTGCTTG GACGTACAAG
AAGCATTGTG GGCGCCAATG TCAATCCGTT GGTCGCAATG CGGAAGTCGA AGGCTGGAAT
GCTTCAGATA AAATTCTAAT ATCGGTGCTT TCCTCTTTTG CTCTCATTCT GCTTGGATCA
ATTGCCTTCC GGAGACAGAA GATGCCCAAC AAGGATATAC TCTTGGAACA AGCATTTATA
AACTCAGCGG GACTTCGGCA ATCACAGATT TTTGTCATTA TCATCATCGT CATGGCAGTT
ATCGCAGTCC TCATCATGCT GGGTTTGAAG GACGCGACCT GGGCTTTGCT ACTGGTATTG
AATACTGCTC TGTTTGCTTA CCTTATGAAG CTGACACTGG AAAGCAGTGT CAATACCGGA
GAAATTATCA TCGGACCAGA TGGCACTATT CTACGCAAAG ATTCCGACGA TTCTTCCATA
GAGAGTACAA GTAACCCCAA CAATGGTACC TACATGTTGC CAACGTTGAC ATGATCGAAT
TGTGTACTTA AATGACATTT AGGTTTTGTG CCCTCGACTG ACGCAATACC TCGTGGACCT
GTCGATCCTT TTGTCCTCGC TCGTTCCCCT CTTGTTTCTG TCATTAAGTT AAACAAGGCG
TCGTGCAAAG TCTATAGTAG TTGAAAACGG CTAAAGAGTG AAATATTCAA TAGGCATGCT
TTCCTAAGAC GGAATAATTG GTATTTGATA AATTTTCATC TCGGCTGAAA ATCCCTGCTG
TTGGTCTTGG CCATAGGGCT CGAATTTGTT CCGCACTCAT GTTTTTTTTA TTGCTGTGCA
CACAGAGTCA TATAAGACAA GCATCTGATC CTCTCTATAC TATCAATTGA TTGTAAAAAG
TTTCATGTCT ATTAGAATAC TGCAACTTTG TTTGCAACAT TGTGTGTTTC ATCCGAGCAA
CATTGCCTAG CGCGAGGAAT TGAATCAATT TTTTTGCTGA CTGTGATTGT CTCGCTTGGT
TATTAGATTG AACCGCAGCA GTGTTGATCG ATCCAAAGGA AGAGCTTTCC GACTCAGTAT
CAACCACACT CCCTTTAGCG AAAGCTTCGT TGGTTGTTTC CATACTGTCG CATTCACATC
AACAGGAAAA CGCCTTGGTG ACGCCGCTCT TGCGGTCAGA TCATCATCAC GAACTGAGTC
TACAAGCAAA TTCCGAGGCT TCCGGCCACG TCGCCGAAGC CGATATCGAA GGGGAAGAGG
TTGGAGGTAC CAGCATGAGC AGCACTTACC ATGATCCCCG ATTTATTCGA AATCCAGTTA
TTCGGGGACT ACTGCTGCGT TTGCTGCGTA TGCAATCAGC GGCCGATTCC TTACCACACC
TAGTTCAAAT GATCGTGTCG AATGGAAAGA TTACAACTGT ATCATTCGTT GCTCTCTATC
TCGTGTGTTT AGTCCTTTGG CTACCTTTCT GGCTGCTTGC TTTGCTAGTC ACGGAATGGG
GCGTCTATGC GCTGTCTGTT GCGGGAGGGT ATTTCATTGG ACGATGTATT ATTCGCATGA
TTGCTTTTCC TGGGGCTTCC CGAAAAGTCA GTACCGATAT TGAGAAAGAA TTTGCCAAAT
ACTCAGTCCG TATGTTGCAG TCCGGTGTCC AAAGCTTCGC GGAAGTCGCT TCGATCATGT
CAGCCAATCC GAACCATCCC CAACGAGTGA GTGTTTTGAA GGGGTACACA CTGCCGTCTC
TTTGGAGTAG GGCCAAAACC TATCGAAACC GTGTGTTGGG AGTATACTTG GAAGTTCTTT
TGCACATTTA CCAACAGGCA CCAGAATCAT CTACTGGGGC ACAAAGAGGA TTCACAAAGT
ATGGCAACAA CGTTTTATCT GGAGATATTG GGAACATTGC CGGACTATCG GTAAGTTTTC
ACGTTGTCTG CGCTGCAATT GTGTTCGTTA TTCTCAAAAT AGATAAAAAT GCTTCTACTC
TATCCTGTAG GCCCAAGCCC AAAATGATGG GCTTGGTCTC ATTGAACAAC TCAAAACGGT
ACTGGCTTTG GTGGACACTT TGGAAGAGCA AGCGCATACG TTCCTTGAAG GTAGAGCAGC
TGTTGTAACG CCAAATGATC TTCCGGAAGA TGCACGCCGG ACGGCGCAAC ATCTTTTGGC
CCGGTCTCAG GAGCTTACCA ATTTCGTCTC GTCCTTAAAA CCTCCATCTG ATAGCAGCAA
TGAAGACATA GAGGATGAAT CAGAAGAAGA CTTAACGGTT GACGCAGTAC GCAGAAAACT
AGAAAGGCAG CATGGTTCCA CAAGAGAGGC TATTAAAAAA GGAATTGCTT CTGTTATCCC
CCTGTTGGAC CCCCCACCCC ACAACTCAAT ATTCTCGTTT GATTTACAAC GTGGTTGCAT
GTTGAGTCGC TATCGAGGTG CTCGCCAGCT TTGGGTGCGT CGTCCTAGCG GCGGCATGTT
GGATGTTTTG CATTTTCCCG CTCGCGATCG TTGTGCGGAT ACTCAGCGGA ACTCCAAAGC
ACTTCTTTAC TGCAATCCTA ACGCAGGTTT AGTTGAAGTA GCAGCCGGGA TGAGCCTAGT
TGGTGGGAAC GTGCCGTCAA ATGAGGGTAA TGAACAAGCA TCCGATGGGA GCTGGGTGGA
CTTTTACACC TCCGTCGGCA TAGATGTCTA TGTCTTCAAC TATGCAGGAT ATGGACGAAG
CTTTGGGTCG ACAACATGCC TAAAAGGGAA CAGCGCGATC GATAGCTATA CTCCCGGTAT
CCTACCCCGA TTGATTCGAA TTATTCGCTC TACATTTTTA ACGTTCACGC CAAATCCAGA
CACTCTCCGC GATGACGGGT TTGCCGTTGC GGTACACTTG CTCAAGGAAG TGGGTATAAA
ACAGCTTATA ATCCATGGAG AGAGCATTGG TGGGATGGCT GCATCAAGTA CCGCTCGTCG
AGTCTCCCAC GAGCCGGAGC TACAGGATAA GCTGGCTCTT CTGATTTGCG ATCGAACGTT
TTGCAACTTG GAGGCTGTAG CTCAGCGTCT CGTCGGAGGG TGGACAGGAA ATGCGATTCG
GATGTTAGCA CCTTTCTGGA GTACAGATGT AGCGGGCGAC TTTTTCGCAA CAAACTGCCC
CAAAATTGTT GCCAATGATG CAGCCGATGC AATTATTTCT AATGAAGCAA GCCTGAAGTC
TGGAATTTCG CTTTGGAAAG AATTGCATCG CGGAATCGCT TCCACAAAGG GAATTGGTTG
GATGACGGAA GCACCCTTAC AGTACAGAAT GGCCGATTGG GAGAATGTCT GTGTGAATGA
TTCGAAGTAC GTTACAGCAC CAGGAGTACT TCGATCTCAA GCACCGACAT GGCCTCACGA
CAAGCACATA TCCGTCGAGG AGGCTTTTCA TTTTGCGGCA TGCTGTAAAC GGATTGGAAA
GTACGCTAAG GCTTTCAGGA AAGGCTCGGA GGGTGGCGAT TTGAGTGGTT TGCACCCCGG
CTCTAGGCGT GCTCTAATGG AAGCATGGCA GTCTCTGGCA TGTCTCGACG GTCTCACTGG
AGCACCGCTC GGTGTTGCTG TCAAACAAGG GTTCGATACG ACGGTAGCTT GGCTATGTTC
TTTTCTGATT TTCGGTGCTC AATCAATTGT AGCTGTCGCT GAGCGACGCA CGGAGGGTGA
CGTTGGGCTT CGGGAGCAGC TAGCAATTGT CCCCTCTGAC TTTGATTCCC GACCAGCCGG
CTTTGCAGCT GCAGAAGAGG GAGGCATGGT GCATCCGAAA CCACTTCCAG AAGTTATTGA
ATCCCTTTTG TCATTTCAAG AATCAGGAGA TCCTTCCCTC AGAGACTGTA AGTTGTGACT
GTCGTTTCGA TTTAGCTACA GGGCTATCCC TTATCTAACC TAAGTGTATT CCTCTCTTCC
AGTGTCCCAT GAATTTCAGT TTGTTGTTGG CGTTCTTAAG TATTTGCAAG GCCGCCTGTC
GGCATCCACG AGTATTGAAG CTGCGCAAAA AAGCCGAAAA TTGCAAGTTT TTAAGGAAGG
TGTTGGCTTC TTGTTGAACC TACATTGTGG ACACAATAAT CCCTTTTCCA AGGAGGAACA
ACTGCAGCTG AAAGGACTTT TGGATAGGGC CATCGGAGTG AAAGGAGGTG GATTGGTGTA
GTCTTTGCAT TTACCAGAAA ATACAGTAAG CCATACTTCA CGCTGGCTTT CGGAAACGAG
TTTGAGATGT ACACCGTTTT TGGCTGGCTT ACAGTTATTA CAAATCGTTC ACAACACGCG
TTT
 
Protein sequence
MKTENGRKLN SWYVLFCFVS STMAEYTYYD NEGYQEGYNT SHTITGGDGM EYYSNTSNSA 
TNSSGGDIDY WTNYAIFPKR CIVYKKTDYI MYEMFDQQYC QEENRLGTYI SRVPDYMTGH
LQQLAEQMSD LGVDDYTKPE VAQYIECTPF QIQGAYYYFQ IGCADGVTQK LAVNIYSDNT
CSKKSTVDGF DDSVIDVSAL NIPFKQCQTC VNWVNVDGAD DQFYIKRQQN APLCKTAWTY
KKHCGRQCQS VGRNAEVEGW NASDKILISV LSSFALILLG SIAFRRQKMP NKDILLEQAF
INSAGLRQSQ IFVIIIIVMA VIAVLIMLGL KDATWALLLV LNTALFAYLM KLTLESSVNT
GEIIIGPDGT ILRKDSDDSS IESTSFVPST DAIPRGPVDP FVLARSPLVS VIKLNKASCK
ENALVTPLLR SDHHHELSLQ ANSEASGHVA EADIEGEEVG GTSMSSTYHD PRFIRNPVIR
GLLLRLLRMQ SAADSLPHLV QMIVSNGKIT TVSFVALYLV CLVLWLPFWL LALLVTEWGV
YALSVAGGYF IGRCIIRMIA FPGASRKVST DIEKEFAKYS VRMLQSGVQS FAEVASIMSA
NPNHPQRVSV LKGYTLPSLW SRAKTYRNRV LGVYLEVLLH IYQQAPESST GAQRGFTKYG
NNVLSGDIGN IAGLSAQAQN DGLGLIEQLK TVLALVDTLE EQAHTFLEGR AAVVTPNDLP
EDARRTAQHL LARSQELTNF VSSLKPPSDS SNEDIEDESE EDLTVDAVRR KLERQHGSTR
EAIKKGIASV IPLLDPPPHN SIFSFDLQRG CMLSRYRGAR QLWVRRPSGG MLDVLHFPAR
DRCADTQRNS KALLYCNPNA GLVEVAAGMS LVGGNVPSNE GNEQASDGSW VDFYTSVGID
VYVFNYAGYG RSFGSTTCLK GNSAIDSYTP GILPRLIRII RSTFLTFTPN PDTLRDDGFA
VAVHLLKEVG IKQLIIHGES IGGMAASSTA RRVSHEPELQ DKLALLICDR TFCNLEAVAQ
RLVGGWTGNA IRMLAPFWST DVAGDFFATN CPKIVANDAA DAIISNEASL KSGISLWKEL
HRGIASTKGI GWMTEAPLQY RMADWENVCV NDSKYVTAPG VLRSQAPTWP HDKHISVEEA
FHFAACCKRI GKYAKAFRKG SEGGDLSGLH PGSRRALMEA WQSLACLDGL TGAPLGVAVK
QGFDTTVAWL CSFLIFGAQS IVAVAERRTE GDVGLREQLA IVPSDFDSRP AGFAAAEEGG
MVHPKPLPEV IESLLSFQES GDPSLRDLSH EFQFVVGVLK YLQGRLSAST SIEAAQKSRK
LQVFKEGVGF LLNLHCGHNN PFSKEEQLQL KGLLDRAIGV KGGGLV