Gene PHATRDRAFT_49572 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49572 
Symbol 
ID7198237 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011691 
Strand
Start bp80036 
End bp83199 
Gene Length3164 bp 
Protein Length975 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184301 
Protein GI219128189 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.280952 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGAAAC CCAAATCTTG CGTATCGTAC AGTGTCCTCT ATTACAAGCG GGCGGCCAAT 
ACAAAAAAGG TCTACCAACG CAAAGGCGTG ACAACGTTGG ACGGAGTCTT GACGATTCAT
TCCGATTCCT CCAGGATTGT GTTGCGATCG GCGGATGGGG CAGATGTGCC GACACAGAAC
GACGCGTTGA ATGCATGGAG TAAAGCGGCC AAGGCACCGG GAATGCTGGT ACTGTCCTCG
GTCCAACGCG ACGTTGCACA GCAAACGTGG TCGGAAGACG ACGAATTTAC CGTGGGGCCG
TATCGAATTG AAATCTTGAA GCGCCTGGAT CCGAACGAAA ACTGCAGCCC CGTCATCAAC
AACATTGACA TTGTCCGAGG CCCGTCAGTC GTACACGGCC AGACAGTACG AACGGGAGTC
CCAATGCAGG CTTTACCGTC CAGAACTCGA CTGCCGCTCA AACGCAAGGC TCTACCTTTG
CAATCCACAT CCACTTTGTC TTTGGGGAAA CGAGTAATCG GCGCACTAGC TCGCAAGACA
ATCCCTGCCA CAACTCCAGC AATAGGATCC ATAGCACAAT CTTGTGAACT AGTAAAGAGC
GGAATAGCAA GCAAATGTAC GGTGGACAAG GGGACGTCTC GCACACTTCT GCCCACTGCG
CGCCCATCCC CACCGACGAT GGTCCGCCGG ACTCCGCTGG TGTCCCGATC CAACACTACA
ACCACAACCT CCCAAAGCAA TCTGAAGCGA CGCACGACGC CATTAATGGG GTTGAAATAC
ACGTTGGCGA AAAATCGGGC CAAGCTGGCG TCGTGCCCGG CAACTCGTCG ATACCTACCG
GGCGCAACCA GCACCTCAAC CATTCCCGCG ACAAACACAT CTCCTCCGCA ATCGTCCACC
AATTTACTGG CGAACGTACC TCTACCTGCC TCGGTACGAT CCGTGCTTCG CCCTCATCAA
GAAGAAGGAG TGGAATTTTT GTGGCAAGCA CTGGCACCAA TGGCCGTCTC CGGCAACGAG
CAAAATGACA CCCAAAGTCC CGCAAGGGGC GCTATTTTGG CCGACGAAAT GGGTTTGGGG
AAGACGCTCA TGACTATTGC CATCATTGCC GGTCTGCATC GTCGACAAAG AGACAAGGTG
AGTCCAAACA GCAAAGCGTG GTGTCGTTTG TTCGTACTCG CATAGATGAC TCGCCATGCA
GACAACGCCT TTGTCAGTCA AACCCTCACA GCTCTTGTGT TTTGGTTCGT TTCAGCAATT
CATCGTGGTC TGCCCTTCTT CACTCGTTAC CAATTGGGCC AGGGAGTTTG ACAAGTGGAT
CGGCCGGGCC AGTCAACCAA AACGTGTAGT CATTCAGAAA GGCGGCGAAG AGGGGGTCGC
GGCTATGCGG GCGTACTGCG CTGGCATGTT GAAAAAGAAA AAGCAATTGC AGAAGATTGG
TCAAGTATTG ATTGTCTCGT ATGATTTACT TCGGCGGCAG GTCGAGCATC TCCAAGATGC
CTGTGCATTC GGTTTACTTG TCGTGGATGA AGGCCATCGT CTCAAAAATA CTTCCGGATC
ATTGACCTTG ACGGCCTTGG AATCGCTGAC AGCTGATGCT CGCTTATGCA TTACAGCTAC
GCCGATGCAA AACAATCTTT CCGAATTTTA CAATCTCGTC AACTTTGTTC GTCCTGACGT
GCTCGGATCT CTCAACGAAT TCCGTGACAG TTTTGATCGG CCAATCTCTG CTGCCAATCA
CAAGCACGCC ACACCGTCCC AAATTGCGAC GAGCCGGGAG CGATCAAGTG CCTTGGAGAC
CCTAACCAAA CCCTTTATTT TGAGAAGACT ACAGGCGGAT GTATTGAAGA GCATGTTGCC
ACCTCGAGTG GAAACTCTTT TATTTTGCCG GCCTTCGGAA ACTCAACGCG CTCTTTACCA
CCAATTGACG GCTCGCATTT CGGGTGGCAG TTGTACCGAT GGCGGCACTG ACGCTCTCAA
AACTCTGACA ACGCTGCGTA AAATCTGCAC ACATCCATCC ATCTGCAATG ATGACAATGT
CAAACCATGG AATCGGCCAG AGAAAGGACC TTGCCTCAAG TATGACATTG CTCTGTCTGG
AAAGATGACT GTGTTAGATA AGTTGCTGCA GTCGATTCGT GAGAACGCTC CGAATGACAA
GATTGTGGTA GTTTCAAACT ACACTTCCGC CTTGACAATC GTGGAGTCCC TCATTCTCGG
CCCACGTAAG CTTGGCTTTC TTCGTTTGGA CGGTGGTACC GAGTCATCAC AGCGACAGCC
ACTCGTAGAG TCTTTCAATC GCTCTCATCC AGAGAAGGTT TTCTGTTTGC TCCTATCGTC
CAAGGCCGGT GGCTGCGGTT TGAACTTGGT AGGCGCGAAT CGTCTCTTGT TGCTCGATCC
AGATTGGAAC CCGGCCTCGG ACGTGCAGGC CATGGGACGC GTCTATCGAC AAGGCCAGAC
GAAGCCGTGT TGGATCTATA GACTTTTTAC CACTGGCACC GTAGAAGAAG TCATTCTACA
ACGCCAATTG CAGAAAGGAA ATTTAACAGC GTGGACAGTT GATGGTGGAA AGAGCTCACG
ACAAAACAGC TCGGATTCTC GAGCTAAATT TTCAAAGGAA GAGTTGACTG CTGCATTCAC
GCTCAAGGAC GAATATTGTA TCTGTGATAC AAAGCAAAAA ATGGGCCTCG CCTGGCCTGC
GTACAATCGA GGGACCCTTT CGCAGTATGA CGATGCTCCT ATGAAGGAAA CGGCCATGTC
CTTGTCCGAG ACTTTGAGTT TCGTACATGT AGTCAACGAC GATGCATGTG CAGAAGAAGA
ACAGGACGCT GCTTTATCCA CTGCAAGCAG CACCCATTCT GCTTTGGTCA ATTTTTCATC
TAACGAAGTG GTTTCTGATA CCAAACAAAG TAACTGCAAA AAACATTTGG AGGTCTTCCA
TTGCGATGAT TCGTTGGCAG TGGGCGATAG TGATTCCGAA GAAGAATTCT GAATACTAGC
TGCAAAATCA TCCTTGATAC TTGCTTGGCA AGTCTATCTG TACTTGGCCT CTTACATCGT
CAACTCATTT TCTTGCGACA GCACTGTTTT CGAAAGATAA TGGGAGTTAA GATGGGTACC
GTCACCTTTC ACAACTAATA TAGCTTGCAC TGTTATCTTT GTTG
 
Protein sequence
MTKPKSCVSY SVLYYKRAAN TKKVYQRKGV TTLDGVLTIH SDSSRIVLRS ADGADVPTQN 
DALNAWSKAA KAPGMLVLSS VQRDVAQQTW SEDDEFTVGP YRIEILKRLD PNENCSPVIN
NIDIVRGPSV VHGQTVRTGV PMQALPSRTR LPLKRKALPL QSTSTLSLGK RVIGALARKT
IPATTPAIGS IAQSCELVKS GIASKCTVDK GTSRTLLPTA RPSPPTMVRR TPLVSRSNTT
TTTSQSNLKR RTTPLMGLKY TLAKNRAKLA SCPATRRYLP GATSTSTIPA TNTSPPQSST
NLLANVPLPA SVRSVLRPHQ EEGVEFLWQA LAPMAVSGNE QNDTQSPARG AILADEMGLG
KTLMTIAIIA GLHRRQRDKT TPLSVKPSQL LCFGSFQQFI VVCPSSLVTN WAREFDKWIG
RASQPKRVVI QKGGEEGVAA MRAYCAGMLK KKKQLQKIGQ VLIVSYDLLR RQVEHLQDAC
AFGLLVVDEG HRLKNTSGSL TLTALESLTA DARLCITATP MQNNLSEFYN LVNFVRPDVL
GSLNEFRDSF DRPISAANHK HATPSQIATS RERSSALETL TKPFILRRLQ ADVLKSMLPP
RVETLLFCRP SETQRALYHQ LTARISGGSC TDGGTDALKT LTTLRKICTH PSICNDDNVK
PWNRPEKGPC LKYDIALSGK MTVLDKLLQS IRENAPNDKI VVVSNYTSAL TIVESLILGP
RKLGFLRLDG GTESSQRQPL VESFNRSHPE KVFCLLLSSK AGGCGLNLVG ANRLLLLDPD
WNPASDVQAM GRVYRQGQTK PCWIYRLFTT GTVEEVILQR QLQKGNLTAW TVDGGKSSRQ
NSSDSRAKFS KEELTAAFTL KDEYCICDTK QKMGLAWPAY NRGTLSQYDD APMKETAMSL
SETLSFVHVV NDDACAEEEQ DAALSTASST HSALVNFSSN EVVSDTKQSN CKKHLEVFHC
DDSLAVGDSD SEEEF