Gene PHATRDRAFT_37572 details


Gene Information

Locus tag: PHATRDRAFT_37572
Symbol:
ID: 7202424
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011681
Strand:
Start bp: 451561
End bp: 455246
Gene Length: 3686 bp
Protein Length: 977 aa
Translation table:
GC content: 47%
IMG OID:
Product: predicted protein
Protein accession: XP_002181729
Protein GI: 219122805
COG category:
COG ID:
TIGRFAM ID:
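
The coordinate and length fields in this record are related by simple arithmetic. The following is a minimal Python sketch of that check, assuming the start and end positions are 1-based and inclusive (an assumption, though it reproduces the reported 3686 bp). The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


length = gene_length(451561, 455246)  # Start bp / End bp from this record
cds = coding_length(977)              # Protein Length from this record
print(length)        # 3686
print(cds)           # 2934
print(length - cds)  # 752
```

The 752 bp difference between the genomic span and the coding length is consistent with the record marking the gene as spliced.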


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.421794
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCGTCGC TGATTGCGAA AAATGGAGCG CTCCTCTTTG CGCTGATAGG ACTTCCATAT 
TGCCAGGCAT TCATCACTAT ATGTCCAACC TCCACATGCA AGCAGACACT CCTACGCGGA
ACGCTTCCAA GTGAAGAAAT CCTTGCAGAT AAGCCCGAAG CTGACAATGA ATATTCGTCG
CAAAAGAGTC GTCGGGAAAT GTTGGCCTCC GCAGCATCTT CCTTGGCTTA CTTTCCAGTA
TATTCATCCC TGGAAGCTAA TGCTATAGAA AAGAAGGAAG CTGTCTTCTT CAATTCCTTG
TCGGATCTTC CTCCTTTAGC TGACGACAAT GTACGTATTT TTCTTTGTCG ACATGGCCAA
ACGGAAAACA ATCGCCTCAA GCTAATTCAA GGATCACGAC TTGATCCTTC CATAAACGAA
ACAGGGCAAG AACAGGCTCG GAGACTGGGG AAAGCACTAT CTTTTGCGCT TCCGGTTGTC
CCTACGATCG TTTTTCATTC GCCCTTGATT CGCGCCCGCC AAACGGCCCA AATCGCGGCT
TTGCAGTTTT CCAGCAATCC ATCCGTCTCG TCGCCGACCC TCCGTCAATT AGACAGCTTG
AACGAAATCG ATTTTGGAAG CGCTGCTGAA GGCGAATCAG TCGAGCCTTA CCGGGCGAAG
ATGATGGCTA CCTATGCTGG CTGGTCCGTA GGAGAGCTAG ACCTCAGCAT GGGGGAAGGC
GGCGAAACGG GAGGGGAGGT ACTGGCACGA ATCGAAAAAT CGTTACAGGA CCTCGCCAAG
AGTGCGTCCA ACGCTTCAAA TCGATGTGTT GCAGCCATAG CGCATTCAAC CTTCCTCAAA
ATACTGTTGG CCACGGCCCA AAATATTCCT TTGGCGCAAG TAGCGATGTT GGAACAAAAA
AACTGTTGCG TCAATGTGCT CGATTTAAGC ACCAAGCAAG CCATAAACCT GGCATCTAGG
AGCGAGTTAC TAGGAGGCCC CCTCTCGCTA GCACCATTGG AGTTTACGTT GTCGATCCCC
AAGACAGCCG TAATCCGGAT GAACGAAAAG CGACACCTTG GTGACCTTGC TATATGACGC
ATAGTTTATT GGGACAAGAC GCTAATCTAA TATATTGTCA CGTGCGTATT ACAATCTATC
GTCAGATTGC CTCAATCAAA TTTGGACAAC GATTGCCGTA GCTTGTTGAG CTAGGTCTCG
CCAATCAAGA AAGGCCATTG GCAGTTTGGT CGGCAAATCA AGCTGACAGT GAAAACTGAC
GTCATACCTC AGCCATTGGA TCTTTTGATT TCCAGTTAGA TCAGTCAAAT CCGATGTTAC
GATACTGCAG CGAAATTTAC CCGTGATTGA AGTCTTTAGA CGGTTCATTT TCGAGAAAGT
GAGGAAGTAA AGATCGTATT AAACACGTAC GCAATTGAGG GTAATGATAC GATCACTTTT
CTGTCTGTTG TACTCAGCTG TACCGTCTTT AGGCGTCGCC GCTTTCCAAA CGACAGTCTC
CAATTCTGCG ACTCGACGAT TCCATTCGGC CGGATTACGG ACGACCCATG AGGAGAGCGA
CATAGATTCA CGTAGGGACT TCCTGCTACA GACGGTTTCA CTACTATCGG GTGGCGTTGC
GTTGTCAGGT TCTCCAGACA GTGCCACAGC TGTCGTTGGC GCGTTACCTG AGTTTGCCGA
TTCCAATGCA ATCTTGCAGG GTGTTACTAT AAAAGTAGCC GATCAGTCGC AGCAGGAGGC
CATGGTTTTA TTTTTGAAAG ACTCTTTTGA TTTCGAAGTT TTACGACGGC GTGTTCAAGG
ATCAATCGAA GAAACATGGT TAGGCTACGG TCCTGAACAG CTGCGAATAC CAGACGACTT
CACCCTTCCA GTGTCGTCCT TCAACACTTA CGGAGGCCAT GCTTCCGTTC GTCTCGTATA
TGACGCCAAG GCGACGGTTC CCTTGTATCG GACGGGAGAG AAAGCGCCAG GCGAAAATTT
TGCTTTTCTG CAAGTTGCTG TTCCAGGCTA CCGAATTTCG CAAATGGTTA AGCACGGTGG
CAACATCATT GACGCTTACG GTTTTGTCAA CGTGGTTTCA CCATCGGGAC TACCAATGCG
AGGGATTGTA GGGATCACCC CGGACCCAAT AATGTTTGTG GCAATCAATT GCATAGACGT
TAAGGCCAGT CAGGCCTTCT ACGAAAAGCT GGGATTCCAA AAGCAAGAGT ATCCGTATGC
ACGACCCTCG AAGGGAACGG GACCGTTTGA GCCAGCACAA CCATCAAAAT CGGTTTACAT
GGCGCCTTCC GCTAACTGTA TGGGATTACT CCTGCTCCCG TCGAAGAAAA AACGTCTTCA
AGCCAACCCT GTTGTGCAGT CTTTGAATCT CGTATACACG CCATCGGAAG AATCCGATTC
TGCTGATACC ATGCCGACAC TATTTGATCC TTCGGGTATT GCCGTTTCTT TCCAATCTGT
TCCTCAGTTT GAGCTAGAAG AAAAAGAAAC TAGGTAGGTT AAAGGAGAAA ATTGAAAGCT
CCTATACCGA GTCTGAAGCA CTGCTAATCG TCAAGAATCG ACTAGATTCG CATCAACTGC
AAACGACCAT TTCCTATCTG TAGCAACTTT TCCTTAGGAA CATAATATGA CCTCGTCGGT
ACCTTCTTGA TTTGTGATTC CAACCATTGT TCCAGTTCGG TCGCTGTTGA ACCAAAGACA
GCTCATAGTT AGATGTGGCG CCCTTGATTT TCGAAGGAAA ATCTACTGTG ATATATCAGA
TTTGCACCGG GCAACACAAA TCTGATTTCG GAAGTCTTTT TACGTTGCAC AGTACATTCA
CGATGGTTCG TAGAATAGTG AATAGAATTC TCATCTGTGT CGCCGCCTTA TCAAACGTTG
CTTCCTTTCT TCCGGCTTCG CTCCCGAGGA GTCAAACAAA AAGCAAAGTA AAGCAACATT
TGGTACCAGT GGATGCTTTC AGCTCTTCGC TGGTAACCTC TGTGGTGGAA TATTTTGATG
GTAGTACAAT TGTAGACCCT ACCATCGTTT CCGATGTGTA TTGGCAATCA CTAGGTAGCA
GAATTGTTTC CGTGGTCATT GCACAGGTTC TAGTTATCGC CACTTTTGCC GTTGTTTCTT
GGGTTGCTTC GAAGCAAATT GGTAACACTA TAAATTTCAT TGCCCTCAAG ATTTTTGGAC
AAGACAGACG AACGGAGGCA AAGAATATCG ATGGAGCTTC CGGCCAGCGG CTCAAAGTAC
CACCAAACTT ATCGAATGTG CCACCTCGTA GTCCCGACTT TGGCAAGCTT TTTGTTTGCG
TGATAATTGA CATTGTTGGG TCGTCTTCGG AGCTGCTTCC CATCATCGGA GAATTGTCGG
ATGTTGTGTA CGCACCGATT GCGGCACTCT TGTTACGCAA ATTATACAAC AGTAACGTTA
TCTTTGCGTT GGAATTTGTG GAGGAAATTT TGCCCTTTAC TGATATTTTG CCCCTCGCCA
CGATTTGTTG GACTGTCGAT ACCTTTGCAC CCGATTCGGA TGTTGCCAAG TTTTTGAATG
TTGGCAATTA CGGGAATTCT CAGGCTCGTG CTAGCACTTC CTTTGATGAT GGGATGGACG
CAATTGACGT AAATGGAGAA GTGAAATCTC CGGCAAAAAA AGCTTCTACA ACGATAGCGA
GTAGAGACGA CGGGCAACTA AGGTGA
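
The gene sequence is displayed in space-separated blocks of 10 bases, so any computation on it should strip whitespace first. The following is a minimal Python sketch, assuming GC content is defined as (G + C) divided by the total number of bases. The page does not state how its 47% figure was derived, and `clean_sequence` and `gc_content` are illustrative helpers rather than anything provided by the database.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C; assumes an unambiguous A/C/G/T sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed line of the gene sequence, used only as a small example.
first_line = "ATGCCGTCGC TGATTGCGAA AAATGGAGCG CTCCTCTTTG CGCTGATAGG ACTTCCATAT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_content(seq):.1%}")
```

To check the whole gene, pass all of the sequence lines as one string to `clean_sequence` and compare the result of `gc_content` with the GC content field of the record.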
 
Protein sequence
MPSLIAKNGA LLFALIGLPY CQAFITICPT STCKQTLLRG TLPSEEILAD KPEADNEYSS 
QKSRREMLAS AASSLAYFPV YSSLEANAIE KKEAVFFNSL SDLPPLADDN VRIFLCRHGQ
TENNRLKLIQ GSRLDPSINE TGQEQARRLG KALSFALPVV PTIVFHSPLI RARQTAQIAA
LQFSSNPSVS SPTLRQLDSL NEIDFGSAAE GESVEPYRAK MMATYAGWSV GELDLSMGEG
GETGGEVLAR IEKSLQDLAK SASNASNRCV AAIAHSTFLK ILLATAQNIP LAQVAMLEQK
NCCVNVLDLS TKQAINLASR SELLGGPLSL APLEFTLSIP KTAVIRMNEK RHLGVAAFQT
TVSNSATRRF HSAGLRTTHE ESDIDSRRDF LLQTVSLLSG GVALSGSPDS ATAVVGALPE
FADSNAILQG VTIKVADQSQ QEAMVLFLKD SFDFEVLRRR VQGSIEETWL GYGPEQLRIP
DDFTLPVSSF NTYGGHASVR LVYDAKATVP LYRTGEKAPG ENFAFLQVAV PGYRISQMVK
HGGNIIDAYG FVNVVSPSGL PMRGIVGITP DPIMFVAINC IDVKASQAFY EKLGFQKQEY
PYARPSKGTG PFEPAQPSKS VYMAPSANCM GLLLLPSKKK RLQANPVVQS LNLVYTPSEE
SDSADTMPTL FDPSGIAVSF QSVPQFELEE KETRIVNRIL ICVAALSNVA SFLPASLPRS
QTKSKVKQHL VPVDAFSSSL VTSVVEYFDG STIVDPTIVS DVYWQSLGSR IVSVVIAQVL
VIATFAVVSW VASKQIGNTI NFIALKIFGQ DRRTEAKNID GASGQRLKVP PNLSNVPPRS
PDFGKLFVCV IIDIVGSSSE LLPIIGELSD VVYAPIAALL LRKLYNSNVI FALEFVEEIL
PFTDILPLAT ICWTVDTFAP DSDVAKFLNV GNYGNSQARA STSFDDGMDA IDVNGEVKSP
AKKASTTIAS RDDGQLR