Gene PHATR_37029 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATR_37029 
Symbol 
ID7204474 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011679 
Strand
Start bp925235 
End bp928542 
Gene Length3308 bp 
Protein Length1092 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185810 
Protein GI219121161 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTCCTG CCACCCGGCA AATGACAAGT GTAGCCGTCT ATGCCCATTT TCTGGACAAT 
GTACTCCTTC TTCCCCAAGG ACATCCTATT CGTCTTGCCT TTGATCAACA AGGATATGAA
TCGGCTGACA ATCTCTTGTG CATCTTTGAG AACGAACTCG ACTCTCTTGA GTACACTCCT
CCTGCCATTC CTGACGGTCC CGAAAATCCG TCACGCATCC CTCTAATCAT GGCACATCGA
CAGATCATAC GTCACTTCCT ACGTTGGCAA GCATCCTTAA AAGACCAAAA GGGGGCTCCT
TTGAAGAACT CCGAGCTCGT TGCACTCAAC AACGAGGACT TTGTCCTGTA CCGTCGGTCA
GCACTTGGCC AGGTTTCGAC GGCCACTGCA CCTGCCACTG TTCCCCCGAC TGTTCAGAGT
CCCACAGGAA AGACACGTTC GGCTGTCGAG GATTTCAAGC GTGGGATAAA ACGTGATAAA
ACTCACTATC CTGTGCTCAA AGATGACCGA TACTGGGACA ACTTCTATCG TTCGTTTGTT
GTTACTGCCG TAACACATAA CGTTGACAAG GTTCTAGACC CAAACTACAT TCCTACCGAT
CCTTTGGAAA AGTCCCTCTT TGAAGAACAG AACAAGTTTG TATATTCTGC TCTAGAGCAT
ACTCTTCAGA CGGACATGGG AAAGAACATT GTCCGTGAGC ACAGTTTTGA TTTCAATGCC
CAAGAAGTTT TCCGTAAAAT CGTGAAACAT TATACGGAGT CAGCCAGCGC AAAGATCAGT
TCGTCTACTA CTCTGGGGTA TCTCACAACT GCAAAATACG GATCGTCATG GACAGGTACA
GCAGAAGGAT TTATTCTCCA CTGGAAGAAT CACTTGCGTA TTTACAATGA CACTGTACCT
ACTGGTGAGC AACTTCCTCA GCAATTGTGC CTTAGTCTTT TGGAGAATGC TGTTCATGAT
GTACCTGAAC TCCGACAGGT TAAAATCACA GCAACACTCG ACTTAGCCAA AGGTGGTAGT
CCCATTAGCT ACGATAGTTA CCTCAGTCTC CTCCTTGCAT CGGCATCGCT CTACGACAAC
GGTAATAATC TATCTAATTC TCGCAGTGGC AAGAACAAGC GCAATATCTA TACTACTGAA
CTAGCCTATC ATCCGACGGA TTTTGAAAGC GAACCAGATG TAGACTATGA TATAGATGTG
TCACCGACTG CCATATACGA AGCCAATGCC CACGTCCGTA ACAACAGTAC CCGTAACCGT
CCCCTGGCAA CTAATCGCGA ACGACCTTAC ATTCCTCGTG AAATGTGGAA TTTGCTCTCT
GATGATTCCA AGGCCATCCT CCAAGGTTTG GCTGCACCCG GCAAGCAGGC ACCATTAAAT
GGTAGCCCGC CTCATCAAAC GCTGCAGGCC AATACACACG AGACCATTGG CACGGAACAT
ACCGCAACGG ACACCTTCCA TGATTGCGCA CCTGAAACTG AATTACTCGC ACATCTTACT
GAGCGTGTCA GTCGCATGAG CAGCGGTGAT ATTCGTAAGG TACTCGCCGC ATCACGTGAC
GTATCAGAAA AGCCCAAATC ACTGCAATCT AACGTACTGC AATACCAAGT CTCTCGTCAT
ACTACCAACG AGACTTCTGC ATCCCTTGTT GACCGTGGCG CTAACGGAGG GCTTGCGGGT
GGTGATGTCA TTGTCCTGCT CAAAACAGGA CGTTCGGCAA ACATCACAGG TATCAACGAT
CATACCTTGC CAAACTTGGA CATCGTCACT GCCGCTGGAT GTGTTGAATC CCAAAATGGG
CCCATCATTC TCATTATGAA CCAGTATGCT CATCTGGGGA AGGGTAAAAC CATTCATTCA
AGTGCGCAGT TGGAACACTA TCGTAATCAT GTCGAAGACC GTTCACGCAC GGTAGGGGGT
AATCAGCGCA TTGTCACTTT AGATGACTAT ATCATTCCCC TCCATATTCG ACAGGGACTT
CCATACATGG ACATGCGACG CCCCACTGAT GCTGAACTAG CGTCCCTCCC GCATGTTGTC
CTAACCTCAG ACGTCGATTG GGACCCCTCT GTACTCGACA ATGAAATTGA CCTTGCGACT
TCATGGTACG ATGGCATCCA TGACTTGCCC CAGCCCCCAT ACGTTGAACC ACGTTTTGAT
CATACAGGCC AATACCTTCA CCGTCACATT TCTCTATGCG ACTACCGTGA TGACGCCATT
GCACGTATCA TGCAGTGCCA ACAGCATCAC GTCACACGTA ATGTGCACGA TTATGAAGCC
CTTCGTCCTT GCTTTGGCTG GGTCTCTGCC GACACCGTTC GGAAAACCAT CATGGCCACC
ACGCAGCATG CCCGTGAAGT CTATCACGCA CCGTTACGTA AACATTTTAA GTCTCGTTTC
CCAGCCTTAA ATGTACACCG TCATAACGAA CCGGTCGCTA CTGATACCAT ATGGTCCGAC
ACTCCCGCCG TAGACAATGG TGCAAAATTT GCACAACTCT TCGTTGGCAG ACGGTCTCTT
GTCACTGATG CCTACCCCAT GAAAACTGAT AAAGAGTTTG TCAACACCCT TGAAGATCAC
ATCCGTTTCC GTGGTGCAAT GGACAAACTA ATCAGTGATC GCGCTCAAGT TGAGATCAGT
AAAAAGGTCA CTGATATTAC ACGCGCATAT AATATCGATC AGTGGCAGAG TGAGCCTAAC
CATCAGCACC AAAACTTCGC CGAACGTCGT ATCGCCACCA TCGAAGCCAA TACGAACAAC
ATTCTCAATC TTACTGGTGC TCCTGATAAC ACCTGGCTTC TTTGCGTGAC ATATGTTTGC
TATGTCTTCA ACCATTTGGC GCATGAATCT CTCGATCATC GCACCCCCCT CGAAGTGCTT
ACTGGTTCTA CACCTGATAT CAGTGTTCTC CTTCAGTTTC ATTTTTGGGA ACCGGTCTAT
TATAGAATTG AAGATGCGAC ATTCCCCTCT GGTGGTACCG AGCAACAAGG ACATTTTGTC
GGCATCGCAG ACTCCGTCGG TGACGCTCTC ACTTATAAGA TCCTCAACGA CCGCACCAAC
CGCATTCTAT ATCGATCTAG TGTTCGTTCT GCGGCCATTT CCGGGCAAAC CAACCTACGC
CTTGCGTCAC AGGATGGGGA GAATGGTCCT AAGCCCATCA ACTTTATCAA GTCGCGTAGA
ACCGAAAACC AAAATTCCTA TGCCATTAAG GAGTTGCCTG GTTTTACACC TGATGATCTT
ATCGGTCGCA CGTTTCTCAC CGACACTCGT GATGATGGAG AGCGTTTTCG GGCACGAATC
ACCCGTAA
 
Protein sequence
MVPATRQMTS VAVYAHFLDN VLLLPQGHPI RLAFDQQGYE SADNLLCIFE NELDSLEYTP 
PAIPDGPENP SRIPLIMAHR QIIRHFLRWQ ASLKDQKGAP LKNSELVALN NEDFVLYRRS
ALGQVSTATA PATVPPTVQS PTGKTRSAVE DFKRGIKRDK THYPVLKDDR YWDNFYRSFV
VTAVTHNVDK VLDPNYIPTD PLEKSLFEEQ NKFVYSALEH TLQTDMGKNI VREHSFDFNA
QEVFRKIVKH YTESASAKIS SSTTLGYLTT AKYGSSWTGT AEGFILHWKN HLRIYNDTVP
TGEQLPQQLC LSLLENAVHD VPELRQVKIT ATLDLAKGGS PISYDSYLSL LLASASLYDN
GNNLSNSRSG KNKRNIYTTE LAYHPTDFES EPDVDYDIDV SPTAIYEANA HVRNNSTRNR
PLATNRERPY IPREMWNLLS DDSKAILQGL AAPGKQAPLN GSPPHQTLQA NTHETIGTEH
TATDTFHDCA PETELLAHLT ERVSRMSSGD IRKVLAASRD VSEKPKSLQS NVLQYQVSRH
TTNETSASLV DRGANGGLAG GDVIVLLKTG RSANITGIND HTLPNLDIVT AAGCVESQNG
PIILIMNQYA HLGKGKTIHS SAQLEHYRNH VEDRSRTVGG NQRIVTLDDY IIPLHIRQGL
PYMDMRRPTD AELASLPHVV LTSDVDWDPS VLDNEIDLAT SWYDGIHDLP QPPYVEPRFD
HTGQYLHRHI SLCDYRDDAI ARIMQCQQHH VTRNVHDYEA LRPCFGWVSA DTVRKTIMAT
TQHAREVYHA PLRKHFKSRF PALNVHRHNE PVATDTIWSD TPAVDNGAKF AQLFVGRRSL
VTDAYPMKTD KEFVNTLEDH IRFRGAMDKL ISDRAQVEIS KKVTDITRAY NIDQWQSEPN
HQHQNFAERR IATIEANTNN ILNLTGAPDN TWLLCVTYVC YVFNHLAHES LDHRTPLEVL
TGSTPDISVL LQFHFWEPVY YRIEDATFPS GGTEQQGHFV GIADSVGDAL TYKILNDRTN
RILYRSSVRS AAISGQTNLR LASQDGENGP KPINFIKSRR TENQNSYAIK ELPGFTPDDL
IGRTAFSGTN HP