Gene PHATR_43994 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATR_43994 
Symbol 
ID7204400 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011671 
Strand
Start bp641490 
End bp644482 
Gene Length2993 bp 
Protein Length942 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002186388 
Protein GI219113609 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ACAGCAAACA CTCCGGTACC ACCGATCCTT GTTTCTGTGA CCATTTTGAA AGCATCAGCA 
CATATTTTTT CACGATGCCG ATGGATATTC GTCAATTTTT CAAAGGCGGA GGATCCAGCA
AAAAAAACAC CGTAAAGCCG GTATCCAATT TGATGGATCA GGTCAAACTG GTGAACTCTG
GCTCCAAAAA GCGCAAAGAA TCCCCCGTAC ACGAGGAAGA ATCCACAAAT ACTTTTCTTG
AATCCACTGG AAATGTTCCT ATTCAGGAAC GAGAGAAAGA GCCATCAGGT CGTAGGAGAT
CGCCCCGTAA GCTGTCGAAA AATAGTCCGG CAAACGTAGT GCGAGCCGAG ATTGGATTTG
TCGACGGAAC GAAAGAAAAG CCGATCGCTA GTCCCAACAA AGCTCTGGAT ATGAGTTTAT
CGAAAAAATC TTCCAAATTA GCAACGTCAC CCCAAAAAAA ACCGTCGGGA GTTGTCAGCA
TTGCCAACTC TCTTCCCCCG ACCGCCGCTA TCGGGAATCG CAATGTCCCT TCCGATTTCA
GCCCCTCGCC GCAGACCCGC AAATCTCCAC CCACGAGTAC TGTGAGCAAC ACGAAACGCC
TAAAGCGTGA TCCGCCTCTG GAGCCCAAGC TAACACAATC ATCGTTCAAT GTCGACAAGG
CTGCTCCAGA ATGTTTGAGA GGCTGCACGT TTGTCTTTTC CGGTGTCTTA CCGAACCTCA
GTCGCGAGGA CGGCCAGGAA ATGGTCAAAA CACTTGGCGG ACGGATCACT GGAGCTGTAT
CAAGTTTGAC AAATTATCTT GTTGTTGGCG AAGAGTTGGA AGATGGACGT GTCTACACAG
AGGGCAGCAA GTACAAACGT GCGGTCCAAG AAGGTACGCA CATTGTTCAG GGCGAGGAGG
CCTTTTACGG GTTGCTACAG CAGTACAATG ACAAGGAAAT CGCAGCAGGA AATGCTTTAT
TGAACACAGC TCCCAAACTG AGCCAATCGG AAGCACCTCT TGCTGCGAAT CCGTATGCCA
AGAAGGCGCC AAATAATCCT TACGCCAAAC CTGCCCTTTC AAATCCTTAC GTTAAGGCAA
AACCATACAC TTCTGGCAAG CCTTCGCCTG CAGAAATTAG CTCACCAGTC GACATCAAAG
CAGATCGCTC TTCCGGTGCT AACCTTCTTT GGGTGGACAA GTACAAACCG ACTCGCTCGG
GGGAAATTTT GGGAAATGCC GAGTCGGTGA AGAAGCTTGG CCTTTGGCTG TCATCTTGGG
AACAGAAGTT TAACAACTCC AAAGCTGTTG GAAAAGGTGT TGCTAATCCA AACGATCGCT
TCAAGGCCGC ACTTTTGTCT GGGCCACCTG GCATTGGTAG TAAGTACAGA TTCTTCCTGT
GTTTGTTTTC TCTCGCTTTC GGATACGTTT CTGACACTCA ACCCATCATA TATTCAGAAA
CGACTACAGC AACTATTGTT GCAAAAGAAT CAGGTCGCGA TGTGATTGAA TTCAATGCTT
CCGACGTGCG ATCCAAGAAA GCGATCAAAG ACGACATGGG TGATATCACT GGTTCATACA
CACTCGAGTT TGGCAAACCC GCCATCAATG AAAAGCGCCA AAGTAGTCGG ATTAAGCGTT
GTATAATTAT GGACGAAGTT GATGGCATGG GTGCTGGGGA TCGCAGTGGG ATGTCAGAAC
TTATTCAAAT GATTAAAAAG AGCCGAGTTC CGATTATCTG CATTTGTAAC GATCGGCAGT
CCCAAAAGAT GAAAAGCCTG CTTCCCTACT GTATGGATCT TAGGTACCGG CGACCGACAA
AATCTGTAAT CGCGAATCGC GCTGTAAGAA TTGCGGCACA AGAAGGATTT ACCGTCGAAC
AAAACGCAGC TGAAGCGATT GCTGAGTCAT GCGGAAACGA CGTTCGGCAG GTTTTGAATT
GCATGCAAAT GTGGGCCAGT GACAGCAGTA GTGAATCGCG CATGACTTAC AAGGATTTGA
AACAACGCGA GAGCTCCATT AACAAAGACG AGATCCTCCG CGTCAGTCTT TTCGATGCAG
CGCGAAATAT TTTGGAAGGT CGTCGAGGGC TACAAGGAGC TGATGCATCG ACCGAGCGAC
AGCACTTTTT CAGAAGAAAC GATGCCTTCT TCGTAGACTA CAACTTTGTT GGTCTGTTGG
TACAGCAGAA CTACATCAAA GTGATGCAAG GTCAGTTCAA TGATGCAAAA CGTTCAAATG
ACCAGTCCAA TATTTTAGGT GTTTTGGAGC GAATGAGCCA GGCCTCGGAT GCCATGTCCG
ATTTTGCTGA GGCCGAGAAC GGACTGAGGG GAGGCCAGAA CTGGAGCCTT TTGCCCTTTT
GTGCAATGCT AGCGGTAAAA ACTGGCTTCC ATGCTGGTGG TCCCAATGGG GGCGGTCTTC
CTGGCTTCCC AGACTTTACT TCTTGGCTTG GACGAAATTC TAGCAAAGGC AAGAAAGCTC
GTCTGTTACA CGAACTACAG CATCACATGA ATTATAAGAT TAGTGGTGGA GCTCAAGAAA
TGCGTTTATC CTACCTACCA GTTTTACGTG ACCGGTTCTT GTCGCTCCTA CTGGGCAGAG
AAGAAGGACT CACTGAAAAA GCCATTGACC TCATGGATGA ATATGGCCTG GACCGAGACG
ACGTCTTCGA AAAGCTTGAT GAGTTTCGAA TGGATCACAA GGCGGACACC TTCGCTAAGC
TGGATAGCAA GAAAAAGGCC GCCTTCACAA GGTTTTATAA TCAAGGTACT CATAGAAGCC
AAGCACTAGT GGCTGAACAA GGCGGTAGCA AGACGGTTAA GCGTGGTGCT AACGCGGTTG
CGGAGGAAAC GATTGATCCA GATGCCATCG ACGATGATGT CGCAAAGGCT GAAGAAAATG
AAGGTGATGA TGCGGACGAG GACATGGAAA AGATTAAAGC CATGTTCAAA AAGAAAGGGC
GAAACACGAC GCAGAAAGCT GCTACCAAGG GTAAAGCCAA GAAAAAGAAA TAG
 
Protein sequence
MPMDIRQFFK GGGSSKKNTV KPVSNLMDQV KLVNSGSKKR KESPVHEEES TNTFLESTGN 
VPIQEREKEP SGRRRSPRKL SKNSPANVVR AEIGFVDGTK EKPIASPNKA LDMSLSKKSS
KLATSPQKKP SGVVSIANSL PPTAAIGNRN VPSDFSPSPQ TRKSPPTSTV SNTKRLKRDP
PLEPKLTQSS FNVDKAAPEC LRGCTFVFSG VLPNLSREDG QEMVKTLGGR ITGAVSSLTN
YLVVGEELED GRVYTEGSKY KRAVQEGTHI VQGEEAFYGL LQQYNDKEIA AGNALLNTAP
KLSQSEAPLA ANPYAKKAPN NPYAKPALSN PYVKAKPYTS GKPSPAEISS PVDIKADRSS
GANLLWVDKY KPTRSGEILG NAESVKKLGL WLSSWEQKFN NSKAVGKGVA NPNDRFKAAL
LSGPPGIGTT IVAKESGRDV IEFNASDVRS KKAIKDDMGD ITGSYTLEFG KPAINEKRQS
SRIKRCIIMD EVDGMGAGDR SGMSELIQMI KKSRVPIICI CNDRQSQKMK SLLPYCMDLR
YRRPTKSVIA NRAVRIAAQE GFTVEQNAAE AIAESCGNDV RQVLNCMQMW ASDSSSESRM
TYKDLKQRES SINKDEILRV SLFDAARNIL EGRRGLQGAD ASTERQHFFR RNDAFFVDYN
FVGLLVQQNY IKVMQGQFND AKRSNDQSNI LGVLERMSQA SDAMSDFAEA ENGLRGGQNW
SLLPFCAMLA VKTGFHAGGP NGGGLPGFPD FTSWLGRNSS KGKKARLLHE LQHHMNYKIS
GGAQEMRLSY LPVLRDRFLS LLLGREEGLT EKAIDLMDEY GLDRDDVFEK LDEFRMDHKA
DTFAKLDSKK KAAFTRFYNQ GTHRSQALVA EQGGSKTVKR GANAVAEETI DPDAIDDDVA
KAEENEGDDA DEDMEKIKAM FKKKGRNTTQ KAATKGKAKK KK