Gene PHATR_43975 details


Gene Information

Locus tag: PHATR_43975
Symbol:
ID: 7204389
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011671
Strand:
Start bp: 591009
End bp: 594757
Gene Length: 3749 bp
Protein Length: 1210 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002186378
Protein GI: 219113589
COG category:
COG ID:
TIGRFAM ID:
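The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state the convention); the function name `span_length` is hypothetical and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose coordinates are 1-based and inclusive (assumed)."""
    return end_bp - start_bp + 1


# Values copied from the record above.
gene_length = span_length(591009, 594757)

# A 1210 aa protein needs 1210 codons plus one stop codon.
coding_length = (1210 + 1) * 3

# Bases in the listed gene sequence that are not part of the coding region.
non_coding = gene_length - coding_length
```

Under that assumption `gene_length` equals the 3749 bp given in the Gene Length field, while the coding region accounts for 3633 of those bases.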


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAACTG CTCCACGTAG AAAGCGGAAA TTCCCGTTGC CCATTATCCC AATCGACCAC 
ATCCGTCCTC TGCTGTGTCC CGATCAGCTG AATCTACGGT ATCCTGTCCA TGGCGCTGAA
CTTGGCACAC TACTCGAGGC AGAGGCGCTG TTTAGTGTAG AAGGAAACGA TATCCTGTTG
ATGCCTAATG AACCAGCAGC GAAAGCTCGA AAAGCTTCGC ACAATCCTCA ACAACAAAGT
CCTGAGCTCG TAGACTCAGA CAGCTTACGA TCTACGGTAA TGCCCGACCA AAGTCAACGC
TCGACGGTCG ATTCCCTATC ACTTAGATCG AACAAAGATA CTGAATGTTT GCGGGAAAAG
AGTTCCGTAG TCGACTCTGT TGGAGAAAGC ACGATTGCTC GACAATCGCC CAACATAAAT
GTTGGAATCC ATGCTGCAGA ACGAAACGCA TCCAGCAATA CGAACACTAA TTCTCTTCCT
GCACATACAA GTGATGCCGA AGAGTACGGC AACTCGCAAT TGCCTCCAAA TTCTCCCGAC
GATGCTGCCG TCTCTGAGCT GGAGCAAGCC TGGATGGGAA AGCTCGATCG CGATATTTCG
GTGGCCGAGC AAAACACGAG TACGAGTAAG CTTGTCGATC GAATAGCTCA AGCTGTAAAA
AGTGGCGTAC AGAAAAAAGT ATTTATATCC AGAAAAGAAA GCTGCGGATC CTCAAACTTG
GACTCTTCTT TGGCAGCTCC TGGCAATAAC CGACGCCATG ACGAGCTGTC GAACGACAAG
TCTTCCTCTG CCTTAGAAGT TATCGATCTA ACTGGGGAGC TCTCAGATTG CGGGGACGAC
TGTGAAGTCT TGGATTTTCC GGAACGTTTG TCTTTGACAG TACCAATTGA TGCAGCCGAA
CAAAGTAGAA CAAAGAAACG ACCTTTGGAC CTCAAAGTGG TGCGAGCTAC GAAATTGGTG
CCGAAGCGGC TGGAAAGTGG TGGTGACTTA GTGTTTTCTG TTGAAGATTT TCAGCGCTAT
CTTGTCGGTC CGTTTAACCA ACGTGAGGGT CGCAAGCTTG AATCTGCGAA AAAGGCGGCC
CCGTTTGCTG CAAAGGTAAT CAGTTTGAAA AAGTCTCCTG ATGTTTGGAA GGTCCTGGTT
GAGGGCCACC CCGATACGGT ATCTTCTTTT GAGGTTGCTA TGGTACAGTG GGTCGGAGAG
AGGATACATG GCTTAAATTT TCAAAAGCTC AAGCTTGAAG GCCTTCCAAA GGATGTTTTG
TTGGAGTTCT CTCACCCTAT TGGGGCCAAG CTTAGACCAG CTCGGTACGT CGTAAATAAT
CGAGAGTCCC GAGCCTTACT TATTTCAATC AATGTGGATG GTCAGCTTGG TAAGGCACTT
GCTCCACTCG TCACTGTATT TGGCTGTGCA GTTGCCCTTT TGGATGGATT GGAATGTCGA
ACAATTTCGG AGTTCAAGTC AGTATTAAAA AATGCAGCTA AAAAGGGACA ACCTTTGTAC
AAGCTCTCTC TGTTGCTTGC AAAGGAAAGC AATGCAGCTG GGAGAGGGCT TCACCAGCAA
CAAAAAAGTT CTTACTTGAG TGGCGGCTTG AGTAGTTTCC CGTCTTCCGC AAACCTACTT
GTGGGTGGCC CAAACACAAC CGATGATAGT GCAAACATCT TATCTGGCGC CAACGTGAGC
ATTGATTCCA GCGTAAAGAC AAGTTTACAT ATACCTCGCA AGGCACTAGA GAGCAAAGCA
GGGAGCAAAC AGGAAAGGAC GTATGAAGTT CTGTTTGATG CACAACAACC CCTAGGATTT
TATTGTATTG CCCTACCCTC TGGAATTTCG CAAGCTGAAT ACTGTTTGAT TGTTTCGATA
TGTCCCGGAG GTCAAGCGTC GAGAGATGGT CGGATACGTC CCGGTTCAGT CGTGCGATCC
GTATCAGGGG AGGACAGGTT ATTAGGTATT GAACAATTAT TTGAGATTTA CGAGATGGCC
AAGCGAAAGA ATCACATAAT CAGCCTTTCA TTTCTGGACC GCCTTAGTCC GCTGAACGGT
TCGATGTCCC AAGCCTTTGG GGAATGGACT GCTAAAGGTC ATTGGAAAGG GCGTGTGAGC
CATGGTTGGG CGGGTGGTGC TCTACAGACT CTCGATACTT CAAACAGAGT TAGTCGCGAA
AGAGATCCTC ACGGAATTCA GAAGCAGCTC TCCATTGGGA GTGGAGAGAC TGGTCGTAAA
TCGATGGATG ATGGAAGGCC GTGTATGGCT GAGCACCCTC CGAATAGTTG CCGAACTAGT
ACCGGGAGCT CAGGAGACCG AAGGGTCCGA TTCATGGACG CAATACATGA GCGACACTAT
TCTATCGATA GCAAGCCCTG CGAATTTTAC GAGAAGCATA GCGGGTTGGC CTTGTTGCCA
AAAGACAAAG GAGCTGGACT GGGACATACA ATACCTTGCG ACAATCAGTC TTTACTCCTT
CGATCAATAA AGTCGGGGTC GTTTCGGGAC GTTATTGTGA TTCTTGAGGA AGGGTTGCTG
AGTTCAGCAA AATCAACCGC ATCACTTATA GTTGCAAAAT CTTATGTGAA AGAGCAACTT
GCCTCACTAC AGCAGAATGG AATAACCGAC GCTGCTTCCG AAAGAGATTG GATGTTGAAA
GATGTTTTGA CCAAGATTTT CTTGAAGGCC GCTCATGTGT ATGAGAATGC CAAGTCTCTC
AAAGAGTGGT CGCGCTACGA AGTAATCTTT CTCGGACTTG AAGAGGTTCA ATTGTCTAGC
AGCGGCGGGC TTCAATTTAA TCAAGATTGC ATTTCCGTTA GGCTATCAGC TCGCTACCCC
GATTCCAACC AGCAGTCCGA ATTGGCAAAG TCTCTACCAG TACCGCTCTC GAAGGATATT
CTTTTTGGCA AGGAGCTTTC TGTCCCGAGG CACATGCACT ACAACAAATG CGTAGCTTCT
AAAAGAAGTG TTGTTGTCGA CATTTGTAAA AACGGGGAGG CATCCGGCTC GGTCAAATGT
ATCGGGTCTA CTGTGTTAAA AATCCAAGAT CTTCAGCGCA AGTGTCCTCG GAATGGAACT
TGGTTGGAGA GTTCGAAAGC GTTCTCGAAC AGAAATCTGT TTGGAAGCGC ATCTATACGA
TTTCGTGCGA GGCGTTTACC AGTTGAGGCC ACTTACCTCG AACGGAAACG GAAAACGGAG
TGCATAGGGC TCAAGGATGT CATCAATTGG ATCAAGCGCT TCAACGATGG GCTTTGCCCA
GAAGAAAGGG ACGCACAACT GACATTTACT GTTCCTGTTT TCGACAACGC TAGCCTGTTG
CATTCCGCCA TTTTAATACA AGAATCACCT CTGGTTGAAG AGTTGCTCTA TCTCGGAGCC
GATCCAAAGA GAAGGAGTGT AATTGGATCA CCTGTCTTTT TGGCCCACAA TCTTCGACAC
AAATTGATAG AAAGTCTCGC CGAGACGTCG AATAGTGAGA TAGCCGATAC AGACGCATAT
CAAGAAAGGG AAAGAGGTCC CGATAGCGCA TCCGTCGTAT CCGAAGGAAG GTGTGCTAAC
CCGCGAAAAA AGAGAATCGA GCACATTGCC GCGCTGATAG CGGCAGCGAC TGGAAATAAA
CTTCCAAGCG AACCACGACG GAAGGTATGC TAACCCGCGG AAAAAGGAAT CAAGCACATT
GCCACGTTGA TAGCGGCAGC GACTAAAACG GCACTGTCCA GCGAATTGTG ACAAGTTGCT
TACATCATAC AACGCGAGCG GCTCTAAAA
 
Protein sequence
METAPRRKRK FPLPIIPIDH IRPLLCPDQL NLRYPVHGAE LGTLLEAEAL FSVEGNDILL 
MPNEPAAKAR KASHNPQQQS PELVDSDSLR STVMPDQSQR STVDSLSLRS NKDTECLREK
SSVVDSVGES TIARQSPNIN VGIHAAERNA SSNTNTNSLP AHTSDAEEYG NSQLPPNSPD
DAAVSELEQA WMGKLDRDIS VAEQNTSTSK LVDRIAQAVK SGVQKKVFIS RKESCGSSNL
DSSLAAPGNN RRHDELSNDK SSSALEVIDL TGELSDCGDD CEVLDFPERL SLTVPIDAAE
QSRTKKRPLD LKVVRATKLV PKRLESGGDL VFSVEDFQRY LVGPFNQREG RKLESAKKAA
PFAAKVISLK KSPDVWKVLV EGHPDTVSSF EVAMVQWVGE RIHGLNFQKL KLEGLPKDVL
LEFSHPIGAK LRPARYVVNN RESRALLISI NVDGQLGKAL APLVTVFGCA VALLDGLECR
TISEFKSVLK NAAKKGQPLY KLSLLLAKES NAAGRGLHQQ QKSSYLSGGL SSFPSSANLL
VGGPNTTDDS ANILSGANVS IDSSVKTSLH IPRKALESKA GSKQERTYEV LFDAQQPLGF
YCIALPSGIS QAEYCLIVSI CPGGQASRDG RIRPGSVVRS VSGEDRLLGI EQLFEIYEMA
KRKNHIISLS FLDRLSPLNG SMSQAFGEWT AKGHWKGRVS HGWAGGALQT LDTSNRVSRE
RDPHGIQKQL SIGSGETGRK SMDDGRPCMA EHPPNSCRTS TGSSGDRRVR FMDAIHERHY
SIDSKPCEFY EKHSGLALLP KDKGAGLGHT IPCDNQSLLL RSIKSGSFRD VIVILEEGLL
SSAKSTASLI VAKSYVKEQL ASLQQNGITD AASERDWMLK DVLTKIFLKA AHVYENAKSL
KEWSRYEVIF LGLEEVQLSS SGGLQFNQDC ISVRLSARYP DSNQQSELAK SLPVPLSKDI
LFGKELSVPR HMHYNKCVAS KRSVVVDICK NGEASGSVKC IGSTVLKIQD LQRKCPRNGT
WLESSKAFSN RNLFGSASIR FRARRLPVEA TYLERKRKTE CIGLKDVINW IKRFNDGLCP
EERDAQLTFT VPVFDNASLL HSAILIQESP LVEELLYLGA DPKRRSVIGS PVFLAHNLRH
KLIESLAETS NSEIADTDAY QERERGPDSA SVVSEGRCAN PRKKRIEHIA ALIAAATGNK
LPSEPRRKVC
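
Both sequences above are printed in blocks of 10 characters, 6 blocks per line, so they need to be joined before use. The following is a minimal sketch in plain Python of how that could be done; the helper names `join_blocks` and `gc_fraction` are hypothetical, and the sample string is only the first line of the gene sequence.

```python
def join_blocks(text: str) -> str:
    """Drop spaces and line breaks from a sequence printed in 10-character blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among the bases of seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence listed above.
first_line = "ATGGAAACTG CTCCACGTAG AAAGCGGAAA TTCCCGTTGC CCATTATCCC AATCGACCAC"
bases = join_blocks(first_line)
line_gc = gc_fraction(bases)
```

Applied to the full listings, `join_blocks` should return strings whose lengths match the Gene Length and Protein Length fields, and `gc_fraction` on the joined gene sequence can be compared with the GC content field.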