Gene PHATR_44131 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATR_44131 
Symbol 
ID7203884 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011671 
Strand
Start bp1069140 
End bp1072563 
Gene Length3424 bp 
Protein Length1111 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002186461 
Protein GI219113755 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0847147 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAATCCA ATAATACAAG CAACACAACG ATCCATTTGA TTGGGACTAG CGCGACTATA 
CTCAATACAG AGAACTCGGA GTGCGGAGCC TGTTACGGTG TTGCGGCGGT CTCAGCTTGG
GCTCGCAATG TTGTCTCCGA AAGCCCCGTT ATAGTACTGC CCCTCCTGGA TCGGTACACA
CAGTTTGTAG AGATGCATCC ACTTAAATTG AGTGTGAATA CGCATGTGTT CAACGTTCTT
TTGGGGTGGG GCGTTTTCCT ATCGAAAGCT TCTCTTCTAT CGCTTGACGC TACACAGAGT
GGCATTCGAA GTTTGCAATC AAGAAGCATG CCGCTGCTGT TGACAAATGC GATTGTATCT
CCCATAAGCT CATGGCACGA ATATTCAAGG GCAGTTCATC TTGACGCGGA AACAAAACTT
GCCGTCCTGT CAATTGTGCC GGCCTCAGCG GAATCTTCTT CGCGTGTGGA TTCAATTTCT
TCGGTCTTGG GCATTCTCCG CTACGTTGCG AAGCTTAGTC GCGAGTCGGG TTGCAACCCG
TCCACGAGCT TGTATGACTC GTACTTGGAC CGCAGCCATG TCGCTATGGA AGCACCGTCT
TGTTGGATTC CGGTTGTCTG GTATTGCGAT CCAGACCCAG CTCATTTTGA ATTTTTCTTA
CAAACAGTTA CGGAAAGCGA ATACGCTCCT GCTGCTATTG TGGATACTGA GAGCAACAGT
GCCGGGTTTT TATCTCCGCG GCAATACGGA AGCAAAGGTA CCTGGGTATT AAGTTATAAG
GCTGACCCCC AGCTTTTCAC GCTTCACAGC TTCGTCCTCG ATGAAGACCG GAAAACTATA
TTGAATTTTA CAACCCAGAA CGAAAACATT TTGGATCTTC CCGAAAATTT CAGAGACGAG
ATTTACCTGA AGCATGTAAC CGATTTGGAA GACCTTGCTG CACAAGCCAT CGCGAATAAT
CCTATCCTGG GAAGTAGTAC ATCCATGCCG GCCCCAGTTG AAGGAGACTT TTATCGATGT
ATAGTTGGAG AATGTGAATT AGGTAATCTA TTTTCAGACG CACTTCGCTG GTACACAGGA
GCTGATGTCG CTTTTCTCGC AGGTGATGGT TTCCGTGGAT CGGGCTGGCC GGAGGGATTT
GTTCGGGTGT CCGACCTTTG GGAGGCTTCC CGTTTCCCGT ACACGGAATG TACTGGAACG
ATGAGCGGTA TCTCCTTGTT TCAATTGTTA AATTATTCGA CCAGTTCGGC GTCACTGAAT
GGGTTCAACA TCGACGGAGG TGAATTTCTT CAGACTTCGG GTATGCGTGT CACTTATAAT
CCACAGTTGT CTGGGTCTCG CCTGATTGCT ATCGAAGTCT GGGACCAAAA TGTAGCTAGA
TACGAGCCAC TGGAACGGCT GCGTATGTAC CGATTTGCCA CCGATAGTAA CCTTTGCCAG
AAAAAGAATC CTTTCCCTAA TTTCCTCGGT CCGAATTTTG CTGTAGAAGG TGAGATTCAG
GGCGCAGTGG GAGATGAGTC GCAGCAAAAC ATTGTTGGTC AATACCTGGC GCAGCTCGAT
TTACCATACG AAGCGTCCCT TTTTAATCGC TTGCGCAATA ATGTGTCCTC ACTGAAGACT
TTGAATCTGG TTCAAGTTGC CGAGGAATGC CCCACTGGAA CGTATTGGAT TACTGAGAGA
CAAACATGTT TTGACTGCCC GGACTCGACT CGTGTTGCTT TTTCGGAAAA AGAATTTCAA
TACCAAATTC CTCATAGCAT GAATGTACCG TTGGAAAGTC GAGTCCTATT ACTAAATGAA
GCGCCCTTTG CAGTTTCTGT TGGACCAAGT TCGATACCAT CGTGGGTTTC TTTCACGAGG
TTCTACTTGA ACTCAACGAT TCCGATTGAT CCTCCGTCGA ATGGAAAAAG GGCAGTTTTG
CAGTCTGGCG GGTCTCTTAC AATCGACTTT ACGGTTAGAT CCCGGGGGTT GTCGTCAGGG
ACGGCGTTGG GTACAGTGTC TTTTGGAGTA CACGTTGGAG GTGCCTATCC CGGGTGCGAC
GGCCAAGAAG CGACCTTTGA CATTTTGATT CGTGTAGCAC CACCCCTGGA GCTCAATCAG
CTAGGAAATA TCAGGTACAT CGGATTAGGG CTGTCCGCTA TCATCTTGTT TACGGCAGCT
GGATTTGCGC TTTGGGTTAG ACGCTCTCGC GAGACGCGTA TTGTCAAGAC TATGCAACCG
TTGTTCCTTG TCACAATTTG CTGTGGTGTT TTTGTTTTGG GAGCCGTTCT CGTTCCACTC
AGTATTGACG ACGAGATAGT TTCAAACCAG GGGTGCGACA TTGCATGCAT CTCGATGCCG
TGGCTCGCTA GTATTGGATT CACGGTAACG TTTTCTGCTT TGTTTTCGAA GCTGTGGCGA
ATCAACAAGC TGTTTCAATG CCAACACTTT CGCCGCACCA AAGTCGAAGA AAAGGATGTT
CTTGCACCAT TCGCCGTCCT TTTTACGCTT AATCTTACGA TTCTCGTATC TTGGACCGTT
GTTGATCCTC TAAAGTGGAG CCGAGCGCCC GTTAATGGAG AGTTTTGGAA CACGCATGGC
GAGTGCAGTG GTTCGAGTAA AACGACAACC ACACTTTTCT TGGTTCTTAT TTGCTTGGTG
AACGCCGGTG CTTTTTCCCT AGCTTGTTGG CAAGCATACC GAGCTCGCAA AATTAGTGAT
GAATTCAGCG AAAGCAAGAA TCTTGGCATG GCAATCTTCT GCTGGGCCCA GCTACTCGCT
GTAGGTTGTC CCGTCCTATT CTTGATCAAC TCTAACGACC CGGTTGCACG CTTTTTCCTT
TTGGCAGTCA TTTTGTTCGC TACCTGCATG TCCATGCTGA TGTTTATTTT TGTGCCCTTG
ACTCTACAAA GCTGGCGTGA TAAACGCGAC GGTGGTCGCC GGCGAAGTTC GGTTCAGATT
TCGGGTGTCA TGTCGGCAGG AATATCCGGT GTGTCACTGG TGTCCAGGTC GCACAGTTTG
TCAGAAAAAA AGGCATCGTC ACAGGAAACC AAATCATCAA ATATGTCCGG TGTTACAGCT
CCTGGCCCTT CGGCGATTCT TTGCAACAAC CGTATTCTGA AGGACGACCT GGAAATTGAC
GTCGAAGCTG TTTCCAATCA CAACTCTTTC TCTCAACGCG CCTCCGTTGG AGATGACAAT
GCCAGCATAG ATGAATCAGC TTGTGGGTTG GCAGACAGTG TCATTCCAGC GGTGGTCCAA
GAGGCTTCTA TCAAAAGCGA CGCCAGCAAG TCTTTTGATG TCCAAGAATC TCTTTTGCTA
TCAGAGAAAA AATTGCGCTT TGCTCCGGGA ATGTAGTAGC TCAAATTGTA CATTTTATCG
TCCCCCACCT ATCTTCACAT CCAATTGAAA TTACTGTAAG TTACAGAACA AATTTGTTAG
TTGC
 
Protein sequence
MESNNTSNTT IHLIGTSATI LNTENSECGA CYGVAAVSAW ARNVVSESPV IVLPLLDRYT 
QFVEMHPLKL SVNTHVFNVL LGWGVFLSKA SLLSLDATQS GIRSLQSRSM PLLLTNAIVS
PISSWHEYSR AVHLDAETKL AVLSIVPASA ESSSRVDSIS SVLGILRYVA KLSRESGCNP
STSLYDSYLD RSHVAMEAPS CWIPVVWYCD PDPAHFEFFL QTVTESEYAP AAIVDTESNS
AGFLSPRQYG SKGTWVLSYK ADPQLFTLHS FVLDEDRKTI LNFTTQNENI LDLPENFRDE
IYLKHVTDLE DLAAQAIANN PILGSSTSMP APVEGDFYRC IVGECELGNL FSDALRWYTG
ADVAFLAGDG FRGSGWPEGF VRVSDLWEAS RFPYTECTGT MSGISLFQLL NYSTSSASLN
GFNIDGGEFL QTSGMRVTYN PQLSGSRLIA IEVWDQNVAR YEPLERLRMY RFATDSNLCQ
KKNPFPNFLG PNFAVEGEIQ GAVGDESQQN IVGQYLAQLD LPYEASLFNR LRNNVSSLKT
LNLVQVAEEC PTGTYWITER QTCFDCPDST RVAFSEKEFQ YQIPHSMNVP LESRVLLLNE
APFAVSVGPS SIPSWVSFTR FYLNSTIPID PPSNGKRAVL QSGGSLTIDF TVRSRGLSSG
TALGTVSFGV HVGGAYPGCD GQEATFDILI RVAPPLELNQ LGNIRYIGLG LSAIILFTAA
GFALWVRRSR ETRIVKTMQP LFLVTICCGV FVLGAVLVPL SIDDEIVSNQ GCDIACISMP
WLASIGFTVT FSALFSKLWR INKLFQCQHF RRTKVEEKDV LAPFAVLFTL NLTILVSWTV
VDPLKWSRAP VNGEFWNTHG ECSGSSKTTT TLFLVLICLV NAGAFSLACW QAYRARKISD
EFSESKNLGM AIFCWAQLLA VGCPVLFLIN SNDPVARFFL LAVILFATCM SMLMFIFVPL
TLQSWRDKRD GGRRRSSVQI SGVMSAGISG VSLVSRSHSL SEKKASSQET KSSNMSGVTA
PGPSAILCNN RILKDDLEID VEAVSNHNSF SQRASVGDDN ASIDESACGL ADSVIPAVVQ
EASIKSDASK SFDVQESLLL SEKKLRFAPG M