Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_44131 |
Symbol | |
ID | 7203884 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | - |
Start bp | 1069140 |
End bp | 1072563 |
Gene Length | 3424 bp |
Protein Length | 1111 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186461 |
Protein GI | 219113755 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0847147 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAATCCA ATAATACAAG CAACACAACG ATCCATTTGA TTGGGACTAG CGCGACTATA CTCAATACAG AGAACTCGGA GTGCGGAGCC TGTTACGGTG TTGCGGCGGT CTCAGCTTGG GCTCGCAATG TTGTCTCCGA AAGCCCCGTT ATAGTACTGC CCCTCCTGGA TCGGTACACA CAGTTTGTAG AGATGCATCC ACTTAAATTG AGTGTGAATA CGCATGTGTT CAACGTTCTT TTGGGGTGGG GCGTTTTCCT ATCGAAAGCT TCTCTTCTAT CGCTTGACGC TACACAGAGT GGCATTCGAA GTTTGCAATC AAGAAGCATG CCGCTGCTGT TGACAAATGC GATTGTATCT CCCATAAGCT CATGGCACGA ATATTCAAGG GCAGTTCATC TTGACGCGGA AACAAAACTT GCCGTCCTGT CAATTGTGCC GGCCTCAGCG GAATCTTCTT CGCGTGTGGA TTCAATTTCT TCGGTCTTGG GCATTCTCCG CTACGTTGCG AAGCTTAGTC GCGAGTCGGG TTGCAACCCG TCCACGAGCT TGTATGACTC GTACTTGGAC CGCAGCCATG TCGCTATGGA AGCACCGTCT TGTTGGATTC CGGTTGTCTG GTATTGCGAT CCAGACCCAG CTCATTTTGA ATTTTTCTTA CAAACAGTTA CGGAAAGCGA ATACGCTCCT GCTGCTATTG TGGATACTGA GAGCAACAGT GCCGGGTTTT TATCTCCGCG GCAATACGGA AGCAAAGGTA CCTGGGTATT AAGTTATAAG GCTGACCCCC AGCTTTTCAC GCTTCACAGC TTCGTCCTCG ATGAAGACCG GAAAACTATA TTGAATTTTA CAACCCAGAA CGAAAACATT TTGGATCTTC CCGAAAATTT CAGAGACGAG ATTTACCTGA AGCATGTAAC CGATTTGGAA GACCTTGCTG CACAAGCCAT CGCGAATAAT CCTATCCTGG GAAGTAGTAC ATCCATGCCG GCCCCAGTTG AAGGAGACTT TTATCGATGT ATAGTTGGAG AATGTGAATT AGGTAATCTA TTTTCAGACG CACTTCGCTG GTACACAGGA GCTGATGTCG CTTTTCTCGC AGGTGATGGT TTCCGTGGAT CGGGCTGGCC GGAGGGATTT GTTCGGGTGT CCGACCTTTG GGAGGCTTCC CGTTTCCCGT ACACGGAATG TACTGGAACG ATGAGCGGTA TCTCCTTGTT TCAATTGTTA AATTATTCGA CCAGTTCGGC GTCACTGAAT GGGTTCAACA TCGACGGAGG TGAATTTCTT CAGACTTCGG GTATGCGTGT CACTTATAAT CCACAGTTGT CTGGGTCTCG CCTGATTGCT ATCGAAGTCT GGGACCAAAA TGTAGCTAGA TACGAGCCAC TGGAACGGCT GCGTATGTAC CGATTTGCCA CCGATAGTAA CCTTTGCCAG AAAAAGAATC CTTTCCCTAA TTTCCTCGGT CCGAATTTTG CTGTAGAAGG TGAGATTCAG GGCGCAGTGG GAGATGAGTC GCAGCAAAAC ATTGTTGGTC AATACCTGGC GCAGCTCGAT TTACCATACG AAGCGTCCCT TTTTAATCGC TTGCGCAATA ATGTGTCCTC ACTGAAGACT TTGAATCTGG TTCAAGTTGC CGAGGAATGC CCCACTGGAA CGTATTGGAT TACTGAGAGA CAAACATGTT TTGACTGCCC GGACTCGACT CGTGTTGCTT TTTCGGAAAA AGAATTTCAA TACCAAATTC CTCATAGCAT GAATGTACCG TTGGAAAGTC GAGTCCTATT ACTAAATGAA GCGCCCTTTG CAGTTTCTGT TGGACCAAGT TCGATACCAT CGTGGGTTTC TTTCACGAGG TTCTACTTGA ACTCAACGAT TCCGATTGAT CCTCCGTCGA ATGGAAAAAG GGCAGTTTTG CAGTCTGGCG GGTCTCTTAC AATCGACTTT ACGGTTAGAT CCCGGGGGTT GTCGTCAGGG ACGGCGTTGG GTACAGTGTC TTTTGGAGTA CACGTTGGAG GTGCCTATCC CGGGTGCGAC GGCCAAGAAG CGACCTTTGA CATTTTGATT CGTGTAGCAC CACCCCTGGA GCTCAATCAG CTAGGAAATA TCAGGTACAT CGGATTAGGG CTGTCCGCTA TCATCTTGTT TACGGCAGCT GGATTTGCGC TTTGGGTTAG ACGCTCTCGC GAGACGCGTA TTGTCAAGAC TATGCAACCG TTGTTCCTTG TCACAATTTG CTGTGGTGTT TTTGTTTTGG GAGCCGTTCT CGTTCCACTC AGTATTGACG ACGAGATAGT TTCAAACCAG GGGTGCGACA TTGCATGCAT CTCGATGCCG TGGCTCGCTA GTATTGGATT CACGGTAACG TTTTCTGCTT TGTTTTCGAA GCTGTGGCGA ATCAACAAGC TGTTTCAATG CCAACACTTT CGCCGCACCA AAGTCGAAGA AAAGGATGTT CTTGCACCAT TCGCCGTCCT TTTTACGCTT AATCTTACGA TTCTCGTATC TTGGACCGTT GTTGATCCTC TAAAGTGGAG CCGAGCGCCC GTTAATGGAG AGTTTTGGAA CACGCATGGC GAGTGCAGTG GTTCGAGTAA AACGACAACC ACACTTTTCT TGGTTCTTAT TTGCTTGGTG AACGCCGGTG CTTTTTCCCT AGCTTGTTGG CAAGCATACC GAGCTCGCAA AATTAGTGAT GAATTCAGCG AAAGCAAGAA TCTTGGCATG GCAATCTTCT GCTGGGCCCA GCTACTCGCT GTAGGTTGTC CCGTCCTATT CTTGATCAAC TCTAACGACC CGGTTGCACG CTTTTTCCTT TTGGCAGTCA TTTTGTTCGC TACCTGCATG TCCATGCTGA TGTTTATTTT TGTGCCCTTG ACTCTACAAA GCTGGCGTGA TAAACGCGAC GGTGGTCGCC GGCGAAGTTC GGTTCAGATT TCGGGTGTCA TGTCGGCAGG AATATCCGGT GTGTCACTGG TGTCCAGGTC GCACAGTTTG TCAGAAAAAA AGGCATCGTC ACAGGAAACC AAATCATCAA ATATGTCCGG TGTTACAGCT CCTGGCCCTT CGGCGATTCT TTGCAACAAC CGTATTCTGA AGGACGACCT GGAAATTGAC GTCGAAGCTG TTTCCAATCA CAACTCTTTC TCTCAACGCG CCTCCGTTGG AGATGACAAT GCCAGCATAG ATGAATCAGC TTGTGGGTTG GCAGACAGTG TCATTCCAGC GGTGGTCCAA GAGGCTTCTA TCAAAAGCGA CGCCAGCAAG TCTTTTGATG TCCAAGAATC TCTTTTGCTA TCAGAGAAAA AATTGCGCTT TGCTCCGGGA ATGTAGTAGC TCAAATTGTA CATTTTATCG TCCCCCACCT ATCTTCACAT CCAATTGAAA TTACTGTAAG TTACAGAACA AATTTGTTAG TTGC
|
Protein sequence | MESNNTSNTT IHLIGTSATI LNTENSECGA CYGVAAVSAW ARNVVSESPV IVLPLLDRYT QFVEMHPLKL SVNTHVFNVL LGWGVFLSKA SLLSLDATQS GIRSLQSRSM PLLLTNAIVS PISSWHEYSR AVHLDAETKL AVLSIVPASA ESSSRVDSIS SVLGILRYVA KLSRESGCNP STSLYDSYLD RSHVAMEAPS CWIPVVWYCD PDPAHFEFFL QTVTESEYAP AAIVDTESNS AGFLSPRQYG SKGTWVLSYK ADPQLFTLHS FVLDEDRKTI LNFTTQNENI LDLPENFRDE IYLKHVTDLE DLAAQAIANN PILGSSTSMP APVEGDFYRC IVGECELGNL FSDALRWYTG ADVAFLAGDG FRGSGWPEGF VRVSDLWEAS RFPYTECTGT MSGISLFQLL NYSTSSASLN GFNIDGGEFL QTSGMRVTYN PQLSGSRLIA IEVWDQNVAR YEPLERLRMY RFATDSNLCQ KKNPFPNFLG PNFAVEGEIQ GAVGDESQQN IVGQYLAQLD LPYEASLFNR LRNNVSSLKT LNLVQVAEEC PTGTYWITER QTCFDCPDST RVAFSEKEFQ YQIPHSMNVP LESRVLLLNE APFAVSVGPS SIPSWVSFTR FYLNSTIPID PPSNGKRAVL QSGGSLTIDF TVRSRGLSSG TALGTVSFGV HVGGAYPGCD GQEATFDILI RVAPPLELNQ LGNIRYIGLG LSAIILFTAA GFALWVRRSR ETRIVKTMQP LFLVTICCGV FVLGAVLVPL SIDDEIVSNQ GCDIACISMP WLASIGFTVT FSALFSKLWR INKLFQCQHF RRTKVEEKDV LAPFAVLFTL NLTILVSWTV VDPLKWSRAP VNGEFWNTHG ECSGSSKTTT TLFLVLICLV NAGAFSLACW QAYRARKISD EFSESKNLGM AIFCWAQLLA VGCPVLFLIN SNDPVARFFL LAVILFATCM SMLMFIFVPL TLQSWRDKRD GGRRRSSVQI SGVMSAGISG VSLVSRSHSL SEKKASSQET KSSNMSGVTA PGPSAILCNN RILKDDLEID VEAVSNHNSF SQRASVGDDN ASIDESACGL ADSVIPAVVQ EASIKSDASK SFDVQESLLL SEKKLRFAPG M
|
| |