Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_24915 |
Symbol | |
ID | 7196245 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 1903523 |
End bp | 1906055 |
Gene Length | 2533 bp |
Protein Length | 791 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176867 |
Protein GI | 219110231 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.615618 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CATCTGTATT GCGCATATCC TGTCGCTCTG CGTCTGATGC GAATACCAGT CGAAGAATAT GCAAGCAATT AAACAGCGTG ATACAAACCT GGCTGGTGCC AAAGATGAAG CTTCTCTTTC TAGGGTCAAA ACTCGTCGTC GCAAAATCAA TCCCTTTTCA CCACCGACAA CAGTCGATGT GCGCGGAGTA CGAGTGCACT TTCCCTTTCG CCCTTACAAA TGTCAAGAGA CTTACATGGA GAAAGTGTTA GATGCGTTGC TGCGCTCGGA GAATGCGCTT CTGGAAAGTC CAACTGGGAC TGGGAAAACG CTGTGTTTAT TGTGTTCGAC CTTAGCTTGG CAGCGTGAGC AATCACGACT CTTGCAGCAA GCATCAGAGC TACAAAACAC GGACGCAAGT CTTCTTGCGA ACAGCCAAGA TGCCCCGGCG AGAGCGGCTC GTGTTCCAAC AATTATTTAT GCGTCACGGA CTCATTCGCA ACTGTCTCAA GTCGTTCGCG AACTGCGTAA CACGCGGTAT CGGCCTCAAC ATGCAGTCCT CGGCTCTCGA GACCAAATGT GCGTCAATCC AAAAGTCAAA AAGCAAGGCT GTTCGGCAAC AGACGTAAAC CATGACTGTA GCAAGCTGGG AAAGGATCGA AAATGTCGCT TTCGCAATAA TCTAGACGGT TTCACTGCTC CGGCCAACGA AAACTCCGGG ACAAATACGC AGCCAGTTAT GGATATGGAA GATCTCGTAA CGATGGGGAA GTCGCATAAG GTGTGCCCGT TCTACTATAC TAGAGCGTTG GTGGCAAAGG CAGAACTGAT TCTTGTGCCG TATAACTATC TGTTTGATAA AGACGCACGT GCAACAACAT TGTCCGATAT TCCGTGGGAC AACGCCGTGT TAATATTTGA CGAGGCTCAC AACTTGGAGT CTTTTGCAAG CGAGTCAGCT TCGTTTGATT TATCGAATAC AGATATATCG GGTTGCATTG CGGAAGTACA AAAGGCTGGA AACTATTTAC AAGCCATGCC TGATCTGGGT TCTAACCTTA AGGAGGAGAA CCTTGTTAAA CTCAAGGCTA TCTTTCTCAA ACTCGAAGAC TATCTCATGG GTCTCGGGAA TCAAACAGCA TACACTGGGG AGTTTATGAT GGATTTTTTC CAACAAGGGG GAGGAGTCAA TCACAGCAAC CATGAAATTT TCATCGAAGA ACTGCGAAAA GTCAATGATC TACTTTTAGA TATGAGGGGG ACTGGTGCGA CGAAAGGTTC ACCCAAGCTG GAGCACTTTG TCCAGTGTCT CAAACGGGTG TTTGGAGAAA AACTAGAATC ACGGTGTCTT GCAAAGTCAG CATCATATCG TGTGCATGTT TCACCCAAGG AGCAACGGCA ATCCGGCAAG CAACATATTG CGAGCCGCAC AGTTTCTTAT TGGTGTTTTG CGCCGGCATT GGCTATGGAA GAACTTGCCA GCCTTAACGT TCGATCAATC ATTGTCACTT CCGGAACTCT GTCGCCATTA CCGAGCTATA GTATGGAGTT GGGTCTCAAC TTTCCTCATA CGCTAGAAAA TCCACATATA GTCAGTGACA ACCAGATTCA CGTACGTGTC ATCGGCAAAG GTGTCAGTGG AAAAGAGCTG ACGAGTTCCT ATGAACGACG AAAGGACGGG GAATACTACT CAGAGCTTGG AAATACGCTA GTAGCACTTT CCAAGGTAAC CCCGGCAGGT ATGCTGGTTT TCTTTCCGAG CTACAGCGTG ATGGAGACTT GTATTGAGCG ATGGGGTGGG CCTTCGTTGA ATTACAACAT TAGTAACATC GGAAGGAGCA GTTTCTTTGC GGCACGACAG AAGAAACACC CTTCTGTCAA TCAATGGTCT TTTCCCTTCG TTCAGATGTC GTACTACGAT TCAGAGTCAC CGAAAACTCC ATGGAAACGA TTGCTTTCTA CTAAGTCTGT TGTAGTTGAA CCTAAGTCGT CTGCGAACTT GCCTGAAGCC ATTGCCGACT TTCACAGATT TCTAGGCATG TTAAAGTCGC CTGGCTGTAT CTTGATGGGT GTATGTCGAG GAAAGATCAG TGAGGGTATC GACTTCGCCA ATGAGCAAAG TCGCGCGGTT GTAATCACAG GCCTACCGTT TCCTCCATCT TTTGATGCCA AAGTTAAACT GAAGCGCGAC TTTTTGGACG GTGCGCGGGC GATCGGCAGC ATGAAAGCGA GCAACGTAGG AGGGTTTGCA GACAATGCGA GCTCTACTTC AAATTTGGCG ACCAAGCTTT CAGGGCACGA TTGGTATGCA CAGCAGGCGC ATCGAGCGGT CAATCAGGCG ATCGGCCGAG TCATTCGCAA TAGATCCGAC TATGGAGCCG TTTTGCTTCT TGATTCTCGA TTTAGCCAAC AGAGAAACCA AGAGGGTCTC AGCAAATGGG TTCGCCCGCA TTTGAAAAAA GACGAAGGAT TCGGAACCGC GGTGGGTGCC TTGGCGAAGT TCTACAGAGA ATCCGCTTTG GAAGCCGAAG TTCGCGAGAC TACAGAGTCA AAA
|
Protein sequence | MQAIKQRDTN LAGAKDEASL SRVKTRRRKI NPFSPPTTVD VRGVRVHFPF RPYKCQETYM EKVLDALLRS ENALLESPTG TGKTLCLLCS TLAWQREQSR LLQQASELQN TDASLLANSQ DAPARAARVP TIIYASRTHS QLSQVVRELR NTRYRPQHAV LGSRDQMCVN PKVKKQGCSA TDVNHDCSKL GKDRKCRFRN NLDGFTAPAN ENSGTNTQPV MDMEDLVTMG KSHKVCPFYY TRALVAKAEL ILVPYNYLFD KDARATTLSD IPWDNAVLIF DEAHNLESFA SESASFDLSN TDISGCIAEV QKAGNYLQAM PDLGSNLKEE NLVKLKAIFL KLEDYLMGLG NQTAYTGEFM MDFFQQGGGV NHSNHEIFIE ELRKVNDLLL DMRGTGATKG SPKLEHFVQC LKRVFGEKLE SRCLAKSASY RVHVSPKEQR QSGKQHIASR TVSYWCFAPA LAMEELASLN VRSIIVTSGT LSPLPSYSME LGLNFPHTLE NPHIVSDNQI HVRVIGKGVS GKELTSSYER RKDGEYYSEL GNTLVALSKV TPAGMLVFFP SYSVMETCIE RWGGPSLNYN IKSPKTPWKR LLSTKSVVVE PKSSANLPEA IADFHRFLGM LKSPGCILMG VCRGKISEGI DFANEQSRAV VITGLPFPPS FDAKVKLKRD FLDGARAIGS MKASNVGGFA DNASSTSNLA TKLSGHDWYA QQAHRAVNQA IGRVIRNRSD YGAVLLLDSR FSQQRNQEGL SKWVRPHLKK DEGFGTAVGA LAKFYRESAL EAEVRETTES K
|
| |