Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_14922 |
Symbol | |
ID | 7203686 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011685 |
Strand | - |
Start bp | 530708 |
End bp | 534010 |
Gene Length | 3303 bp |
Protein Length | 572 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182978 |
Protein GI | 219125416 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCGCTG CTCTGATCTC CGATCGTCTC GGCGCTGACT CCATCATGCT CGCGGCCGTA ACGGCATTCA TGGCAGCCGA AATCATTACC ATTCGCGAAG GCTTGGCTGG ATTTTCGAAC GAAGGCTTGC TGACCGTGTT GGTTTTGTTC GTCGTAGCGG AAGGCATTTC CAAAACGGGT GCGCTCGATT GGTACATGGG TAGGCTGCTC GGTAATCCAC CCACCATTGC GTCGGCACAG TTGCGCCTCA TGGCGCCGAT TGCTGTGGTT TCGGCATTCT TGAACAACAC TCCGGTTGTT GTTGTCATGA TTCCTATCGT TCAGCGCTGG GCCAAGCAAA TTCGCGTTTC TCCGCAACAA TTGTTGATTC CGCTCTCCTT CGCGTCCATT TTGGGCGGAA CCTGTACACT GATTGGTACG AGTACAAACT TGGTGGTTCT CGGTTTGCTG GAAGAGCGGT ACCCGGATGA TCCGGACGTG GCGATTGCTT TGTTTTCGCT GGGAACGTAC GGCGTTCCGG TGGCCTTGAC TGGCATTGCC TACATTTTGT TGGCGTCGCC GGTGCTTCTT CCGGGCGGAC AAGGCCAAGG CGGCTCGAGT CCGCTAGAGA ACAACGAAGA TGTTTTACTA GGCGCTCGGC TGACGCAATG GTCGCCAGCG GCTTCGCGAA CAGTCAAGCG CAGTGGGCTG CGTGATACTG GAGGTATATA CCTGGTGTCG GTCCATCGGG CGGCCACCGG TAACGTCCAT CGGGCCGTCT CGAACGATTT CGTCCTCAAC GTGGGTGACA TCCTCTACTT TACTGGATTT GTCGAAAGCT TTGGCGAGTT TTGTGAGGAG CATGGACTCG AAGTCGTAAC GAACGAAGTT GAGACGTGCT TACCGGAAAC CCAAACACAC GAGACAAGGG ATACAGTGAC GGATCAAGTT CTTTGGGAAT CCCTAGACCA ACAGAGGAGC GAAACGAGCA TCGAACGCAA TCACGGCTTT TCTATGCTGA CAAAAAGCTT AGGATCATTG GATACCGTTG CTGAAGATCC AGACGCGATT CCGGTTGAAG TTGGAATGAC CAAAGAATCC TTATTGCGTG CAGACGAAGA CCAGCGACTG CGGAGCATCA ATCGAATGAC GGATCTGATT CGAGACGATG CACCCTCAAA AGACAACAGC ATACTCGATC CTAAACGGGA TAGATTGGTG TCGGAAAGGC TGCGAGGAGG CGATCCAGCG AAGATTGTAG TGACAATTGA TAAGGATCTA GTGGTCGTCG GAATCAACGT AAAAGATCGT TCCGGACTTA TGCTGGACAT TTCCAAGGGA CTCTTGCGAC TCAACCTACA ACTACACCAT ACCGAAGCTG CTGTGGTTGG CGATCGCTCT ATTTCTATCT GGCGCTGTGA AGTCATCGGT ACCGAACTAC CGGATTTGGA AGAGATATGG TCTGTAATGA ATGCTCTTTT GTCGATAGAA GGTGGAATTG CCACAATTAA ACAGCGAGGA CTTCGCGTGA TTCGAGCTCG GGTGGTACAA GGATCACGAT TAATAGGACG AAAAGGGGCC GACATAGATT TTCGGAAGCG CTACCAGGCT GCTATAGTGG CTCTGCAAAA TAACGGCAAG AACAGCACTC AGCCCCTTTC GCAGGTCAGC TTTGATGTTG GGGATGTTTT GGTTCTGCAA GTCGGGGAGG CTTCCCCGCT GCTCCAAGTG CCTCCGTCAA ATTTCTACAA GGGTCGCACA GATAATTCCA GGACGGATGA GAACGTATCG CGAAATTCTT CAGTAAGAAA TCTGGTGAAT ATGGTCACCT GGAGAAAGGC TAGTACAGAC AATTTGGAGG CTATGGACAA GTCACGGGCA GGCCGGGTTT CGGAACGAAT GGAAGGTGCT ACTCCCACCC AAGACGACGA CTGCTTTATT GAATCGGAGG GTTCTGAAGT TGCAATTGAA CGAAACGGCG ATGAGGAAGA TCCTGCCGTT GTTGATATGC CTGGGGCAAT GCAACAAATC GAAGAACAGG AGGTTGTTTG GAAAGATCTA CAGCTCCTCG TGCCTGACGA AAGGGTACAT AGCGGTGAAG GAGCAGCTCG CGAGTTTCTC ACCGCGATGC AAGTTGCCCC AAAATCCAAG TTGTCGGGGA AAACCGTTGC AAAAAGTGGC ATCGACAAGC TTCCAGACTT GTTCTTGGTT AGTATCGAAC GCCCCATCTC TGCAGGGACC TCTTTGCCAA CGAAGACCAA AAGACTATCA GTGATGTCTG GCGCATCTGA TGCGCATTCT CTGGGAGAGG ACAGCAATCA GCGCCTTGGC TCGATTCAAA CAGACAATCA GGCATACCAA TCCATTGCTC CAGAGGAGCC CCTTCAGCAC GGAGATGTTC TATGGTTCTC CGGCTCTGCA TCGTCCGTTG GCGATCTGCG CAAGATTCCA GGATTGATCT CGTATCAAAA CGATGAGGTG GAGAAAATCA ACGAGAAGGT GCATGATAGA CGTCTGGTTC AGGCTGTCAT TGCCAGAAAA GGACCATTGG TCGGGAAGAC TGTGAAGGAG GTCCAGTTCC GGAAGCGGTA TGGAGCCGCG GTGATTGCTG TACATCGCGA AGGCAAGCGT GTGCACGAGC ATCCGGGGAA CGTGAAGTTG CAAGCAGGTG ATGTGCTGTT ACTGGAGGCG GGTCCTTCGT TCATCGCCAA GAGTGGTGAG AACGACAGAT CGTTTGCTCT GCTAGCTGAA GTGGAGGACT CGGCCCCTCC TCGTTTGAGT CTTTTGATTC CTGCGTTGTT GATCACGGCA GGGATGCTGA TTGTATTTAT GGCTGACTGG ACGTCGCTAT TGGTTTCTGC ACTAGTGGCT TCAATGTTGA TGGTAGCTCT TGGTATTTTG TCAGAACAGG AGGCTCGGGA TGCGGTGAAT TGGGACGTGT TTATAACCAT CGCCGCAGCC TTTGGCATTG GTACAGCTCT TGTCAACTCA GGGGTGGCAG GAGGGATTGC TAACTTTTTG GTTGATGTAG GTACTGCTTT GGGTATTGGG AGCGCAGGGT TGCTTGGAGC CGTGTACTTT GCAACCTTTC TTATTTCAAA TGTGGTCACG AACAATGCAG CGGCGGCTCT GTTGTTCCCT ATTGCATTGG ATGCAGCGGA GCAGACAGGC ACTGATCGTG TTTTGATGAG TTATGCGTTG ATGTTGGGCG CGTCAGCCAG CTTTATGTCA CCTTATGGTT ACACAACGAA TTTGCTGATC TACGGTCCTG GAGGCTACAA GTACAAAGAC TTCCTTGTGT TTGGAACCCC AATGCAGATC GTG
|
Protein sequence | MFAALISDRL GADSIMLAAV TAFMAAEIIT IREGLAGFSN EGLLTVLVLF VVAEGISKTG ALDWYMGRLL GNPPTIASAQ LRLMAPIAVV SAFLNNTPVV VVMIPIVQRW AKQIRVSPQQ LLIPLSFASI LGGTCTLIGT STNLVVLGLL EERYPDDPDV AIALFSLGTY GVPVALTGIA YILLASPVLL PGGQGQGGSS PLENNEDVLL GARLTQWSPA ASRTVKRSGL RDTGGIYLVS VHRAATGNVH RAVSNDFVLN HGDVLWFSGS ASSVGDLRKI PGLISYQNDE VEKINEKVHD RRLVQAVIAR KGPLVGKTVK EVQFRKRYGA AVIAVHREGK RVHEHPGNVK LQAGDVLLLE AGPSFIAKSG ENDRSFALLA EVEDSAPPRL SLLIPALLIT AGMLIVFMAD WTSLLVSALV ASMLMVALGI LSEQEARDAV NWDVFITIAA AFGIGTALVN SGVAGGIANF LVDVGTALGI GSAGLLGAVY FATFLISNVV TNNAAAALLF PIALDAAEQT GTDRVLMSYA LMLGASASFM SPYGYTTNLL IYGPGGYKYK DFLVFGTPMQ IV
|
| |