Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_43504 |
Symbol | |
ID | 7197196 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | - |
Start bp | 655250 |
End bp | 658319 |
Gene Length | 3070 bp |
Protein Length | 977 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177978 |
Protein GI | 219112453 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGCCGC TTACCAAAGG AAAAAGCGGT AACGCAAAGA ATGCCGACTT CAAACGTATT AAGGCCAAAG TGGGCAAGAA AGCCCCAAAG CCAGCCAATG TCACGGACAC GGCGTTCCGC GCTGCGTCGT TGCAAACTAG CTCCCAGACG TCCTTCACCA ATGGCGGCGC TCCAGCAGCA AATCTTTCCG ATGATGTCTC ACTCTATTCA GCCCGTGGTC GATCTTTGCA AAGTTTGGCG TCTCAGTTGT CCCATCCGGC TGCCGCAGTA CGCGCATCCG CTGCGAAAGG GCTGTATGAT TTAGTTTCCG GAGCAGCAGC GACGGCTACC GGTGCGACTA GCAGTAGTCT CCTGCAAGCG CACTTGTCGG CTCTCATTCC AGCCGTTGGT AAATGCGTTG TCGACGAAGA CAGCGAGGTT CGAACTGTTG GAACGAACAT CCTCCGTGAA ACGGTAAACA AACTGAACGA AAAAGCTACG ATGGCACTGA GGCCATTCAT CAAACTTTTG ATTGCATTCG TGGCGTCGGC ACTGAATAGT TTGGATCGTG ATTCTCGTCG CGACGGTGCC ATCCTTGTCG AACTGCTGAG TTCGTCGGTA CCCACATTGG TAGCACCCTA CGCGGTGGAA CTGCTACCAG CCTTGATCCG ATCGCTGGAC GATCGGGACA CCCGTTTGCC GAAACAACCC GGCATCAACG GAAGCAGTGA CACCGGCACC AAAAAGCGCA AGCGTAGGCT GCCGAGCGTC GTATCGAATA AATCGACCGC TAATCTTAAT GGGCGTCACT TGTTGCTGCA GTCCCTTGTC ACCTTGCTCC AATCGGCAGC AACCCTCACA GGTAGTAGTA AATCCGAAAC GCAACAACAG AGTGCGACAG GAGAGTACTT GTCGGAGCCC GATTTAATTT TTGGCACTGG AGGGCGATCC CGAAATGCTG TGATTCTCCG AGGTCGACCA ACACGACGGC TTGCTCAACT GTCTCCCATT CAGAAGCTTA CCGACTTACC ATCCTTGGAA GCATTTCGGC TATCTTTAGC GAGAAACGAC GGCCCGATCA AAGTGGGTTT GAATACGTCT TTACCTCCGA AAACGGTTCA GCAATTGTAT CACAAATTGC GAGATTGCTT TGTTGAAACA ACTCAGCGAG GCTATTTTGA CGCTAAACAA GGCTACGTGA TGACTGTTAC CGATTTGTCT ACCTTCTTAC TCGTTGCAAA GGCTTTACGA TTGACCTGGG ACGTTTTTGG CAACGAATGG TCTCAGCTTC AAGCAGAATC GGATGTCGCT GACATGCGCA AAAGTTTTGC ACAGGCTGTG TCTTTAATCC TGGAGATTTT TCCGATTTCG CGTGCAGACG AAGCAACACA AGTTGTGGCC GACGATGTGA ATGGTGAGCT GTGTGTGACT CTGGTTTTGA TGGGGCCCAC CAGCGCCAAC CAGGGGAAGG AGAAAACTGG CTGGGCAGAT AAAGTTGTTG CGCACGTCTT GACTTCCATG GACGAAATGC ATGTGCAGTG TCACCAAGCC AGCTCCATGA GCCCAACTTC GGCACGCTCT GTCTTTGCTG TATTGAATGG GTTGGTTTTG ACTGGGCTGT GCAATGTGAA ATCTCAGAAC AGATTAATTA GTATGTTCAG CTCCACATTC TTTGCACCCG AAAGCAAGGA CTCAGCTCGA ATGTGCACAA GCATTTGTCG TCAGGCGACG GATGTGGCGA ACAGGATCTT TGAAAGGATA AACTATGATA TCGACAAGGC CAGTGAGCCA ATGCAAAAGC TTGCTGTTGA TGTCCTGAAG ACTATACCGG CCTATTTGGT AGCTTGGGGC GCCATCTACA TACCGGAAAG TTCGAGCGCA ATAGCATTGC TCCATCACCT TGTGCGTAGA CTGGGCGACG AAGATAAGAG CATGCTGGAT TTGGGCAAGC TGCGCAATGA TTTGGATCAG TTGATGATGG TCTCCGAAAG CATGCAGAAG CAAAAGTTCA CGACGTGGTC TACGGTCCTT GAATGCTATC CGCCTACGCT ACAACGACTG TTTGTAAGTC TCATTATTAT GCTCGGAAAG CCCAGCGATA TTACCCTGAA GCTTTTGGGG CGAATCTCTG CCCGATGCCA AGCCAAAGGC GACCCTGACA AGTTGGCCAT TTATGTTGCT CAATCAATGT TTAGCATCCG TAAGACCGTC TCTATGTCAT CGTTTCTGAC GTTTTTGATT GACGGCACCG GTGTCTTTCT TTTCGAAGAG TCGGCGTTTC ACAGTAAACC CAAGACCGAG GGAGATAACC GGTATATACG CCTGTTTGAA TTGGATAATG GTGTTCGTTT GGCAGCCACG AACCTTGTCG CGTGTGGATC CTCGGCCAAG ACACTTCCGA TGCTGGAAGC CCTCTTATCA ACATTAATTC GAAGCTGTGA CGTAACAAAG ACGTCGACAC AACGTGAGTT GTTCCGGATT CGAGCTGGTT TTTCGATTCT GGCACTCTTT GCCTTGGATC TCCGTCGTCA GGGATCCAGC ATTTTTGACA TTCTGCCCCG GGCGTTCAAG GTACAAGCCA TGGCCGGCAT CGGACGGCTT CTCGCACACG CACCGAGTCT CGACTCGGAA TCAAACGAAA ATATCGACAG GGCCCTTCAA GCGTGGATCC GTCCGGTGGT GACCCTGTTG GCTTCGGAGG ACGGACTTTT GGTAGATTCG TTTACGGTCC TGGTGTCGTC CTTGCATAAC TGGCCCGACA CCCACCGGAC CAGCGCGGTC CAGACCCTGT TGCTAGTCAT ACGGGCCCCG ACGTTGGCAC CCGTATTTCG ACGTAGTGAT ATTGCGGCCA TGGTGGCGCA GGCTAAAGTC TTGGAGCAGG CCTCGGCGGA GAGTCCGCTC GCAAGCATCG CGGGTCAAGT CGTGGCGGAA CTCGAATTGC ACATTAGCGG GTGAGCACAC TCAATGACAA GTCGCACGTA TACAACAGTA CAAGAAATGC TCTTTATTGA AATGCGAGCG CAACGGTGGC GAAAAAAAAT TGTTTCCAAT CATCGAATAA CGTTTAATAG CTAGAGCTTA TAGTAGAGGG
|
Protein sequence | MAPLTKGKSG NAKNADFKRI KAKVGKKAPK PANVTDTAFR AASLQTSSQT SFTNGGAPAA NLSDDVSLYS ARGRSLQSLA SQLSHPAAAV RASAAKGLYD LVSGAAATAT GATSSSLLQA HLSALIPAVG KCVVDEDSEV RTVGTNILRE TVNKLNEKAT MALRPFIKLL IAFVASALNS LDRDSRRDGA ILVELLSSSV PTLVAPYAVE LLPALIRSLD DRDTRLPKQP GINGSSDTGT KKRKRRLPSV VSNKSTANLN GRHLLLQSLV TLLQSAATLT GSSKSETQQQ SATGEYLSEP DLIFGTGGRS RNAVILRGRP TRRLAQLSPI QKLTDLPSLE AFRLSLARND GPIKVGLNTS LPPKTVQQLY HKLRDCFVET TQRGYFDAKQ GYVMTVTDLS TFLLVAKALR LTWDVFGNEW SQLQAESDVA DMRKSFAQAV SLILEIFPIS RADEATQVVA DDVNGELCVT LVLMGPTSAN QGKEKTGWAD KVVAHVLTSM DEMHVQCHQA SSMSPTSARS VFAVLNGLVL TGLCNVKSQN RLISMFSSTF FAPESKDSAR MCTSICRQAT DVANRIFERI NYDIDKASEP MQKLAVDVLK TIPAYLVAWG AIYIPESSSA IALLHHLVRR LGDEDKSMLD LGKLRNDLDQ LMMVSESMQK QKFTTWSTVL ECYPPTLQRL FVSLIIMLGK PSDITLKLLG RISARCQAKG DPDKLAIYVA QSMFSIRKTV SMSSFLTFLI DGTGVFLFEE SAFHSKPKTE GDNRYIRLFE LDNGVRLAAT NLVACGSSAK TLPMLEALLS TLIRSCDVTK TSTQRELFRI RAGFSILALF ALDLRRQGSS IFDILPRAFK VQAMAGIGRL LAHAPSLDSE SNENIDRALQ AWIRPVVTLL ASEDGLLVDS FTVLVSSLHN WPDTHRTSAV QTLLLVIRAP TLAPVFRRSD IAAMVAQAKV LEQASAESPL ASIAGQVVAE LELHISG
|
| |