Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_55073 |
Symbol | |
ID | 7198275 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011691 |
Strand | - |
Start bp | 190964 |
End bp | 193891 |
Gene Length | 2928 bp |
Protein Length | 934 aa |
Translation table | |
GC content | 58% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184417 |
Protein GI | 219128432 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGTTGG AAGAGGAGGA CGAGGAGGTC CAGGCGCGAC CGACGAACGT GGACCCGCCG TCGACCCGTA CGGTCCCCCG AAACACCACC GTCCCGACTT ACGCGACGCC GGGAAAACCC AGTACTGCTG TTCCCACTCC CGCCACTCAC ACGGCCCCAC CACCACCACC CTTCGTGACT CCCGCCGCAC GCGGCGTCGC GTCGGACCTT CACGAGACGA CACCCTTGCT GACCGTCGTC CCACCGATCG ACCCGCCCTT GCTCGATCCG AGACTGCCGG AATCCTCCCG GGACGGGGCC AATCCGGAAA CCGACGCAAG TGTTGATAGT GACACGTCGG CGCCTAAGCA AGTCCGTTTC GATGCGCACG TTTCGCCTGC ACCAGCCGAA GCGGCGCGTA CCGGATTGGT CTCGCGACGG GGAGGTCGCA TGGCGTCGCC CGCGAGACGA CGGCCCATTC CACCCTCGTC CACTGGTGCC GACGCGGCGG GAGCTGCCGG CTCTCCGTCG ACCGTGTCGT CGCCCCTCGC CGCCACAGTC CTAGAGTATC AAACCTTTCG CAGCCCCTCC CGCAAAAATG ACGAAGTCCC GTCACTGTGC TCCATCCATA GTGGTGGGGG ACCGTTGGGC AACCGCACGG AATCCTTTCC CAATCACCAA GACAAGGCAC CGGCAGATTC CATTTCTTGT GATCCTTCGG AAGAAAATAC TACCAGCCCC TTGGCAACCG ACTCGCGGAA AGACAGTGTC TCGTTTTCGC CCAAAATGAA CGACGAAACG GTACGTAGTC TTGCCGACGA CAATGACGAC GGCAGCTTTC GGAACCTGGC GAGAATGTTC TCACCCACCA CGTGTGTTTT TTTCTATTTC CTCCACGGCC CGTCCCACTC TGCATACTTC TAGACTGCCG CGGATCACCG CTCGACCACG CCCACCGATT TCAGCGTACA GCGACTGCCC AAGTTGGATA CCGACGCTTC GCAGCCCACC AAGACCCCAA GTGTCTTTCT GTCACCCAGC CCGTCCTCGC GGCCGTTACC CGAAGCTTCC TCATTCGAAG AGAACAAGGG CAGCCACGCA GCTGTCGAAG GAGGTGGTGA CGCCAAGCCG TCACCGCGTC ACGAACGCAA TTTGGCTACC CCCACCGACT TTGCCTGGGA TTACGGTCGG GGACCACCAA CGTCGACCGG CTCCTTCGAC CACTCCAACG TGCTAGCGTG GTTGCAGTCT CCCACCGCCA ATGGATTCTT TTCACCGGGC GGATACGGCT CAATTGTAAA TACGCCACAT ACCGGAATCC CGCGGACTCC CGGGACGCCC ACCGTCAGTA CGAGCTTTTT CTTTACGGAC GTCGCTACCC TGCCACGAGG CAACGATCTG ACGCCCCGTA ACGGTGATGG CGGCGAAACA CCCGTGAGGG ACGGTTCACG GCGCTCGGCC TCCCACGGTA TTTCGAGCAT TATCTGTATT TCCCCGTTGG CCTCAGCCAA GGTACGGGGA TCAACCACGA GCACGCCCAT GAATTTGAAA GATGTCTTTG CCTCACCCCG AGAAAAGTCC AGAATGCGGG GACTGCCGTT ACTGAACGAT ACCCCGGCGA AGCGGCAACA GTTACGCGTT CCCCGGAGAA GCTCCAGTAA GGATCCCAGC GTGGATGCGG TACATTTGGC GGAACGTGAT TTGATGGAAG ACGAAGATTT GAGTGTGCTG TTGCAACTGG CGTCCAACAC GCCACAACAC CGAGAACCAG ACGGTGCGCA CGGGAATGGA GTCGATCCGG TCTTTCGGTC GCCTGACTAC CGCAAGAATA TTGGGGACGA CGAAAGTGGT GAGAACCTTC CAACACTACA GCTGCCAATG ATTGGTAACG GGCGGCACGA CGAGTTACTG TCAAGCAGAC TGGCACAAAA GTCGCGATCT AGAGACCTCG GTGACGCCGA CGACTTTGCT CCTCCTCCTC ATCTCGGAAT GCGGTCAACC TCTTCCAGCG GATCAAAAGA GGCTTATGCA AAAGTGTTGA GTCTACCCAA TAGGAGCGGA AGCGGCAAGG TCAACAAAAG TGAAGGTCAA GCCGCCTCGA ATCAATACCC GACGCATCCA TCGTACCCTC AACCGATTTC CAGCCAGGAT CATTCTTCCT ATTACTCAAT GCCCCACGGA GTCCCCTCAG GTCCTTCGGG TAGTATGCGT ATCAGCATGG GTGGACCTCC ACCTACAGCA GCTGGAAAAG GATCACCGTC TCGTCCACAT GGAGGAAGGT CGCCTCACGA TGCGCCGCAT CCGTATCACG ACTACTCGAC ACACAATGGA ATGCAATACC CTTCTCAACA TCCCATGTAT TCGCAATACG CGCCGTACGG TACATATCCT TATGCATATC CGCCTCTGAG GCAAATGCCA ATGTACAGTG CGCAGCATCC GTCAGCGGGA CCCACAACGC CGCTCAAGAA GAGTGTGGTC AAAATGAAAT CTGGCACCAA GAGACCTTTG ACGGAGAAGC TAGCACTGGG CTCAGCAAAG AAACAAAGAA AATCTCCGAG CGCCAGCGCA AAGAAGAAAA ACAAGTCGCC ACAAATTACC GACAAGGCGG AGCTGCAAAA AGCTGCTGAC GCCATTCGAG CGGTGAATGC AGCGAGCGGA GGAAAGAACG ACAAGGCGGC GGCCTTGGCA GCAGCTATTT TGCGTGGTGT CACAATGCGA CCTTCAGGAA AATGGCAAGC GCAACTGTAC TTTTCCGGCA AGTCGCGGTA TATCGGGGTG TTTGATACAC GAGAAAAAGC GGCGTTGGCG TACGAGATTG CCCGAGAGAA ACTCAAGGCG GGAGGCGGTG AAGGGGCCGG TAGTCAGAGT CCGAAAACAA CCGAAAACTT GGTGAACACA GCTCGAAAGG CAGCCTTTGA TGGTGTCAAT GAAAAGCTTG CCAAGTAG
|
Protein sequence | MVLEEEDEEV QARPTNVDPP STRTVPRNTT VPTYATPGKP STAVPTPATH TAPPPPPFVT PAARGVASDL HETTPLLTVV PPIDPPLLDP RLPESSRDGA NPETDASVDS DTSAPKQVRF DAHVSPAPAE AARTGLVSRR GGRMASPARR RPIPPSSTGA DAAGAAGSPS TVSSPLAATV LEYQTFRSPS RKNDEVPSLC SIHSGGGPLG NRTESFPNHQ DKAPADSISC DPSEENTTSP LATDSRKDSV SFSPKMNDET TAADHRSTTP TDFSVQRLPK LDTDASQPTK TPSVFLSPSP SSRPLPEASS FEENKGSHAA VEGGGDAKPS PRHERNLATP TDFAWDYGRG PPTSTGSFDH SNVLAWLQSP TANGFFSPGG YGSIVNTPHT GIPRTPGTPT VSTSFFFTDV ATLPRGNDLT PRNGDGGETP VRDGSRRSAS HGISSIICIS PLASAKVRGS TTSTPMNLKD VFASPREKSR MRGLPLLNDT PAKRQQLRVP RRSSSKDPSV DAVHLAERDL MEDEDLSVLL QLASNTPQHR EPDGAHGNGV DPVFRSPDYR KNIGDDESGE NLPTLQLPMI GNGRHDELLS SRLAQKSRSR DLGDADDFAP PPHLGMRSTS SSGSKEAYAK VLSLPNRSGS GKVNKSEGQA ASNQYPTHPS YPQPISSQDH SSYYSMPHGV PSGPSGSMRI SMGGPPPTAA GKGSPSRPHG GRSPHDAPHP YHDYSTHNGM QYPSQHPMYS QYAPYGTYPY AYPPLRQMPM YSAQHPSAGP TTPLKKSVVK MKSGTKRPLT EKLALGSAKK QRKSPSASAK KKNKSPQITD KAELQKAADA IRAVNAASGG KNDKAAALAA AILRGVTMRP SGKWQAQLYF SGKSRYIGVF DTREKAALAY EIAREKLKAG GGEGAGSQSP KTTENLVNTA RKAAFDGVNE KLAK
|
| |