Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_56623 |
Symbol | |
ID | 7200086 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | - |
Start bp | 452106 |
End bp | 454064 |
Gene Length | 1959 bp |
Protein Length | 513 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179434 |
Protein GI | 219117279 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.329951 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCATGGCCAC CCCCTCTTTT CGATCAAAGC TTGAAGCTCG AGTCGCCGCA GTCAACTCTC TCTTGTGCGT TGGTCTAGAC CCGCACGAGA AAGAGCTGTT TGCAGACGGA TGGGAAGGCG TGCCGGAAGA AAATCGCTGT GACGCGGCCT TTACCTTTTG CAAAACGTTG GTCGACGCAA CATTGCCTTA CACGGCCTGC TACAAACCCA ATGCTGCCTT TTTCGAGGCG TTAGGCGATG GAGGGATAGC GGTTCTGCGA CGAGTTTGTC AAAACATAAT ACCGGATGAT GTGCCGATTT TGTTGGATGT CAAGCGCGGC GACATTGGCT CGACCGCTGC GGCCTACGCC GAAGCGTGCT ATGGTTTGGG TGCAGACTGT GTCACGCTTT CACCACTGAT GGGATGGGAC TCAGTCAGTC CCTTTGTTAC AGGTAAGTTG TAGCTAAAGG ATTGGCATCT CCAGCGATTG AATGGATGCC ATCTGATACG TTGAATTGAC TCCCTCCGAC AGAAAAGTAC GTTCACAAAG GAGCATTTTT GCTGTGCAAA ACGTCAAATC CTGGATCCAA CGATTTTTTA GCTCTGGGAT TACGTTCAAA TGAATGTTTA TACGAAAGAA TTGCCAAGCT TGTTGGCTCG GAATGGGCTC AGCAGACCGA GAGTTCATTG GGACTCGTTG TCGGGGCCAC AGATCCAGTG GCCTTGTCCA AAGCGAGAAA GGCTGCAGGC GACGACACCT GGATTCTAGC ACCCGGCGTT GGTGCTCAAG GTGGAGATCT TCTAGAAGCA GCGCAGGCTG GATTGAATAC AAAGGGGACT TGCATGCTAA TTCCCGTGTC TAGGGGTATC AGCAAAGCTA CGGACCCAGC GCAGGCTGCA AAAGAATTGC AGGAGAGGAT TCAGAAAGCT CGGGACCAAG TCGTGGCCGC ACACATGATA AAAAAGAGTT CAGACGAAGA TATTAAACTC TATCAACGCG AGTTTCTTGA ATTTAGTCTG TCTCTAGGTG TTCTCAAATT CGGCTCTTTT GTGCTGAAAA GCGGCCGCAT CTCTCCATAT TTTTTCAACG CCGGTCTTTT TGCTTCTGGC GCTGCGTTAA GCAAGCTTGG GAAAGCCTAT GCTTCGACTA TCATGTCCTC GGAATTATTG TAAGTGTGCT TTGTGTGTTT TTCTCTGCTG AACGGCAAAA ATTCAAGAGA AGGATGAGTA TCCACTTGGT CCGTGTTACC GATCTGCCCC CACGTGAGTG GCAATGAGCA AATTTTTTTC CAGTGGCCTG ACTCTTGAAA CAACATAGTC GATGATGACT CCTTTGGTCT TCTTTCACCT AATTTCTCCG GAAAGATGCC GGTCAACACC AATTCGCTGA TTCGAAATTT TCTGAGACTG TGTTTTGATT TAGTTCTATG GGACTATCAT TGTTGTGAGC AGGCTTACCC AACAAAATTC GTTTTCTTTT TCCTTCCAGA GCTGCTGGGC CCAACCAAGT CAATTTTGAT GTGATTTTTG GTCCTGCATA CAAGGGTATT TCTCTAGGTG CTGTCGTTGG AAGCGCTCTG TATAACGATT TTGAAGTAGA TGTCGGTTTT GCGTATGACC GAAAAGAGGC AAAGGATCAT GGGGAAGGTG GTAAATTGGT CGGGACTTCG TTGGAAGGAA AACGAGTTCT GATTGTAGAT GACGTAATCA CAGCGGGAAC CGCCATTCGT GAGTCGCACA CTTTGCTCAA CGATGTGGGT GCTTTGCCAG TTGGAGTAGT TATTGCCCTC GATCGAGCCG AAATTCGCTC TATGGAGGAC AAGATTTCCG CTGTTCAAGC AGTCGCACGA GATCTATCTC TTTTGGTCGT GTCAATTGTC AGTCTTCCTC AACTACAGAC ATTTCTCGAA CGAAGTCCGG ACTACGGCGA TGAAACGCTG GAAAAAGTAA TTAAGTATCG AAACGAATAC GGAGTGTAA
|
Protein sequence | MATPSFRSKL EARVAAVNSL LCVGLDPHEK ELFADGWEGV PEENRCDAAF TFCKTLVDAT LPYTACYKPN AAFFEALGDG GIAVLRRVCQ NIIPDDVPIL LDVKRGDIGS TAAAYAEACY GLGADCVTLS PLMGWDSVSP FVTEKYVHKG AFLLCKTSNP GSNDFLALGL RSNECLYERI AKLVGSEWAQ QTESSLGLVV GATDPVALSK ARKAAGDDTW ILAPGVGAQG GDLLEAAQAG LNTKGTCMLI PVSRGISKAT DPAQAAKELQ ERIQKARDQV VAAHMIKKSS DEDIKLYQRE FLEFSLSLGV LKFGSFVLKS GRISPYFFNA GLFASGAALS KLGKAYASTI IAAGPNQVNF DVIFGPAYKG ISLGAVVGSA LYNDFEVDVG FAYDRKEAKD HGEGGKLVGT SLEGKRVLIV DDVITAGTAI RESHTLLNDV GALPVGVVIA LDRAEIRSME DKISAVQAVA RDLSLLVVSI VSLPQLQTFL ERSPDYGDET LEKVIKYRNE YGV
|
| |