Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_46366 |
Symbol | |
ID | 7201632 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | + |
Start bp | 125214 |
End bp | 127847 |
Gene Length | 2634 bp |
Protein Length | 519 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180761 |
Protein GI | 219120026 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0951479 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TACATTAGTT ACTCGGTTCT TGTCACTATT CTCACCAAAA GCATCATGGT GCTACAAAAG CGCCTGACTC GCGGTCTCTC TTTCGCTGTA ATTCTTTTTC TTGCCAGCAC TGCGAACAAA GCCGCTGCCT TTCAAGCGTC ACTAGTTTCA AGCAGAACCG TATCTCGGTC GTTCGATACA GCCATCGGGC CAGTTGCAAA GGACATTGAA CCTAGTCCAC TTTCATACTC CGAAAAGTCT AGATTGTACC GACGAGACTT TTATGATCAT GATTCCTGGC TCAAGCACCG TGCCAAAAAT CGTTTTGTCG GAACGCTTTC CAAAGTGTTA GACTCGGGTG TTGTTCGACA ACTTGCGGAC GAAGTGATAT TGATTGGAGC TGTCGCTACC TTTGTTTGTG TGTTTAATGC ATTGTGTGTA ACTGGTTATG AAGATTTCAG CAATCTACAT CACGACCCTA TATTTAATTT TGGCGCTCCG TTGGTGAAGT TACCGGGAGA GCCATTCAGT CTGTCGAGTC CGGCTTTGTC GCTGTTGCTT GGTACGGAAC AATTTGTTGA TATTGTGTTG AAGTGATACC TGTTACCGTG TTCAACTAAT CGATGCTCAC TTGCATCTTC TGCTCTCGTA TGACTCAGTT TTTAAAACCA ACACATCTTA CCAGCGATGG GATGAAGCCC GCAAAGCATG GGGTGTGATT GTCAATAACT CGCGTACGGT CATCCGCGAA ACGTCGGCGT GGGTGCTACA AAGCGATCTT TCCGACGAAG AGAAGTACCG CTTGATTCGA AGGGTGGCTG ACTGTGTCTG GCTGTTTCCG CGTTCGCTGC AACGCCATTT GCTGAATCCT GCCGAAGATG AAGAAGCGTA CGTCAAAGAC GTTCGCGCGA AGCTTGATCC TGTGCTTGCA GAGGACTTGG TGAGTGCACG ACATCGTCCG ACACGAGCAA TGTACGAAAT GTCCAAAGCT GTAAATACCT TGCCGCTAGA TTCCTACCAG CGGTCAACCA TTGACCAAGG GGTTTCACAG CTGTGCGACG CTTGCGGAGG TTGCGAGCGT ATTTTCGGTT CACCCGTGCC GAGTATTTAC ACCCGTCACG CCGCCCGGTT TATAGAACTC TGGATGTTTT TCTTGCCATT AGCTTTGTAT TCGCCCTTTT CGATATCGTG GAATCACTGG TTTATGATTC CTTCATCCAT GATTATTGGA TTCTTTCTGT TGGGCATTGA AGAATTGGCC ATTCAGCTGG AAGAACCCTT TTCTGTGTTG CCGCTGGGTA AGATAGCTAG TGGAATCGGA CTCTCAGCAG AGGAGCATGT GCAATGGGCT GAAGAACAGA TGGCAATTGA TAGAACAAGC GGACCGTCCA GTTACGGGTA TCCGGCTCCG ATTCCAGAAT CGACAGTTGT TTCTAGAGGT CCTACCTACT CAACCCCTAC TTGCCAACTA CAGCAAACCC CTGTGCCCCT TGCCAATGGC GGCTTTTCCT ACTTGAATAA TCCTTCACAG CAATCTATAA GCACATCGGA GCTGGACAAT GACGAATCAT CATATCAAGA ACCTCCGACG GAGCAACGGC AGCCTCCTGT CGTAGCCACC GGTGATTATT CCTATTTGAA TCCTCCAGCC CTGTCTGAAG ATGGGCAGCC TCAAAATGGC GGACTGACAT ACCAGTTTGG ACCTCCTCGG CGGCTGCCTT GAAGTTCCTA CCAAAATTGA AGGTTCTATG AAGCTGGCAA GCTATAGAAG TGTTCTAGCA AAACAGATTC AATTCCCTTG TAGTCTAAAT GGATGTGGGG TACGATTGTT ACTCAATTAC ACGGCCTTGC TTCTCGTCGA CTTGTTTGTG GAACAGTTCG TGAGCTTCCT TCCATGCTCG TTGAAACCCC CTCTCTTGAT CAGCTTCCCG CCAAGTCAGC GAAATTTCGG GGCATGACTT GAGTGGTTGT TTCCACCACA TAGTTACTTC TTGCATGGGA ACAGCCTTGC GTAGATTCAC AACAATGTAC GGAATGGAAT GCCATCGTTG TATGCACCAA TCAATCTGAT CTTCCTCTTC ATCTAAATGT ACAGCGTGCG GCAATTCATC TCGAAGCCAA ATCGACTCGT TTAAATGAAT TTCAAGCGTT TGTAGACAGC TGCCTACAGC ACAATTTCGA TCTTTGAAAT CGAGAACATT CTTGACCCGA CAGTTCCATC CTTGAACGCA CGCTCCGTCC TGCAGGACTA ACCGTAGCGT AACTGTGTGC CGGTCTTGAC TCCACAAAAG CTTAACTTGG CCGTCCGCAA CAACAGCGTT GCACTCACCC CCTTTTTCGG TCCAGCTTTC GGGAAACGAA GCAAAAATGT CTGTTCCGCT GGACGATTGA ATCTGGACCG GCTCCTTGCT TGATACAGTA GCAGGTGATT TCGCCGTGGC CGATTCCGAA TATTCCCTTG CGGGTCCCAA AGTATGCACA TGTAGTTCAC CGCCCGCAGT CCGAGTGATG CGACTAGGCT GATCAAGTCT TGTTACGGAA GGCTGCGACC TCGGAGTGTG CATACCGGAA CCTGCGCTCT CCTCTTCCCC GCTGCTGATG TCCTCGTCAA ATTTCATCCG ATCCCATTTG GAATAGTCAA TGTTAGGCAT TTTC
|
Protein sequence | MVLQKRLTRG LSFAVILFLA STANKAAAFQ ASLVSSRTVS RSFDTAIGPV AKDIEPSPLS YSEKSRLYRR DFYDHDSWLK HRAKNRFVGT LSKVLDSGVV RQLADEVILI GAVATFVCVF NALCVTGYED FSNLHHDPIF NFGAPLVKLP GEPFSLSSPA LSLLLVFKTN TSYQRWDEAR KAWGVIVNNS RTVIRETSAW VLQSDLSDEE KYRLIRRVAD CVWLFPRSLQ RHLLNPAEDE EAYVKDVRAK LDPVLAEDLV SARHRPTRAM YEMSKAVNTL PLDSYQRSTI DQGVSQLCDA CGGCERIFGS PVPSIYTRHA ARFIELWMFF LPLALYSPFS ISWNHWFMIP SSMIIGFFLL GIEELAIQLE EPFSVLPLGK IASGIGLSAE EHVQWAEEQM AIDRTSGPSS YGYPAPIPES TVVSRGPTYS TPTCQLQQTP VPLANGGFSY LNNPSQQSIS TSELDNDESS YQEPPTEQRQ PPVVATGDYS YLNPPALSED GQPQNGGLTY QFGPPRRLP
|
| |