Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44500 |
Symbol | |
ID | 7197723 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | - |
Start bp | 744126 |
End bp | 746051 |
Gene Length | 1926 bp |
Protein Length | 602 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178568 |
Protein GI | 219115545 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTCGCA TCGCCAACAG TAGTGGAAAC TATGACTGCG TGGCCAGGGA TAGGGGCACG GGCGTGCATA CCCGTGAAGT CTTTGCTATT TTTTTGCCCC CCTTTCTTGC CGTAAACCAA ACAATTAGGG GCGTGGAGTA TACACTTCGT GATTGTAAGC AATAGCGAAA ACACTTGTTC ACCGTGGCGG TGAGAAGTGT ATCAAAATCT ACGGCTCATC AACACGAGGT GTTCATCGCT CCAGTCTTCC GACTAGCTGG GGTTTCCGTT TATCCACCGA CGAGATCGCA CACATTGATT GTGAGACCAG GCGTCGGTGT TTCCGGAGTC TTTGGCGTAA CCACGACGCT GGGCGCATAC GTAACATGCG GTACCGCAGT ACCAATCCGC TTTCAATGTC ACCGGATCGC TCTTCACCTG TGGATGATGC TGATATCGCA GCGGGACAGA AACGTAATGT AAACGGATCC CTCCCGCGGA AGGGTCTGCT TCCAGCGCTT ACTTCCACGC CTCCACGAAG TTTGCAGTCA AACCTATCGC TGGCTTCGTT GAACACCAAT CGGGGACACC CGAGTGACTT AGAGTTCTAT GAAGATCTGG TTTCAGAGCC AGTGCTGGTG CTTGGCATGG ATATTTCTCA TCTGAGCCGC CGCGAACAGT TCATAGTGAC AGCAATAGGA GTCTTTTGTT TCTCTCTCCT GTACGGCTAT CTGCAGGAGC TGATTTCGGT TGAACTCTGT AATCGCCAAC TTGGTCTCTT TCTAGCCATG GTACAGTTTA CCGGGTACAC AGTACTGGCA TTTTTCCTCC GTAACTTTGT CTACCACAAG CAAAGGTCCA TGTCAAGGGC TGTCCACAAA GATAACGACG ACTCTCTTGG CGCGACTGGT CCCCAAAAAC AGGTGCCATT TCGTCTTTAC CTTGGACTGA GTTTGCTGCG CGCCGTTGAC TTGGCTATGA CAAACATGGC CATGCAATAT TTGAATTATC CGGCCAAGAC ACTCATGAAA TCTTCCAGAA TTGTGTTTAC CATGTTCTTT GGTGTAGTAA TTCAGCGCAA GAAATATCAT CTTGGAGATT ACCTAATTGT GCTGGCGATG GTTGCTGGTC TGGCACTCTT CATGCATGCC GATGCAAATT CTGACGCCAT TTTTCACCAC ATGGGAGTCA TCATGCTGAC GATATCTTTG ATTTGCGACG GGGCTATTTC CAACATGAGC GAAAGCATTA TGAAGGATTA TGGTGTTGGA CAAGATGAAG TACGTAAAAG CATTGGCATC CCACTATACA ATTTAGAAAT AGTCTCTCAG CAACGCCTTT TTTCTTTTTA CCCAGTTCAT TTTTCGGATG TACTCGATTG CCTTGATCGC TATTGCAGCT GCTGCCGCAT ATCGTGGCGA TCTGCAGGAA GGAATACGCT GGATGCACCA ACCAGGAACT TACGCACAAA TCGACCACCA GGCGGAGGAG CGTACTTGGT CTATTTTGGG CAAGATCACT GTTATGACAT TGTTCAGCTC AATGGGGTTT TTTGGTTCAT CGTGTTCTGC TGCAATCACG AAGAATTTTG GAGCGTTAAC AATGTCGATA ACGAGTACTG CTCGCAAGGC AACAACTCTT TTCCTTTCCT TCGCGCTTTT CCACAACGTT TGTACATCTG AGCATTTGAT GGGTATTATA GTCTTCATTT CCGCACTTAC GACCAAATCA TTACGGCGTG GACGCGTAAA GAAAAAACGA ACGCGCAAGA TACTACAGCA GCCGTCTCAA GTTGACCTGG AGTCCCTAGA TACGCCCGAA GACTGGCCTT TAAATCGTCG GTCTTACAGC AGCGATGGCA TAGAAGTGCC CTCTATGAAA CTTGGTATTA GAGAGAATTT TGGAACCGCT GGTCGTGGGG CAAACAATAG TACTCATGTT GTATAG
|
Protein sequence | MRRIANSSGN YDCVARDRGT GVHTREVFAI FLPPFLAVNQ TIRGVEYTLR DCVHRSSLPT SWGFRLSTDE IAHIDCETRR RCFRSLWRNH DAGRIRNMRY RSTNPLSMSP DRSSPVDDAD IAAGQKRNVN GSLPRKGLLP ALTSTPPRSL QSNLSLASLN TNRGHPSDLE FYEDLVSEPV LVLGMDISHL SRREQFIVTA IGVFCFSLLY GYLQELISVE LCNRQLGLFL AMVQFTGYTV LAFFLRNFVY HKQRSMSRAV HKDNDDSLGA TGPQKQVPFR LYLGLSLLRA VDLAMTNMAM QYLNYPAKTL MKSSRIVFTM FFGVVIQRKK YHLGDYLIVL AMVAGLALFM HADANSDAIF HHMGVIMLTI SLICDGAISN MSESIMKDYG VGQDESLSNA FFLFTQFIFR MYSIALIAIA AAAAYRGDLQ EGIRWMHQPG TYAQIDHQAE ERTWSILGKI TVMTLFSSMG FFGSSCSAAI TKNFGALTMS ITSTARKATT LFLSFALFHN VCTSEHLMGI IVFISALTTK SLRRGRVKKK RTRKILQQPS QVDLESLDTP EDWPLNRRSY SSDGIEVPSM KLGIRENFGT AGRGANNSTH VV
|
| |