Gene Information |
Locus tag | PHATRDRAFT_48034 |
Symbol | |
ID | 7203249 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | + |
Start bp | 770739 |
End bp | 772666 |
Gene Length | 1928 bp |
Protein Length | 537 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182297 |
Protein GI | 219123990 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0967386 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAAGTCGAGA GAGAGTGTAC GGCTCGTTTA CGGTTAGTCC AGTTCAGTCT AGTGTAGTAA GCCCAGCCTT TAGTCCATTC ATTCGGTAGT GTGTCTTTCG TCGTGTCTTT TAACCGCTCG AGCCGTTTGC TGACAAGTTT CGATTGAAAC CGTAGCAAAC GCGAGTCTCT AACTCTCACG TAGCTTCATT GTTGCGTTCC ATCGCCATGA TTGGAACCAA CCGTTGGTCC GTGCTTTGGG TAGCCGCAGC CTTGGGGTTC CCGCCGACGG CAAGGGCGAC GATTACCCTA GTGGATACGA AACAAAAGTT TGCCTCGACG CAGGATCGCA ACCTCGGCAA ATCTCTCTGG AAGAACAACG AGTACATGGC ACGCTTACAG TACGTTGACG GAAACCTACC CTTGTGTCCG TCGTCGCCCT CCGCGTCAAC ACACCCCAAG TACAATCTTA CCGTACCGAG TGACGACTCT CCGGTAGCCG TACTAGTGCG TGGTGGAGGA TGTACTTTGG AAGACAAGGT ACACTTTGCT CTCCACAATC TGGAACCAGC CGGCTTGGTC AAGTTCTTGA TTGTCGATGG TGATCATACT CTCTCTACCG AAAGTACATC GTTACTTACG GATTCACTCT CCGCGCATCA AGACGCGCTA GAAATAAAGC GTTCCCCGGA CCAGTTCGAT ACGAAACAAG ACAACGACAT TCCTCTCTAT ATTCTACACG TCTCCTATCA CACGGAATAC GATTTGCTCG ATATTATTCT GCATCAATCA TCCCGAAGTC AATCACTGGG AGGACCACGT ATAGTCATGG ATAGTCGTCT AGGCACGGGG TTTTTATCGG GACCAGCGGC GGTGTGGATT GGTCTCTTGT CGCTACTGAG TGCCTGTGCA TGTTCCTGCT TACTCGTGCA CGGAGGGTCG CAGTGGATGC CCGATCCGGA CGATCAGGAA CACGTCCCGC AGCGACCCAC ACGTCGTAGA CTCACCAAAA ATCAAGTCAA AGCAATGCTC CCAGTGTACC AGTTTGACGG AGAAACGATT AAACCCGCCC ACGGTCGATC CACGCCCGCG TTGGTGGGTC CGGACGGCCT CGAGACCGAA ATCCTCTTGC CCGATCCGGC CACGTTAGAA TGCTGCTCCA TTTGTATCGA TGACTACGAA TCGGGCGACC GATTACGAAT GCTGCCTTGT CATCATCTAT TTCATTCCAA ATGCATCGGA AGATGGTTGT CGGAACGATC GTCAACTTGT CCACTCTGTA AATTGGATCT CTTTCAGGAA GACGACGAAG AAGAGACCGA TGAAGAGCAG CCTGTACAAG AGCCTTTGCG CTCAAATCCC CTTACGTCGA CTGATGGCGT GAGTGTGTGG CGCTCCGTCT TTGGCTTGGC CGAACTACAA ACACAACCGA CACATACGCA ATCATTGGAA GCAACCAACG ATAACCATGG AGCGCCGGAA GTAGTCGAAT CGAGACCACC ATCCACTTCT ACTCAATCAT GGTGGCGTCG TTTGTTGCCG TCGCGGTCAG CACCGGTTGA GCCGTCGACG GAGGAAAGGT TGACGGAACC GCTCTTGCCA GCGGAAGACG AAGAGGCACC GGCACCGAGC ACGTCGTCGG GTACTCCCGA AACGCATGTT GCTACGGCTG ATACTGCTCG ACACGAGCAA GTAGTGTCAG AGGATGGTTC AGAAGATGGA GCGTCTACCG GTGAGCTAGA GCAATCCGAA GACCTAAGAT CGGCGGAAAC GACTGGATCC AACCTTGTGA ATTTGCCGTC CATCTCACCC GACCCTCCTT 
CGAGACAAGA GTCGGTGTAA ATTAAAGTAA GCTTCCATTT TCTTTGCCCA TTGTATTTGC CGCGACCAGC CACTACTTTC TTCGGGCCAC ATTTATATTA ATGGCTAAGC GTTTCTTATT TTGTTTAC
|
Protein sequence | MIGTNRWSVL WVAAALGFPP TARATITLVD TKQKFASTQD RNLGKSLWKN NEYMARLQYV DGNLPLCPSS PSASTHPKYN LTVPSDDSPV AVLVRGGGCT LEDKVHFALH NLEPAGLVKF LIVDGDHTLS TESTSLLTDS LSAHQDALEI KRSPDQFDTK QDNDIPLYIL HVSYHTEYDL LDIILHQSSR SQSLGGPRIV MDSRLGTGFL SGPAAVWIGL LSLLSACACS CLLVHGGSQW MPDPDDQEHV PQRPTRRRLT KNQVKAMLPV YQFDGETIKP AHGRSTPALV GPDGLETEIL LPDPATLECC SICIDDYESG DRLRMLPCHH LFHSKCIGRW LSERSSTCPL CKLDLFQEDD EEETDEEQPV QEPLRSNPLT STDGVSVWRS VFGLAELQTQ PTHTQSLEAT NDNHGAPEVV ESRPPSTSTQ SWWRRLLPSR SAPVEPSTEE RLTEPLLPAE DEEAPAPSTS SGTPETHVAT ADTARHEQVV SEDGSEDGAS TGELEQSEDL RSAETTGSNL VNLPSISPDP PSRQESV
|
| |
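The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names `gene_length` and `cds_length` are hypothetical, and it assumes the Start bp and End bp values are 1-based and inclusive, which is consistent with the reported gene length of 1928 bp. It also assumes the coding region ends with a single stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def cds_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting one stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values copied from the record above.
start_bp, end_bp = 770739, 772666
protein_aa = 537

total = gene_length(start_bp, end_bp)   # span on replicon NC_011683
coding = cds_length(protein_aa)         # 537 codons plus one stop codon
flanking = total - coding               # bases in the span that are not coding

print(f"gene span: {total} bp")
print(f"coding:    {coding} bp")
print(f"flanking:  {flanking} bp")
```

Under these assumptions the 537 aa protein accounts for 1614 bp of the 1928 bp span. The remaining 314 bp lie outside the open reading frame, which fits the listed gene sequence: it begins upstream of the first ATG and continues past the stop codon.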