Gene Information |
Locus tag | PHATRDRAFT_50640 |
Symbol | |
ID | 7199477 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011701 |
Strand | + |
Start bp | 55891 |
End bp | 59267 |
Gene Length | 3377 bp |
Protein Length | 1001 aa |
Translation table | |
GC content | 44% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185602 |
Protein GI | 219130924 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCGCGAGATA TACCAAACAC CGTCCTTTTC ATTTTGCACA TAGTCATTTT GTTGCATCTA CAGTAAAAGA GTCAGTGTTT GCTTCAATGC TTGGAGACGG AACAAGACAA GTCCACAAGG CCAACAATGG TCCAAGTCGA AATAATGTCT AAAATATATG GTGAGCTTGT AGCATATCCA CCACTCCGGA TTCTGCAAAA TGGGAAGGAA CATGACCCTT ACGTTGTTGT AGATGAAGCT ATTGAAAAAT TTGGTGCAGC AGTCAAAGAT CAAATGCTTA CAAAGGAACA AGCTTTGGTG AAGGTGACAA AAGTTGCTAA TTCAATTGAC TGGCTACCTA CTGAACTGAA AGAACTGGTT GATGCACACT GTCGTAAAGA CTCAGATGTT GACAATGATG GATATGTCAA TAAGATGAGA CTTTCAATCA AGGCAAATGA ACTCTTTGGG AAGATGAAGG CCAGCACCTA TTTTGTCAAC TTTTACCAGC TCAAGCAGGT TGCTTCACGA TTTGCCGCAC ACTGGGGATT TGTTGTTGTT TCGTCTGGCA ACAAATTGTC ATGCTTTTTT GCCAAGAGCT CAACTAAACC ACGTGAGTCA ATTGTGTCAC CAACAAGACA AAGAGAGCGG ACAAGCATCA AATCAAATTG TTCATTTATC ATCCGATCAT CATCTTGTTG CAAGGATGAC ACAAAACCGC GGCATCGGAG AGCTGTGAAA TTAACATCAT ATGAATTAGA GCACAGCACG GAATGCCATC CAGGTGTTAA GGAGCAGCGC CTTGCAAGGA AGGCTGCTGG CATAACAATT GGTGGCTTGG ATCTGACAAA GGTCAATGAT ATTGTGACAT TAATAGCTGC AGGAAATATC AATGCTTGCC AGATGAAGAG TTTGCTAAAA GATCATGTGC CAGAGCACTA TGCAATTACT GCTTCTGATA TATGCAACAT CAGAAAGCGA GCAGTCAAGT ACTCCATAGA ACAAACACCA ATAAACATTG CAACAGCACA GAAATTAATT GATTTTGTTC CATTGGATGA GGATGAGACC ATAATCTCAA AGGATGATGA TGTGTCCAGA GAGAAAGTGG CTGAATTTAT GAGACAGGTT CTACAGGACA CAGGTGAAGG TTGGAAGGCA CTTGCATTTC TTGAAAAAGT GAAATCTGAG ACCATTGGCT TTGACTTCCG TGTGCATTAC GATATTGACG TACGGCCAAT TGGCATTGTT TGGATTACCA AAAGTATGCG CAAAGCCTGG ATCCGGTTTG GGAGCACAAT ATACCTGGAT GCTATGAAAA GGAAAATGAA CAGTCTCCAC TGGCCATACA TTGGTCCTGT TGCAATGGAT CATGAAATGC GAGTGGTTCC ACTCTGTGAA AGTATCTGTT TGGGGGAAAC ACTTGCCGCA TATGCATTTG CCTTAAATTC TTTGGAGCAA ATGGAGCCAC GGCGAAAATT GGCCTCTATC CGTCTCATCT ACGCAGACTG CTTTTTAACA GATGCACTCC TACCATTGGT AGGTCTGAAG CGCCCTTCCA CAACTCTTGC ATGGGATTCC TTTCACCTGA AGTCAAAAGT ATGGCCAGAA TACTTTGGAC CAACCCTGTT TGACCAGTTA AAGGCTTCAC TTGGGAAGAT GCTTTATGGA AAGTCACGCA AAGAATATGA TGAAGCATAC CAGGAGATAG CACAAACACT GGCCCACAAC CCTGCTAAGC TTGAGTATGT GAAGGGATTG TATGATCACC CAGAACGGTT TGCTCACCAC TTTATCAAAA CCATACCTGG AAATCTTGGC 
AAATCCAGCA GTCAACCAGC AGAATCAAAC CACTCTAGTG TTGTAGCTCG AGTTGGTCCA GGCTCATCAC AAGACATTGT CAAAGAAATC AGTGCACTTC TGTACCGACG ACAAGACTTG GCCAACTTGC ATGAACAAGA AGACGCAAGA TATGAGCTAT TATCCTTTAA TAGGGCTTCA AAAACCAAAA CAACCACACT ATTGCATGAT CTGTCAGATG CTGGGGCACA CAAAGCACTC TCAAAGAGAT TCTTCAAAGA TTATTGGTGT CCTCTTTCAG AAGAGGGAAA GAGCTATACC CATTTATGTC TTCCATGTGG CTCTCACCAA ATTTTTCACA TGGATACTCC TAAGCTGGAT GATTTCGCCA TTGTTATAAA GAAAGGAGAG CGTTGCACTT GTTCCCAGCA AAAGGAGTTT GGGGGTATGT GCCTGCATGA ATATGCTCTA CATAATCACA CCTTTGAACT TAGCTTGTTT CCAGAAAGAA TGCTCCAGCC ACACTTGCAA ACAGCAATTC GTCCAACCAG CTCCAACAAT GATACTTATG ATCATTATGG TTGCAATGAT GATGATCTAG TGTTGGTCAA GGGCAACAAT GCTAATGAAG TCAATGTTGA TGAAGTTAAT ATTCATGCTG ACACTCGCAA TGAACCCTCC TCTTCTGAGG ATGATAGCAT CCCATTAGCT ACATTGGCTG GTAAATGTTC TACCAACACC AAAATGCAAC CCCAGAAGAA ACAAAAGTGT GCTGCTGTGA CTCATGCTGA AGCAACTTGT GTTGCTGCAG CAGTGGTAGA CTACATTGTT GGTGGCAAAT CGGAAATGAG CATGGTCCTC TATTCATCTC TTCAGCATTT GCTTGAGATT GCTCGTGGAA CAAGTGCCCG CTCAGCTGCC AGCATAGTAC AATCTGCTAG CCTGGCTGTT GAACTGAGAA GAAAGCATCC AGCTCTGCAT CAACCAGTGG CTGGTCCAGA GGCCAACACA GTAGGAAGGA CCCGATCCAA GAGACTAGTC TCAAAGGTGA TGGCCACATA TGGAGGGACA TCGGTGCGTA GCTACAAAGT AAAGAATTCC AAATGCAGCT TCTGCCACAG AGCCACATGC AGAAATATTC AAAGCTGCCA GATACTGAGG GACCTTGGCC GGAGGATTAC AAAAGAGGAG CTTCCTCGCT TCCGGCAAAC AGAACTTTGC TGCTCACAAG CCATCAGTGA CGGATCAAAG TTAAGTTCCC TTATTACAGC CAACAAACCT GTGCTCATCA GCCTCCCAAG ACATACTAAA TGGCTTGTTA TTCATGGCCT GTACAATCTA TCAGGAAGCT TGGCTGCTGC CAAAGTTTCA AACAATGTTG GGGTGGAAGT TACATGCTAT GGCAACATGG GAACAATTAT GGAAGGACTT ATAGAAGGTG CTGCCAGCTT TGATCACAGA GTGGCCACAT ACAGCACTGT AACAGACTGG ATTGCAACCT CTGCTTTGAC AGGAATGAAT ACCATGACAC GGTTGATTGC AAGCAACAAA TTTAATAGCT TAACTGGTAG CATCTAATGA CTACTTACTT GTGCTTT
|
Protein sequence | MVQVEIMSKI YGELVAYPPL RILQNGKEHD PYVVVDEAIE KFGAAVKDQM LTKEQALVKV TKVANSIDWL PTELKELVDA HCRKDSDVDN DGYVNKMRLS IKANELFGKM KASTYFVNFY QLKQVASRFA AHWGFVVVSS GNKLSCFFAK SSTKPQHSTE CHPGVKEQRL ARKAAGITIG GLDLTKVNDI VTLIAAGNIN ACQMKSLLKD HVPEHYAITA SDICNIRKRA VKYSIEQTPI NIATAQKLID FVPLDEDETI ISKDDDVSRE KVAEFMRQVL QDTGEGWKAL AFLEKVKSET IGFDFRVHYD IDVRPIGIVW ITKSMRKAWI RFGSTIYLDA MKRKMNSLHW PYIGPVAMDH EMRVVPLCES ICLGETLAAY AFALNSLEQM EPRRKLASIR LIYADCFLTD ALLPLVGLKR PSTTLAWDSF HLKSKVWPEY FGPTLFDQLK ASLGKMLYGK SRKEYDEAYQ EIAQTLAHNP AKLEYVKGLY DHPERFAHHF IKTIPGNLGK SSSQPAESNH SSVVARVGPG SSQDIVKEIS ALLYRRQDLA NLHEQEDARY ELLSFNRASK TKTTTLLHDL SDAGAHKALS KRFFKDYWCP LSEEGKSYTH LCLPCGSHQI FHMDTPKLDD FAIVIKKGER CTCSQQKEFG GMCLHEYALH NHTFELSLFP ERMLQPHLQT AIRPTSSNND TYDHYGCNDD DLVLVKGNNA NEVNVDEVNI HADTRNEPSS SEDDSIPLAT LAGKCSTNTK MQPQKKQKCA AVTHAEATCV AAAVVDYIVG GKSEMSMVLY SSLQHLLEIA RGTSARSAAS IVQSASLAVE LRRKHPALHQ PVAGPEANTV GRTRSKRLVS KVMATYGGTS ILRDLGRRIT KEELPRFRQT ELCCSQAISD GSKLSSLITA NKPVLISLPR HTKWLVIHGL YNLSGSLAAA KVSNNVGVEV TCYGNMGTIM EGLIEGAASF DHRVATYSTV TDWIATSALT GMNTMTRLIA SNKFNSLTGS I