Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50625 |
Symbol | |
ID | 7199472 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011701 |
Strand | + |
Start bp | 8199 |
End bp | 10108 |
Gene Length | 1910 bp |
Protein Length | 495 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185592 |
Protein GI | 219130903 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AGGTAGGTAG GTAGAGAAAC ACGACAGGCA TCTTGGCATC GATTAACAGA GACTTGACAG AGACAGGATC TAACACATCG AGGTAGTCTC CCGGTGAATC TTTTGCGGTT CGACGAGCCG CTCAACTTTT TTCGTTCGAT TGAGAGCAAT TTATAACATG ATGAGGAACG TTGCAATAAT GCGAAAGAGT CACTATTGCT GGCGTAAAGC TGTAACAAAG CACCCGGGAA ATGCGTTTTC GCGATACCAA GGTCGGTCCA AGTCGAGCGA TTTCCACGAG AATCCTGACG ACGTCCACCA CAATGATTCG TACCGAGTCG GAGCATGGAA TAAATTTCCT GACGGAAGCC GCTTTTGGAA GAAGACAACA GACGGAAACT TCCAACGCCC AATCTTCGTA GCGGCAACTC GACAGCACGT AGGGAAGACT ACGACGTCAT TGGCGCTGTG TTCCGGCTTG AAGAAGCGAT ATGAAAAAGT AGGATTTCTA AAACCAGTAC GTATCCGATG CGATATTCTG CTGTGTGGGC TTTTCCGTCT TGAGTGATAT TGAGCTAACA GGTACACGAA CATTTTCTGG TTCGGTAATG CTCACAGGTT GGGCAACAGC ATGTGGAGGT CAAATCGGAG AATCGCAATG CTACGATACG GGTCGACAAA GATATCTGTC TTGTAAAGGA ACACTTTAAA CTTGATCATA TCGACTACGA ACATATGAGT CCAGTTATCG TACCCCGGGG GTATACCAAA AAATTTGTTG ATGGCGAAGT GTCTCTACAT ACCCAGATTG ATGACGTGAT TGATGCCATG GAGCATGTGT CTGCTGCAAG CGATGTGGTT TTGTGCGAAG GCACCGGTCA CTGTGCTGTT GGATCCATTG TCGGTCTCAA TAACGCCAAA GTGGCATCTT TGATTGGCGC CGACATGGTG TTGGTCGCCA ATGGCGGACT CGGCTCGGCA TTCGATGAGC TCAATCTCAA CCGCATTCTG TGTCAGCACT ACAACGTTCG TGTAGCGGGT GTGATTATCA ATAAGGTGAT ACCCGATAAA CACGAGCAGA CAAAACATTA CATGCGTAAG GCACTGATGC AGGCTTGGGG TGTACCGTTG TTAGGCTGTA TCCCAGATCG ACCTTTCTTG GGCTGCCCTG CTCTAGCCGA CTTGGAAAAC TTGTTCCAAA CGCAGCTCAT TTCTGGGGAA GAACATCGAT TCAGGCATTA CAATACGAGT GATATCAATG TTGTCACTAC TTCCCTGACG CGCTTTTTGG AAAATTTGCG CCACAAACAG TCGAGAACAT TGTACATTTG TCACGTCACA CGTAAGTTTG ATGTGTGCAC GTATTGTAGA TTGTCCAACC GAAGTGATTT TTAATCTTTT GCTACTCGTC CTCTTTTTAG GTGACGATTT GATTGTTGGA TTCATGGGGG AATATCATCG ACGACGGGAA CAATCTGAGA CTCCATTTGA GGCTGCATTG TTGGTTTGTG GTCGAAAGGG AAAGTATCAG CTTTCACAGG AAGTGACCGA CATGTTGAGA GGGTTAAAGG GCGCTCCTGT CATGACTGTA GGACTCTCGA CCCACGATGC CATGACGGCT ATCCACAACT ACACGCCCAA GCTCAATATT CATGACACAA ATCGCGTAAA TGTCGCCGTT GACCATTACG AACCCTATAT TGACTTCGAC GAGCTCATCC GACGGACGCA AGCAAGCAAT TCTAATTTCA ATGAACCCGG GAGTATTTCG ATGGAGGAAT TACGGACTCT ATAGCATCAA AATATTGGTA GTGTTGTCAG GCACTCTTTT GGATGTGAGA ACGATTCACC ACGACATTTG TGGTAAATAT TTACATTAGT TCGCTCATGG TCAGTTCATT TCCATTAAAT AGGTGTCGCA GCCATAGTGT
|
Protein sequence | MMRNVAIMRK SHYCWRKAVT KHPGNAFSRY QGRSKSSDFH ENPDDVHHND SYRVGAWNKF PDGSRFWKKT TDGNFQRPIF VAATRQHVGK TTTSLALCSG LKKRYEKVGF LKPVGQQHVE VKSENRNATI RVDKDICLVK EHFKLDHIDY EHMSPVIVPR GYTKKFVDGE VSLHTQIDDV IDAMEHVSAA SDVVLCEGTG HCAVGSIVGL NNAKVASLIG ADMVLVANGG LGSAFDELNL NRILCQHYNV RVAGVIINKV IPDKHEQTKH YMRKALMQAW GVPLLGCIPD RPFLGCPALA DLENLFQTQL ISGEEHRFRH YNTSDINVVT TSLTRFLENL RHKQSRTLYI CHVTHCPTEV IFNLLLLVLF LGDDLIVGFM GEYHRRREQS ETPFEAALLV CGRKGKYQLS QEVTDMLRGL KGAPVMTVGL STHDAMTAIH NYTPKLNIHD TNRVNVAVDH YEPYIDFDEL IRRTQASNSN FNEPGSISME ELRTL
|
| |