Gene Information |
Locus tag | PHATRDRAFT_50429 |
Symbol | |
ID | 7199298 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011698 |
Strand | - |
Start bp | 6159 |
End bp | 9059 |
Gene Length | 2901 bp |
Protein Length | 958 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185418 |
Protein GI | 219130534 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0883829 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTACTA AACATATAAG CCGTGTCCAT ACGGTGCCGT CAAGACCGCA AAACTCCAAT TGGTACCGGA TTCCTTTGGT CGCTGCTACC GTTTGCACTG TGTTCTCCGG TTTCTATTCA CTCCGTGTAC TTGACTGTCC TTTTGCTGGT TCCTCGGAAG CACAGCTGTC CACGAAATGG CCAGCTCCAT CTATTATTGC GTCAAAATCG TCACCAAGCC AAGCGACAGT AACATCTGAT GATAACGGCT GCCGACCGCT AGCCAATGGT GGTCCAGTAA GTCTGGTCTG GGCGTATGAC GATCCACCCA CGTTGGAGGC CATGAAAAGA ACCATGGAAG GACTTGCGTC TGCCTTGCCG GGAATTCAAG TAGCTGTATA CTGTGGCTCA AGTTTATGTG TAACTGTCGC ACACAAGGCC GTCCCTAAAC GCTCCTGCAT CTGGATCCAG CATATGGTCG CACCGAGGTT GGCGCAAGAT TCTCCACTGG AGGACTGGGT TGGAGATCAT GTTTTGGCGA AGCTCTTATC AGCGAATCAT TTCGAACAAA CTCTTCAAGT TGTTATGCAA CTTGTTGTTA TCTGGAAGTA TGGTGGTATG GTACTCTTAC CGGGGTCCGA CGTGTTGAAT GCCAGTGATC TTGGATCATT GGGTCAAGAC GATGCCGTGC TCTTGACGGC TGATGATCAA ATGCTAGTGA AGCCGGGCTC TGGCGGTGGT CTCTATGCTG TAGCGGCCCC CCCTACAAAT AAAGTCATTG AAGCTCTCAT GGATGAGATG CTACATGTTT ATCAGTGGCC AAAATACGTC GCGTCCAATT GGCCAGTAAA CATACAGTGG GATGTCCTAT GCGCCAGAAT GACGATCTGT CTTGATGCTC AGCTATCTCT TTTAAATATG GGAATAACAA CAGAAACAGA GTCCGTTATG GAGCGTCGCC CTGGAAGACA TTTTGGGACA CTCAGCTATC AGGCGAGGCG GCACTCCTTG AAGGTTGTAG GTGATCACAA CATGAATAAA GGTGATGAAA TGCAGGGGCT AGCAGGCCTG CAGTTTCGTG GGATCCCCGG GTACCTTGTA GAAAGAGACA GACTTGACGT AGTGAGTGTA ATTGCCTCCA AAAACTTTTC TGCCACAGGT GAAGTGACTG TTCACTCCCC ATCTCAATCA GCGTTGACGC CAACAACAGT GTTCCTCAAT GCCTGGTGGG GGGATGGGAA TTGGATCTGG CCTCCGCCTG GAAAACTGGA GCCTGTCTTT GTTTCAATGC ATCTTAACAA CAACAAAATC AAACAGGACG TCAACAAATC AAAGGCATAC TTAAACCAAC ACTGGCCTAT TGGGGCTCGT GACACGGAAT CACAAAGGTT CTTGAAGTCA ATTGGTGTCC ACTCTGTCTT TTCAGGGTGC ATGACAATGA CTCTCCTACC AACCTGGAGT AAATGGCGTG CTTCGAAGGA GCAGAACAAT GAGGTTTTAA TTGTCGACGT CAACAAAAAT GGTCTCCAGC TGTTACCGGA CCACATCACG TTGGGCGCTG TGAAGCTTTC GGCAAACCTT GTGAACAAAA GTGTTATAGA TGACCAGGTG GCAAGATACT TTTTAGCTAA CGAAATGAAG CTTCGCTTGC AAAAGGCCAA GCTGGTTATC ACTCAGCGTC TCCACATTGC GTTACCGGCT GCCTCCATGG GGACGCCCGT GATTCTGATC ATGGATCACA ATATGCCCGG AGGTGGAGGT GTCGATGGGG CGCGTTTTAG CGGTCTACAG GATGCCGTAC ACACTGTAAA CACTGTCAAT 
GGGTCAGAAG CTCTTGCTTC CTTCAATTGG GATCAGCCGC CACCTAACCC CAACCCAGCA TTTCTTGAGG TCAAGCGAAA TGCGCTTCAA GTGTTAACAA TGTGTCATGG GGAACTAACT GATTCTGCAA GAAAGTTTGG TGTTATTCCA ACATCATGGG AGTACCCTCC TGAAGCGGAT GTGTGTAGAG CAACCAAGGA AGACCATTAT GACAATGATG CCATTCACAT TGCAACCGCA ATCAATCCGA TGTGGCTACA CTCCAAGCCA ATTCTCTCCA GCTGGATTCA TGCACTGTAC AAGTCCAACC CAAGAGAACA ATTTGTGTTC TATTTCCTTG CGGATAATAT GAATGCCAAG CAGCGGTGCA TTGTCCGGTG GATGGTGCTT CGATGGTTCC CCAACGCAAA GGTGTACACA ATACCGCTGG ACCAAGCACA TGTGGGCATT CCAAAGATGT TTCGAGAACA TGCTGCCAAG CTTCTTTTGC CACGGGTTCT TCCGTGTGTG GGAAAAGCTC TTTGGATTGA TAGTGATGCC ATTATCCTAA AGTCACTTAG GCCAATGTGG TCTACCTCAA AGGTCATCCC AAATTGTGGT ATTGTCGCCA GGAACTCTGC GAAAAAAACG TCAATTGGTG CTTTGATGAC AGCTCTGAAT GCTACAACCC CCTCACAATT ACTGAAGAAA GACAGTTTAG ACATACCGAT TTTTGACACT GCGGTGATGG TTCTAGACCT TGGAAAATTG CGCAGAAGTC GCTTTATAGA AACGGTAGCT TCGTACTGGT CATTTACGCT TGGTGGAGAT GTCCAAATTT CCATGAACAT GCAATGTAAT GGAACTCATG GAAAACTTGA CTCGGCTTGG AACATGTTCT TGGACGATTC AGAAGATTCC GTGTTCACAA TCAATGATAT TAGCGAGTGG CGTATTGTTC ATTTTCAAGG TCAACAAAAG CCATGGGTTG ATAAGACTAA CTCACTTCAA CGCAAGATTT GGTCAAAGCA TGCCCTGTCT TTGTTTGATG CACTGTATGG GCCAGTCCCG ACAAGGAAAC TCAATGTCAA TACCTAAATT ATGAAAAGTA TCTTAAAGAT T
|
Protein sequence | MVTKHISRVH TVPSRPQNSN WYRIPLVAAT VCTVFSGFYS LRVLDCPFAG SSEAQLSTKW PAPSIIASKS SPSQATVTSD DNGCRPLANG GPVSLVWAYD DPPTLEAMKR TMEGLASALP GIQVAVYCGS SLCVTVAHKA VPKRSCIWIQ HMVAPRLAQD SPLEDWVGDH VLAKLLSANH FEQTLQVVMQ LVVIWKYGGM VLLPGSDVLN ASDLGSLGQD DAVLLTADDQ MLVKPGSGGG LYAVAAPPTN KVIEALMDEM LHVYQWPKYV ASNWPVNIQW DVLCARMTIC LDAQLSLLNM GITTETESVM ERRPGRHFGT LSYQARRHSL KVVGDHNMNK GDEMQGLAGL QFRGIPGYLV ERDRLDVVSV IASKNFSATG EVTVHSPSQS ALTPTTVFLN AWWGDGNWIW PPPGKLEPVF VSMHLNNNKI KQDVNKSKAY LNQHWPIGAR DTESQRFLKS IGVHSVFSGC MTMTLLPTWS KWRASKEQNN EVLIVDVNKN GLQLLPDHIT LGAVKLSANL VNKSVIDDQV ARYFLANEMK LRLQKAKLVI TQRLHIALPA ASMGTPVILI MDHNMPGGGG VDGARFSGLQ DAVHTVNTVN GSEALASFNW DQPPPNPNPA FLEVKRNALQ VLTMCHGELT DSARKFGVIP TSWEYPPEAD VCRATKEDHY DNDAIHIATA INPMWLHSKP ILSSWIHALY KSNPREQFVF YFLADNMNAK QRCIVRWMVL RWFPNAKVYT IPLDQAHVGI PKMFREHAAK LLLPRVLPCV GKALWIDSDA IILKSLRPMW STSKVIPNCG IVARNSAKKT SIGALMTALN ATTPSQLLKK DSLDIPIFDT AVMVLDLGKL RRSRFIETVA SYWSFTLGGD VQISMNMQCN GTHGKLDSAW NMFLDDSEDS VFTINDISEW RIVHFQGQQK PWVDKTNSLQ RKIWSKHALS LFDALYGPVP TRKLNVNT
|
| |
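The length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the coding region ends with a single stop codon; the function names (gene_length, cds_length, ungroup) are hypothetical helpers introduced here for illustration, not part of any database API. Under those assumptions, the 2901 bp gene length matches the coordinates, and the 958 aa protein accounts for 2877 bp, which leaves 24 bp that follow the TAA stop codon in the displayed gene sequence.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def cds_length(protein_aa: int) -> int:
    """Nucleotides needed for the protein plus one stop codon (assumption)."""
    return 3 * (protein_aa + 1)


def ungroup(seq: str) -> str:
    """Remove the spaces that split the displayed sequence into 10-letter groups."""
    return "".join(seq.split()).upper()


# Values copied from the record above.
start_bp, end_bp, protein_aa = 6159, 9059, 958

total = gene_length(start_bp, end_bp)   # length implied by the coordinates
coding = cds_length(protein_aa)         # length implied by the protein
trailing = total - coding               # bases after the stop codon

print(total, coding, trailing)  # 2901 2877 24

# The first three groups of the gene sequence, joined into one string.
print(ungroup("ATGGTTACTA AACATATAAG CCGTGTCCAT"))
```

The same ungroup helper can be applied to the full Gene sequence and Protein sequence fields before passing them to any tool that expects an unbroken string.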