Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50428 |
Symbol | |
ID | 7199227 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011697 |
Strand | + |
Start bp | 331067 |
End bp | 333517 |
Gene Length | 2451 bp |
Protein Length | 816 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185311 |
Protein GI | 219130311 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCATATT GCATATTCTC CTACGAATTG ACCGGAAACC GCGCGTTGTA TCTTGGCGAC GGTGATCGAC ACGAAACCGC CTTCAACCAA TACGAAGTCG TCGTTCCCTT CAACGCCTAT CGAGACCCCG AGCTCGCTGC CAACACGGAC GGGCATTGCC GTTACTCTCT ACACATCTAC CCCAGTCAGC AGTTTGCTCA GGGATACAAG TCCAGCCTTC CCATCGTCTT CACTTCCCTC GTGGCAGCCA CCTTCTTCCT CATGGCTCTG ACCTTCTTGG TCTACGACCG CTTCGTCCAC CGCCGTAACA TCAAAGTGGT TAATGCCGCT GCTCGATCTA ACGCTATCGT GTCATCGCTG TTCCCTTCTA ATGTCCGTGA TCGCTTGTTT GAAGATGCCA AGGCAAGAAG CGACGTCAAC CAGGCCGCCC ACTCTCGTCT CAAGACGTTC CTGCACAATG GTGACTCCTC TGATACCGCC ATCACTGACG AGAATGCGCA TCACAGTGAC TTCTTCAAGA CCAAGCCCAT TGCTGAGCTG TTTCCCCATA CTACCATCAT GTTTGCAGAT ATCAGCGGCT TCACGGCATG GAGCTCGTCT CGAGAGCCGG CACAAGTCTT CCAGCTGCTG GAGACCCTGT ACCACTCGTT CGACGAGACG GCCAAGAAGC GTCGTGTCTT CAAAGTCGAG ACGGTCGGCG ACTGCTACGT TGCCGTTGCT GGACTGCCCG ATCCGCGAAA GGATCATGCC GTAGTCATGG CTCGATTTGC CAAGGACTGC ATGCACCAGA TGCACTCGCT GACAAGAAAG CTGGAAGTAT CTCTAGGCCC GGACACGGCC GACTTGTCCC TACGAATTGG GTTACACAGC GGACCCGTGA CTGCCGGTGT CCTGCGCGGG GAGCGGTCTC GTTTTCAGCT CTTTGGAGAC ACCATGAACA CGGCAGCAAG GATGGAAAGC AACGGAATTC GCGGCCGCAT TCAAATCTCG CAGGAGACGT CCGATCTGCT TGCAGATGCT GGCAAGACCC AGTGGTTCGT TGCACGCGAG GATACGATTG TGGCCAAGGG AAAGGGCGAG CTCAACACAT TCTGGCTTTC TGTGGGAGAT GTGGGTAAGG GGAGGTCGAC AACCGACACG ACGCACAGTA GCGACGATGT TCTTGCGCCC AACAACTATA ACAGCTCTGT AGCGTTGGAT AGTCTGATGA CGACCAGTGC CGAGTCAGAT CAGCAGGTCT ACAACCTTGT TTCGAATAAG ACCTCGCGTC TCATCGACTG GAACGTCGAT GTACTATCGC GATTGATCAA ACAAATCGTG GCGCGTCGCA AGGCCTCCAA GGTACCAAAG AAGGACTCGT CCAAGCAATA CTTCTGTCCC GGCGACAACC GAGGAGCCGG GACCACGGTC CTGGACGAGG TGACGGAGAT TCTGGCTCTG CCGGAGTTCG ACGCGGACGC TGCTCGTCGC CAGCAGGATC CTGAGAACAT CGAGCTAGAT GACAGTATCA CGTCACAGCT GCAGCAGTAC GTGTCCAATG TGTCTGCCAT GTACCGGAAC AACCCCTTCC ACAACTTTGA GCACGCCTCG CACGTGACCA TGTCAGTGGT AAAGCTGTTG TCCCGTATTG TGGCTCCCGT GGATGTGGTG GTGTCGGACG GGAAGAACCA GAAGCGGTCC TTCGCCTCCA AGCTACACGA TCACACGTAC GGCATCACGT CGGATCCTCT GACTCAGTTT GCTTGCGTCT TTTCGGCACT CATTCATGAC GTCGACCACA GTGGAGTTCC CAACGCGCAG CTAGTGAAGG AGAACAGTAA GATTGCTACA TTCTACCAGG GGAAGAGTGT TGCCGAGCAG AACTCCGTGG ATCTAGCTTG GGACCTGTTG CTAGACGACA GCTTCAAAGA TTTGCGAGCC GCCATCTTTG CCACGGACGT GGAGAAGGCC CGGTTCCGAC AGTTGGTAGT CAACTCCGTC ATGGCAACGG ATATCATGGA CCCGGATCTC AAGGCTATTC GTAACGCACG CTGGGAGAAG GCCTTCACAG CCTCCCCAAA TATGCAGGAA GACCTGAAAG ACTTGACAAA CCGCAAGGCG ACGATTGTAA TCGAGCACCT GATCCAGGCC TCCGATGTGG CACACACCAT GCAGCACTGG CACATATACC GGAAGTGGAA TGAGCGACTG TTCCTGGAGC TGTACCAGGC CTACATAGCC AGTCGAGCAG AAAAGAGCCC GGAAACATTT TGGTACAAGG GTGAGCTGGG CTTTTTCGAC TTTTACATCA TTCCACTGGC TATGAAGCTG AAGGAATGCG GTGTATTTGG AGTGTCGAGC GACGAGTATC TGAACTACGC GATGCGCAAT CGCAAGGAAT GGGAAGACCG TGGTCAGGAA GTGGTGCGCG AAATGATGGA AAAGATCAAA GGTAGGGTGA AGCGTTATTG A
|
Protein sequence | MPYCIFSYEL TGNRALYLGD GDRHETAFNQ YEVVVPFNAY RDPELAANTD GHCRYSLHIY PSQQFAQGYK SSLPIVFTSL VAATFFLMAL TFLVYDRFVH RRNIKVVNAA ARSNAIVSSL FPSNVRDRLF EDAKARSDVN QAAHSRLKTF LHNGDSSDTA ITDENAHHSD FFKTKPIAEL FPHTTIMFAD ISGFTAWSSS REPAQVFQLL ETLYHSFDET AKKRRVFKVE TVGDCYVAVA GLPDPRKDHA VVMARFAKDC MHQMHSLTRK LEVSLGPDTA DLSLRIGLHS GPVTAGVLRG ERSRFQLFGD TMNTAARMES NGIRGRIQIS QETSDLLADA GKTQWFVARE DTIVAKGKGE LNTFWLSVGD VGKGRSTTDT THSSDDVLAP NNYNSSVALD SLMTTSAESD QQVYNLVSNK TSRLIDWNVD VLSRLIKQIV ARRKASKVPK KDSSKQYFCP GDNRGAGTTV LDEVTEILAL PEFDADAARR QQDPENIELD DSITSQLQQY VSNVSAMYRN NPFHNFEHAS HVTMSVVKLL SRIVAPVDVV VSDGKNQKRS FASKLHDHTY GITSDPLTQF ACVFSALIHD VDHSGVPNAQ LVKENSKIAT FYQGKSVAEQ NSVDLAWDLL LDDSFKDLRA AIFATDVEKA RFRQLVVNSV MATDIMDPDL KAIRNARWEK AFTASPNMQE DLKDLTNRKA TIVIEHLIQA SDVAHTMQHW HIYRKWNERL FLELYQAYIA SRAEKSPETF WYKGELGFFD FYIIPLAMKL KECGVFGVSS DEYLNYAMRN RKEWEDRGQE VVREMMEKIK GRVKRY
|
| |