Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50470 |
Symbol | |
ID | 7199316 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011698 |
Strand | + |
Start bp | 151581 |
End bp | 153313 |
Gene Length | 1733 bp |
Protein Length | 474 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185388 |
Protein GI | 219130471 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 0.904967 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAAGAAAGGA ATGGCACCCC ACGGCCCTTT CTCCTGTACT ACTCTCTAGC TAGAAACGTC CCCAACACTG CTTCGTATAT ACCAAAGTTC GCTACAGTTA CAGGCGTTTG CCACAGCATG GTAAAGGCAA AAAAGAGTAA AAAATCCAAG CCCCGAAAAG AGGCGGCTTC CAACAACGGT ACAGTGGTCT ACACTCCCAG TAGCAGTCAC GTCTCCGCTG TATCCGGTTC CTACAAGGGC ACCAAGGAAG ACCTCGATGC ACTCGATTGG TTGCGGAACG TCGAAAACCG CCGGAGTCAG GGCGAGCCAG TTTTGCAGCT CGAGTATCAG GAAGACGATG TGCTGATGAC GACACTGCAG GACTTTTTGG AAACGTCGGC GGCGACGCAC GAAGTGATTG ACGAAATATA TCAAATTTGG CAGCAGACCG TCCTATCGAC CTTGCTTGCC AGCTCGGCTT TGTGGAAACA GGAAACCGTG CAAGCGAATG ATCAGCGCGT ACTGAAAGAT TGTTTGAAAA CGGCGAGGTA TAAAATTAAC TTTGCACTGC ATCACGTCCT GGGAGCGATT CTGACGTCGC CGGAACCCAA ATGGAAACGC CTACAGCCCT ACGTGATTGA TGTGTGTTGG CAAACGCTGG CGTTGTTACC ACGACTGGCC GAGTCGGTAC CACATCATCC CACGGATCTG AGATCTAACA TTCTGCACTT TGAGGGCAAA GAATTCCTAG CATCGGTCGC CTGTGAAACC TGGGACACAT TGTACGACTT CGACATTTCC TTAGATCCGA CCCGATTCGT ACCCATTCTA CGTAAACTCG CCATGTTGGA CATTTTAGAC AAGGACTGGA ACATGCTCGG TGGGTGGGAG GACATACTGA AAGGGTTGGA GGCGAAATGT TTCCACGGAT CGGTGCTGCG GTTACCCTGC GATAGTGTTC TGCACGAGCA CACGACGTTG TTGCGGAACG CCCGCAACTC AATGCAGTGC AAATTCTTTT CCGTCCGCTT CATGCGGTAT ACCGAAGACC AGGCACCACA GCGGTGGCGA TACTTTCCCA AATGCGCCGC CCCGGCCTGT GCGCACGTGG AAACGCCCGA ATCGCCCCAT CCGCACCGGT GTGAAAGCTG TTGGTATTTT CATTACTGCA GTCCGGCCTG TCAGGAATAC TGCGACGTGG TCCTGGGTCT GCATCCAAAA TTCTGCCGTG ATACACCGGC TAATAAGGCG GCATCGTGTC AGCGTGAGAC CGAGGCGTAT TTGGGATGGA GCGATCCCCA ATCCGGACAA CCACTGGTGT GTCACGCCTG CGGAGTGGTA CAAGAAGAGG TCAGTGGTGC CGACAGCCTC GTGGATGCAC AGTACGCCAT CGTGAGCAAT GGAATACCGA CATCCAGTAT GAAGAGGTGT TCCAAGTGTC AAAAGGTGTA TTATTGCAGT CGACAGTGCC AAGAATGGGA TTGGCGTGTT GGGGGGCACA AGCGAGTTTG TCTCTTTGAG GCTGCCCAAA AGCAACAGCA GCAAATAAAA GAGTTGAACT GAAAAGGGAA GCATGAGATC AGCAATGTCG CAATGGCTCT TACATCAAAA ACGTTCAAAG CTTTTTACTT TTCCAAAGTC TCGTATTTGG GAGCAAGTTG GATGCTCCTT TTGGCTATAG CCGTAGGGAG AGGCCCAGTG AGATAGCTAG GTTGGTAGTT TCTTGTTTCT AAGATAGCCG AACGACACTC ATA
|
Protein sequence | MVKAKKSKKS KPRKEAASNN GTVVYTPSSS HVSAVSGSYK GTKEDLDALD WLRNVENRRS QGEPVLQLEY QEDDVLMTTL QDFLETSAAT HEVIDEIYQI WQQTVLSTLL ASSALWKQET VQANDQRVLK DCLKTARYKI NFALHHVLGA ILTSPEPKWK RLQPYVIDVC WQTLALLPRL AESVPHHPTD LRSNILHFEG KEFLASVACE TWDTLYDFDI SLDPTRFVPI LRKLAMLDIL DKDWNMLGGW EDILKGLEAK CFHGSVLRLP CDSVLHEHTT LLRNARNSMQ CKFFSVRFMR YTEDQAPQRW RYFPKCAAPA CAHVETPESP HPHRCESCWY FHYCSPACQE YCDVVLGLHP KFCRDTPANK AASCQRETEA YLGWSDPQSG QPLVCHACGV VQEEVSGADS LVDAQYAIVS NGIPTSSMKR CSKCQKVYYC SRQCQEWDWR VGGHKRVCLF EAAQKQQQQI KELN
|
| |