Gene Information |
Locus tag | PHATR_44101 |
Symbol | |
ID | 7204036 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | + |
Start bp | 987553 |
End bp | 989020 |
Gene Length | 1468 bp |
Protein Length | 415 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186165 |
Protein GI | 219113163 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAACATACAA GCTTACACAG GCAGCAAACA TTCCCTTCAC TTGACCATAC GGCTATTGAT CGTAAATCCG GGCCATTTTA CTGTAAGCGA ACCCCGCTTA CCGAAGATGG AGTCTATCCC AAGTGAATCG TTCTTGGATG AGGGGCAGAT GTCAACAGCA AATAATAAAA GCAGTCTTGC CAAGGGAGAG GCAGCAAAAA ATATGCAGCC GCTCCCAGAT GACTTCCAAC CTGGAGAGAA CGATGTTATT TGTGGGCGCG GCCGCAATGT TTTCAACCAC ATTGGTAATG ATAGCTTTCG TACGATTGTT GCGGGGTACT TGGATCATTA TAATCAAGCT TCAGCCAAGT TAGAGAAGAG TTTTATCCTC TCAGAGATCG TTACGAAGGT CCGTGAACTC AGCCCGAACG GTGGATTTGT GAAAAAGGAT CCCAAATCAG GCCGCTGGTT TGAAGTAGGT GACTTTCTGG CCCGGGAGAA AACTTCCCAA GCATTCCGCG ACGCTCTTCA CGACAAGTAC AGATCAAGCA ACACAGCAAA AAAGCTGAGG CGCCAAGTCG AACAAATAGA CAGGCTGCAT TCCTCCCAAA GCGATGAGGA ACGAGATTTT ATTTCGCAGA ACTCGGCTTC CCTCCAATTG GGTGATCTTG AATCTGTACA ATCATCCTTA CTTGGAACGG ACCTACAGAG AGCGAACGCT TTTTTGGGGG AGAACTTACA GTTAATGCAA GCACGCCGGT CAGCTCGTTC AGTACTGGAC TTTCATGCCA ACAACAAGGC TACAGGCCCC CTGAAACAAA ACAATTTCAA TTGGTCATGC CCTAATCTCG GGAGCAAAAG AAACACAACT CCGCTCAATA CCGAAGCTTC GATGCAAAAT TTCGACTGGG GTACGGCCGG CCAAACGAAT ACCAAGCCCT CAGCGAGCCT TTTTGCAGAT TTGGGTGATT TCCCTCCTCT GCCCATCGCT AACGGCATGC TGGCCAACCA GACAGGGGGT TTGCCGGGAA GAAATCAAAA CAGTGCATCA TTCGCCAACC TTCCTCTCGG GAACACATCG TTGCTTGCAA ATTTTTTGTC GCAATCAATT GCTCCCATCC ATGAGAATGC GCCTGTGGAA GGTCCATCGA TTCAAGCTCT AATGGGAAGA GCTTCCAAGT TTGATCCGTT TGACAGCATG CTTTCTCCCG AAGATTCGCA AAAAGAATCA CTTAAGTCTT TGGATGATAT CATGATCACG CCTATAGGAG CAGGTAAGGA TTCGGACTTG TTTGCTAAAC TAGCACTACT CACTGATGAA TATAAGGGGG ATGGCAATGT TTTTGAGCCG ACACCCATCG GAGAGACCGC ATAAAGCCAA TTCATCTCCC CTGATGTAGA TACATGGATG CCTTGCCAGT TTCTTTACTT TTGATTGGCC TTATACACTA CTTCATATAA TATTTTATGG ACTGAATGTC AGCGTCAT
|
Protein sequence | MESIPSESFL DEGQMSTANN KSSLAKGEAA KNMQPLPDDF QPGENDVICG RGRNVFNHIG NDSFRTIVAG YLDHYNQASA KLEKSFILSE IVTKVRELSP NGGFVKKDPK SGRWFEVGDF LAREKTSQAF RDALHDKYRS SNTAKKLRRQ VEQIDRLHSS QSDEERDFIS QNSASLQLGD LESVQSSLLG TDLQRANAFL GENLQLMQAR RSARSVLDFH ANNKATGPLK QNNFNWSCPN LGSKRNTTPL NTEASMQNFD WGTAGQTNTK PSASLFADLG DFPPLPIANG MLANQTGGLP GRNQNSASFA NLPLGNTSLL ANFLSQSIAP IHENAPVEGP SIQALMGRAS KFDPFDSMLS PEDSQKESLK SLDDIMITPI GAGKDSDLFA KLALLTDEYK GDGNVFEPTP IGETA
|
| |
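The Start bp, End bp, and Gene Length fields in this record agree with each other if the coordinates are read as 1-based and inclusive. That convention is an assumption here, since the record does not state it. The sketch below shows that arithmetic, together with one plausible way a GC content percentage could be derived from a sequence written in space-separated blocks like the one above. The function names `gene_length` and `gc_percent` are hypothetical helpers for illustration and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    # Both endpoints count as part of the feature, hence the +1.
    return end_bp - start_bp + 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace (such as the 10-base block spacing used in the record)
    is ignored. Every remaining character counts toward the total,
    which is a simplifying assumption for this sketch.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Coordinates taken from the Gene Information table.
print(gene_length(987553, 989020))  # 1468
```

Passing the full Gene sequence field to `gc_percent` should give a value comparable to the GC content field, although the rounding rule used by the record is not stated.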