Gene Information |
Locus tag | PHATRDRAFT_20773 |
Symbol | |
ID | 7201651 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | - |
Start bp | 218153 |
End bp | 220164 |
Gene Length | 2012 bp |
Protein Length | 484 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180965 |
Protein GI | 219120454 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
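The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based and inclusive, and that the coding sequence consists of three bases per residue plus one stop codon. The function names are illustrative and do not come from the source database. Under those assumptions, the difference between the gene length and the coding bases is sequence that is not translated (introns and/or untranslated regions), which is consistent with the record marking the gene as spliced.

```python
# Values copied from the record above.
START_BP = 218153
END_BP = 220164
PROTEIN_LENGTH_AA = 484


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive."""
    return end_bp - start_bp + 1


def coding_bases(protein_length_aa: int) -> int:
    """Bases needed to encode the protein.

    Assumption: 3 bases per residue plus one stop codon.
    """
    return 3 * (protein_length_aa + 1)


gene_length = span_length(START_BP, END_BP)
cds_bases = coding_bases(PROTEIN_LENGTH_AA)
non_coding = gene_length - cds_bases

print(gene_length)  # 2012, matching the Gene Length field
print(cds_bases)    # 1455
print(non_coding)   # 557
```

The inclusive-coordinate convention is what makes the computed span equal the listed 2012 bp; with a half-open convention the result would be 2011.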
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AACACCCCAA GTACGGCAAG GCGAACCGAA GACAGGTAGG TAGGTAGGTA GGTATATATA TATACACACC CTCCCACCAC TAACAGCCCG TAACGAACGT TTCTACCTCT CTGCTTCCAA AGAAACCGGA TACATCTAGG ATTGTCGGAG ATCCGTTGAC GCCAAGAGTT GCACAAAGCA AAACTTGTTT CCAGAATCAT GCCGTCGGCT ACAAAATGTG AGTACTTGCG AGCGACACTC CATCCAGTCA GGGCGACTCG CGTGTCTGTT TCGTGACACT GTGTCTTTTG CTTGCGGTTG TGTCGAACGC TTTGGCGGAG CACTTTATTC CCACATACTC ACACACACAC ACATACACAT TGCTTGGTCG ATCTCGTGAC TGCGCAGCCG CCAAAAAGCG CAAGGCCGAC GACGGCTCGG CCGTGGCGGA AACGTCGGCG GGTCCGATGA TTACGCCTTC GTCGCACACG CCCAAGCTCG ATACGAGTAA GTGGCCGTTA CTGCTGAAGA ACTATCACGA ACTCCACGTG CGCACCGCGC ACTACACCCC GCTACCGACC GGCAGTAGTC CGCTCGCCCG CACCCTCGAA CAGCACTTGC AGTACGGAGT CATCAATCTC GACAAACCCG CCAATCCGTC CTCGCACGAA GTTGTGTCCT GGATTAAACG CATTCTGCGA GTCGAGAAAA CGGGACATTC CGGCACGCTG GACCCCAAAG TTACCGGTTG TCTCATTGTG TGTATAGATC GCGCCACCCG ACTCGTCAAG GCACAGCAGA GTGCCGGTAA AGAGTATATC GGCGTTGTGC GTCTGCACGC TGCCTTGGAC GATCCCAAGC AGCTCACGCG AGCGATTGAA ACCACACTCA CCGGCGCGCT ATTTCAACGA CCGCCCCTTA TTTCGGCCGT CAAACGACAA TTGCGCATTC GAACCATTTA CGATTCCAAG CTCCTCGAGT TCGACGGCGA ACGAAACCTA GGTGTCTTTT GGGTCAAGTG TGAGGCCGGA ACGTACATTC GTACCCTCTG CGTGCACGCG GGACTCTTGG TCGGTACCGG TGGTCACATG CAGGAACTGC GTCGCGTTAA ATCCGGTGTG CTCGGGGAAG AAGACAATCT CGTCACGCTG CACGACGTTA TGGACGCACA GCACGTGTAC GACACCACTA AGGACGAAAC CTATTTGCGC CGGGTGGTCA TGCCGCTCGA GACTCTCTTG ACCAACTACA AGCGTATCGT TGTGAAAGAT AGTGCCGTCA ACGCAATCTG CTACGGTGCC AAGTTGATGA TACCCGGTTT GCTGCGCTTT GCCGACGATA TCGAACTCAA TCAGGAGGTG GTACTCATGA CGACCAAGGG CGAGGCTATT GCCGTGGCTT TGGCGCAAAT GACGACGGCC GTCATGGCCA CAGTGGACCA CGGAGTCGTG GCCAAGATTA AGCGGGTCAT TATGGAACGC GATGTGTACC CGCGTCGATG GGGTCTCGGG CCCATGGCGC AACGGAAGAA GAGTATGATC AAGGAAGGCA AACTGGACAA GCACGGCAAA CCCAACGAAA AGACGCCGTC GAACTTTTTG GATTCCTACA AGGATTATTC GAAATCCAAA CCGCCCGTAT TGGATGGTAA CGGTGAACCG ACCACGCCGG GATCCGCATC CGTCGCCTCC AGCAACGGAG ATAAAATGGA AGTCGACGAA TCGGAGAAGA AACGACCATC CTCGCCAAAA TCCGAGGATG ATGACGATGC TCCCCAGAAA AAGAAGGATA AGAAAGACAA GAAGAAGAAA AAGGACAAGA 
AAAAGAAGAA GGAAAAGGAG TAGAGGACGC CTTGTTTCCT GCTACTAGTA TATCTTCCAG AGAAATCTAA ACTAATCTTT TGAGAAATTT GTATGCATTC GCGTACATCG AAAAGTTGAC AAGGGGTCAT GCAATGGGTC CGTTGCTGGC AGCAGTTGCG AACGAGTAGG CTAGAACAGA ATTTTTAAAT GGTATGCTCT TAAGATTAGG TA
|
Protein sequence | MPSATKSAKK RKADDGSAVA ETSAGPMITP SSHTPKLDTS KWPLLLKNYH ELHVRTAHYT PLPTGSSPLA RTLEQHLQYG VINLDKPANP SSHEVVSWIK RILRVEKTGH SGTLDPKVTG CLIVCIDRAT RLVKAQQSAG KEYIGVVRLH AALDDPKQLT RAIETTLTGA LFQRPPLISA VKRQLRIRTI YDSKLLEFDG ERNLGVFWVK CEAGTYIRTL CVHAGLLVGT GGHMQELRRV KSGVLGEEDN LVTLHDVMDA QHVYDTTKDE TYLRRVVMPL ETLLTNYKRI VVKDSAVNAI CYGAKLMIPG LLRFADDIEL NQEVVLMTTK GEAIAVALAQ MTTAVMATVD HGVVAKIKRV IMERDVYPRR WGLGPMAQRK KSMIKEGKLD KHGKPNEKTP SNFLDSYKDY SKSKPPVLDG NGEPTTPGSA SVASSNGDKM EVDESEKKRP SSPKSEDDDD APQKKKDKKD KKKKKDKKKK KEKE