Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_37521 |
Symbol | |
ID | 7202501 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | - |
Start bp | 335570 |
End bp | 337395 |
Gene Length | 1826 bp |
Protein Length | 538 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181706 |
Protein GI | 219122757 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAGGAT GGTCGTCCTG TGCTCCTTCT GCGCAAAAGT TTTCAAATCT CAAGAAAGCA ACGTATGGAA GCTTCAGCCT GCACTGTCCT TCATCTCGAT GTGAATGTCT ATGACGCATA GTCAATATGT GCACAGGAGA CATCCAGTTG GGACTGGGAT TCGAACAGAA GAATGGAGGA AGCCCGTCGT GGTGCCGGCA GAGCGATGTG GCACGGAGAA AGTCAAAGGT GAATTGCCTG TAGCTCGCAA GCTGCAAAAA CAACAAAGGC AAATTCCGAC AATCCTGTTG GTCGCCGTTC TCTTGCTTTT TTCTTTTTGC ATGCTTAAAA GTTTTCGAAA AACGGTGCAT CATCATGCGA TACGCAGAGA CCATTTGCAC CAGTACTCTG ATGTTAGCGA GAAAACAGTC ACTTCCATTC GCCAAGCTAG TGGTATTCTT CTCCACAACA ACCAGATAGG GAGGACACCA ATTATTCAGC CACAAATTTT TCTACCAACT GTAAACGAGG ACGGAACTAC TGAAAAAAAA CGTGAAGCTG CGTCACACGA GATTCCGTCA AGCAATGAAG GAATACGCGT AACGGATCAG ATAGCCCCGA ATATCCGCAA GACAGTCCCG AATACAGACC GAATCGCTTT TAGGTACTGG CATGAAGACG AATTGATAAG CAACCAGAAA TCTTGTCGGC AGCCGCACTG GGCATTCTTT CACTTTCCTA CCTGCAACGC CTTTCACGAG ATGCCACTCG AACGTGAATA TTTTGAAGCT TCACAAGGCC GACAGGGTTC CGGAGTAACT GAACTCGACA GCTATTATAT CAACAGCGGC TATTATCGTG ATGTTTGGGT GGTCGCAGGC TCTGCGTCAC TTGGCAGGCT TATTCTCAAA ACATCCAAAT TTGAATTTGA CATAAACTAC AAAACCCTGC ATCAGGTCCA TCGTGAAGCA AACGTGATGG AGCGTTTGTC CAGCAACCCG TCCATAGTTG ATATTTACGG TCATTGCGGA GGCTCGGTGG CAGCTGAAGC CATATCGTAT GAAGTTGAGC GATACGTTGT CGCCGGATCA GGCTATGTGA ATCCTGGCCT AGGGGCTGAT CAACCCGCAG ATCTTTCACC ACAAAATGAT TTCACGCCGT CAGAAAAATT CCGCATGGCT CTCGCCATGG CCGAATCGAT TGCAGCTCTT CATGGCTATC ACGGTGGTGT TATTGTTCAC GACGATATCC AGTTGCGACA ATGGCTGCAA ACCAAAGACG GAATATTGAA GTTGGGCGAC TTCAATAGAG CGTATGTCCT AGATTGGAAC GACTCCACAC AGGCATACTG TTCATACAAC AATGGACAGG CATTTGGAAA TGTAAGTATG ATCTTTGCTG ACCAGTTTGA GTTGTACAAT TACTTCATCT GAGAGTAATC ATTTTTTGGG CCAGAATCGT TCACCGGAGG AATATCAAGC CGGAGAATTA GACGAAGCGA TCGACGTCTA TTCCTTTGGG AATTGCTTGT ACAGTCTGGT AGGTTGAACA TAAAGAGCTG TGACAGCCTC CGCATTGTTT TCAAGTTAAC TGACACCATT TTCTTGTTCT TGCCATACTA GTTGACTGGG CTTTGGGTCT TCTACGAAAA TGAAGATGAT GCTATTGTGC AAGAAAAAGT TTTGACGGGG AAGCGACCCA TGATTGATAT CCGCTACCGA AACCGCAGTC TTGAGGAGAA AATTTTAGTC GAAGTAATAG ACGGGTGCTG GCAACCAGAT CCGAAAAAGC GGCTTGACAT CTTTCAGGTT GTTCGAAGAC TTCGAGAAAA CTCACAGGCA CTTTGA
|
Protein sequence | MTGWSSCAPS AQKFSNLKKA TRHPVGTGIR TEEWRKPVVV PAERCGTEKV KGELPVARKL QKQQRQIPTI LLVAVLLLFS FCMLKSFRKT VHHHAIRRDH LHQYSDVSEK TVTSIRQASG ILLHNNQIGR TPIIQPQIFL PTVNEDGTTE KKREAASHEI PSSNEGIRVT DQIAPNIRKT VPNTDRIAFR YWHEDELISN QKSCRQPHWA FFHFPTCNAF HEMPLEREYF EASQGRQGSG VTELDSYYIN SGYYRDVWVV AGSASLGRLI LKTSKFEFDI NYKTLHQVHR EANVMERLSS NPSIVDIYGH CGGSVAAEAI SYEVERYVVA GSGYVNPGLG ADQPADLSPQ NDFTPSEKFR MALAMAESIA ALHGYHGGVI VHDDIQLRQW LQTKDGILKL GDFNRAYVLD WNDSTQAYCS YNNGQAFGNS NHFLGQNRSP EEYQAGELDE AIDVYSFGNC LYSLLTGLWV FYENEDDAIV QEKVLTGKRP MIDIRYRNRS LEEKILVEVI DGCWQPDPKK RLDIFQVVRR LRENSQAL
|
| |