Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_46146 |
Symbol | |
ID | 7201240 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011677 |
Strand | - |
Start bp | 402689 |
End bp | 404586 |
Gene Length | 1898 bp |
Protein Length | 568 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180630 |
Protein GI | 219119754 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000043051 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TCGCGATTGG ACTGGGGGAT TGATTGGCGA AAAGTTATCG GTTGCGATCA TTTTTTCCGA ATGGCATACT CACTTATATA GTAGGTACTA GGATTAGTTC CCGTGGCGGA CCATGGAACT TTCCAAAATA CTTTTACAAC GTCGGGGTGC CGATCTTGAC GACAATCCAT ACGCGGTTCA TCAAATCCAA CACTCATGCC GCACCATCTC GTACAAGCAT GCCAAGGAAT ACTTGGTGCA ACATCAAGAG TGGCTCAAAA CTAGTATCTT CGATTCCTTT GAAGAACGAA TCACTCATGC CTCGCCCGAC TCGACGATCG TGATAGCCTA CCTTTCCAAC AACTCACCAG ATCTATTCCT GTCGGTCATC TCAGCCACGA GTAGTTGTGT AAGGGCTCTC CCTGCTCTCA TTAATACAAG ATGGACTGTG GCTGAAGTTG GACAGGCGCT ACAATCCACA AATTCTAACG ACATGACGTT GCTGTTGTAT GGCCCAGAAT TTCGTGAGAC AGCTGAAGAA GCAAGCAGAA TCATTGCTCA TTTCGTTGTA AGCCTTCCCA TTCCCTCCTT GGTCCGACTT GGTCCAATCA TCGTTCCCGA GAGCGACAGA CGACGATCCG CCACCACAAA TGACTACGAC CTTAATGCCT GTATTCTCGA GCTGAGTCGC TTACCGCTCA GCCACGATGA CGCACTAATA GTCTTCACGA GCGGAACAAC CAGTGGTCCC AAAGGAGTCC GACTGAGCCA TCGGGCACTG GTCATGCAGG CGTTAGCTAA GCTTCAGCCC CCTTGCCGAT ACTCCTCGAA AACTGTAATG CTCGCTACAA CTGTTCCTTT CTTCCATGTC GGAGGTCTCA GCAGTATTTT GGCAGTCTGG TTGGCCGGTG GGACACTCAT CTTCCCCGGA GCTCCCGGTA TGAGCTCAAA GTTTGACCCC TCTCGTCTTC TGGATTCTCT TTCTCACGCC CTACCGTCAA ACACGCTCGT TATGGTGCCA GCAATGATCT TTGCCGTTCA AAAAGAGATG CAACCGGGCG AAACCTTTCC TTACGTTGAC CTCATTCTCA TCGGGGGACA ATCTGCTTCC AACACCACAA TAGACTTTCT CACTGAAACA TATCCCAACG CAAGGGTAGT GCAAACGTAC GCCTGTACGG AGGCTGCTTC GTCAATGACT TTTTTCGATG TAACAGCGGA GCGTTCATGT CTCGTCCAAT CTCTTCCTCT GGCTGGAGAC TGTGTTGGGG TGACGCCTCG ACACGTACGG ATATCTATCT TTGACGCAAC TAAACAGCCC TCGCTCGAGG CTGTTGAAGA ATCTTATACA CCAGGTATCA TTGGTACCGC TGGCGCACAT GTCATGAACG GCTATTGGAG ACGTGGTGGA CTCAATCCTG TTCGCCGCTT TGAGGAATGG TATTTGACAA ACGACTTGGG TTTCTGGGAC GAGGAGAACC GACTATATTT TTGTGGACGA GCCAACGATG TAATTCGTAC AGGTGGAGAG ACTGTCTTGG CTTCCGAAGT GGAGCGGATC CTGGCGATGC ATCCAAGCGT GGTTGAGTGT GCCGTCTTTG CTCTGCCTGA CGAACGATTT GGCGAAGCGG TATGCTGCGC TCTGGTATGC TCGGGGTCGT GTCCATGCGT TTCAGAAGTA CGGAAGTTTT GCGCGAAGGA GGGAACTTTA GCTGGCTACA AGAGACCGCG TAGGTTGTTT GAAGTGGAAG AATTACCCCG GAATTCATCA GGGAAGATTC TGAAATTCCT ACTTCAGGAG CGTTTCAAAG ACGCAGGGAG ATTGCGAAGC AAGCTCTAAC ATTACAATCC AATCGAGAGC AGCAGACTTT GCTTTCAACA AAATAAGTAT TCAGGTCTAG AGAGAGAAAT AGATTTTA
|
Protein sequence | MELSKILLQR RGADLDDNPY AVHQIQHSCR TISYKHAKEY LVQHQEWLKT SIFDSFEERI THASPDSTIV IAYLSNNSPD LFLSVISATS SCVRALPALI NTRWTVAEVG QALQSTNSND MTLLLYGPEF RETAEEASRI IAHFVVSLPI PSLVRLGPII VPESDRRRSA TTNDYDLNAC ILELSRLPLS HDDALIVFTS GTTSGPKGVR LSHRALVMQA LAKLQPPCRY SSKTVMLATT VPFFHVGGLS SILAVWLAGG TLIFPGAPGM SSKFDPSRLL DSLSHALPSN TLVMVPAMIF AVQKEMQPGE TFPYVDLILI GGQSASNTTI DFLTETYPNA RVVQTYACTE AASSMTFFDV TAERSCLVQS LPLAGDCVGV TPRHVRISIF DATKQPSLEA VEESYTPGII GTAGAHVMNG YWRRGGLNPV RRFEEWYLTN DLGFWDEENR LYFCGRANDV IRTGGETVLA SEVERILAMH PSVVECAVFA LPDERFGEAV CCALVCSGSC PCVSEVRKFC AKEGTLAGYK RPRRLFEVEE LPRNSSGKIL KFLLQERFKD AGRLRSKL
|
| |