Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49838 |
Symbol | |
ID | 7198664 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011693 |
Strand | - |
Start bp | 41944 |
End bp | 43883 |
Gene Length | 1940 bp |
Protein Length | 557 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184720 |
Protein GI | 219129068 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTCGAG GAAATTCTCG CAAGAAGATG CCTGTCGCTG CAACGACGAC GACGACACAA AGAAAGTGAC AAGAGAAGAA GAGCCTGAAG TAAAGTCAGA AGAGAACCGA CAGACCCCTT GGACAAACAC CTCTTCCAGC GAGAACACCA AAAGTAAATC CGTAAAGACG CCTCCGCCTT TATTGAACGA AAACTCTCCC CAAGAAAGCA AGTTAAATGC CAACGCCCCC CAAGAAGCTG AGCCCGAGCC AAAAATCGCT CGTAAAGGTC AAGAGCATCT CGGCCGTTGG ACTGAGCCAG AACATGATCG CTTTTTAGAA GGTCTCGCAA AACATGGACG TGAATGGAAG AAGGTGGCTG CTTCCGTGCA GACTCGAACC GTCATGCAAG TGCGGACTCA CGCTCAGAAG TACTTTGCTT TGCTGAATGC AGGCCAGACT ATGAACAAGT TTGCTACCAC AACACCACCT ACCCGCCAAG AAGCCGCCAT CAAAGCAGAT CAGAAAGAAA AGGCCCTGAA AACTAAGATG GGCAAGGCTA AGACTTTGGA CGGTGCCGCG GTGGCCAGAT CCGAACCTGC GGTAGCCGTT CCCAATTTGG CGACCGCTTC TCAACCTTCG GTGTCGCAAC CCCATCCGAA CTTGCCTGGA TTGGGTAATC CTGGATTGCC CAACGTCCAC AACTTGCTAG CCATTCAACA ACAGAAGCTG ATGTACCAAC ATCGGGTCAT GGCTCATCAG AACGCCATGT TGCAGCAAAA GACCATACAC ACGATTCCCA GTCGCGTGAT CGATGCGCCT GTACAGACGG CTCCTAACAC CACCGTGGCA AAGGCACCGT CTGGTCCGCT TTTGCAGGGG AACGACATTG TCATTTTCCC CGACACATCT GTCAACCCTG CATCTCATCG GCCCTCTAAC CAAGAATACG AAGACTTGCT ACTCATCAAT TGCTTTTACT GGCACTCGTT ACCAGCCGGA TTTCAGTGGC CATACGTTCA GCGGTTGTAC ACCCTACTCA AGTTGCGAGG ATTACGTATG GTTACTTTGT GGCAGGACGG AAGCGTTCAC GACTTTGTGC ACGGTCCTTG GCAAATTGCC AAAGAAATGC GTGACGAGAC GTTCAAGCAG AAAGCTTTGG CAAAGTATGG CTCGATGCAC GTAACGGGAA AAACGGTGGG ACTCTGCGTC GCCTACAAGG ACAACGAGTG GAATGTGCTT GACGTTGAAA AGTATGCAAA ATGGGATTCC AATCTCGTAT CCGGTGGTGG TTTGGTGCGT CTTCCTCAGG GTGTGACACT CGCTCCGGTC CACTACCAAA CCGATGCCAC GTCGGAAGTC ACGGACAAGT CGACTCCACA AATGACCCCA CCACGAAAGT CAAAGACTCC TCGACAAAAC GAGTCACTTC AACAGGTGCA CAAGGAATTT TCTAAGCGAT CCTCTTCTTC AGCCAGTCTC GATACGGCTC CGGTGAACGC AAAGAAAGAC ACTAAACAAA AGACTGAAAA AAATACTTCC GCTGCGAAAA AGTCTGCCGC CGCCAAAGTT AAGGTGGAGA AGGAAAGTGA CAAGTCTCTC CTGAAGAAAG CGAAGAAAAG CAGAACTAGT GTTTCGTTCC CGTTGAGATC CCAAGCTAAA GAGGCTAAAC CACGAAAGTC GATGTCGGCT TTGAAGAAAC GGTCAGAATC TCCGACGGCA ACAGATCCAC CGCGCTCGGC CCGCCGTTCC AAGCGACAGC GTTTGTAAGT GGGGAAAGGG CGGCTTTTCG CCTTGACCGT TCTAATGTGA TATGTTTGTG ATGGGACCCT TTTGGGAAAT TGGCGGAGTG ATCTGAGCAA ACCAAAAGTT GCGGGGAAGG GAATATACTA CGTGGTCAGT TCTGTATACG CAACGGGAGA ACGTAAAGAA AAGAATATAA ATATGAAATG TGGGGTGTTG
|
Protein sequence | MPRGNSRKKM PVAATTTTTQ RNENTKSKSV KTPPPLLNEN SPQESKLNAN APQEAEPEPK IARKGQEHLG RWTEPEHDRF LEGLAKHGRE WKKVAASVQT RTVMQVRTHA QKYFALLNAG QTMNKFATTT PPTRQEAAIK ADQKEKALKT KMGKAKTLDG AAVARSEPAV AVPNLATASQ PSVSQPHPNL PGLGNPGLPN VHNLLAIQQQ KLMYQHRVMA HQNAMLQQKT IHTIPSRVID APVQTAPNTT VAKAPSGPLL QGNDIVIFPD TSVNPASHRP SNQEYEDLLL INCFYWHSLP AGFQWPYVQR LYTLLKLRGL RMVTLWQDGS VHDFVHGPWQ IAKEMRDETF KQKALAKYGS MHVTGKTVGL CVAYKDNEWN VLDVEKYAKW DSNLVSGGGL VRLPQGVTLA PVHYQTDATS EVTDKSTPQM TPPRKSKTPR QNESLQQVHK EFSKRSSSSA SLDTAPVNAK KDTKQKTEKN TSAAKKSAAA KVKVEKESDK SLLKKAKKSR TSVSFPLRSQ AKEAKPRKSM SALKKRSESP TATDPPRSAR RSKRQRL
|
| |