Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48815 |
Symbol | |
ID | 7195074 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011687 |
Strand | - |
Start bp | 329899 |
End bp | 332109 |
Gene Length | 2211 bp |
Protein Length | 519 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183467 |
Protein GI | 219126444 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CATGACGCGC GCGGATACCA AGGACCAAAA GTGACTCTGC TTTCCGTGTA CACATATAGA AACTCACACC CAAACCAAAT ACATAGATGG ATAACTAGAG GTTGTCGACG GTGTTCTTTT TTTTTGACCG TCTAGAGGGA ACAAGAAAAA AGTTGCCTGC GGAGTTTCTA GGTCCGAGAA AGGGTGGCAC GAACATCGTT TGCGCGCGCA CTCGAGAGAG GGTGGTGACG CTACGAGACG CGAACGTTTT TTACAATATG GCATACATAC CAGACAAGTT CGAACGCTTT GAAGCTTGGC TTCGAGAGAA TGGAGCCCAC TTCGAAGCGG TAAGTCAATC GGAAGCCTTT AGCTTGAGTA ACGGACGAGC TGTGTGCTGG TCGGCAGTGA GTGAGCGAAA TCCTTAACTG TCTCTGTTCT CTCTTTCTTG TATAATAGCT CGAACTCCGT GAGTATGATT GCTTGAATAG CAGCAACACG AGTGCTTCCG AAGTCGATGA CGAAGAAAAG AAGGAATCGC CTCTTACGGT GTCCCTGGAA GAAGTTTCCG AAATGCGGGG CGTTCATGCG CGACGAAGCA TTCCTCCACA CACCACGTGT GTTAGTATTC CTCGTCGCTG CTTGATTACG GTTGAAATGG GGCAGGCCAC ACCGATTGGC CGAGCTATTC TTCAGGCCGA TTTAGATCTA GACGCTCCCA AACACATCTT CCTCATGATA TATCTTCTCT GGGATCGCAA GACACACGGG TCCTCGTCCT TCTTCCACCC CTACTACGAA ATCTTACCAC CTACGCTTCG GAACATGCCT ATCTTCTGGT CCGCATTCGA ATTACAGGAA CTTGAGGGTT CGCATCTTTT ATCACAGATT GCCGATCGGG GTCAGGCCAT CCAGGATGAT TACGAAGCTA TTCTGGAAGT GGCGCCGTCC TTGGGAACCT TATGTACACT CGACGAATTC AAATGGGCCC GCATGTGCGT TTGCAGTCGC AACTTTGGTC TCCAAATTGA TGGACACCGG ACTTCAGCCC TCGTACCGCA TGCAGACATG CTCAATCATT ACCGACCAAG GGAAACGAAA TGGACCTTTG ACGAAGTCAC GCAGTGCTTC ACCATTACCT CGCTTCAATC GATACAAGCC GGTGCGCAGG TCTACGATTC GTACGGACAA AAGTGCAACC ATCGTTTTTT ATTAAACTAC GGATTCGCCG TTGAAGACAA CCGGGAATTG GACGGCTTTT GCCCCAATGA AGTGCCGCTG GAATTGTACG TGGATCCCGC CGACATACTT TTTCAGGATA AACTGGAGTT CTGGACGCGA GGGGAAACGA ATCAAATATC GGGAGCGGTC ACCGCCGGTT TGATTGCCCA AGCCGTTGGT GGGAGCATGG GTAGAGGCGT ACCGTCCCAT GCTGCGGAAT CGTACACTTC CGGGCCCGTG GTCAAACGCG TCAGGGTATG TGTATCGAAC AACGAGAATA CTCGCTTGCT ATTTTCGTTG CTTCGGGCTT TAGCGTGCAA CGAAACAGAG CTATCCGCCA TCGCTTCGCC CGTTTCCAAT GATGGGGTAA CACGTGCACT GTTCGGTCTG GGGGCTCCGC AACCGCTGTC GGGTTCCCGG AATCACCCTT TTGCACCTTC GTTGAGCTAC CATCGCTCCT GTCGGGATAT TCGGCATCCT ATTTCGTTGC GTAACGAACG GGCGAGTATG AAGCTGCTGT TGTCTCTCTT GCACCGGCAG CTCGCCTCGT ACCCCACTAC AATCTCTCAA GACATGGCCG ATTTACAGGA TGAAGCGAGC TATCCGCAAT TCTCGAACCG TCGTCACGCC AAAATACAAG TGCGTGGTGA AAAGGAAGTT CTGCACCATT TTCGCGTATG GTCGGAGACG GCTCTAGATA TGCTGACCTT CATTGAGGAC GAGCTCAAGG AACAACAGCA GGGTAATGTT GAAGTTGAGC TGTTGCAAAA CAATCGTACC TCGGTAACGA CCACGGGCCA AGGTGTCGTG AATTTCGACG ATGTCATTCG GGATATGGAA CAGGACGATG ATGTACACCA TACAATATTG CGTTACTGTG CCGATGTGCT GGGATCATTG CGACGCGAGG AATTCAAGCA CTTGCGGTAC GCACAGGAGC ATGCATCCGG AACGAGAGGC GGAGCGAGAT CGTCCTTCAG ATCAAACAAT GGCTTGAGCG GTTTTTTTTA A
|
Protein sequence | MAYIPDKFER FEAWLRENGA HFEALELREY DCLNSSNTSA SEVDDEEKKE SPLTVSLEEV SEMRGVHARR SIPPHTTCVS IPRRCLITVE MGQATPIGRA ILQADLDLDA PKHIFLMIYL LWDRKTHGSS SFFHPYYEIL PPTLRNMPIF WSAFELQELE GSHLLSQIAD RGQAIQDDYE AILEVAPSLG TLCTLDEFKW ARMCVCSRNF GLQIDGHRTS ALVPHADMLN HYRPRETKWT FDEVTQCFTI TSLQSIQAGA QVYDSYGQKC NHRFLLNYGF AVEDNRELDG FCPNEVPLEL YVDPADILFQ DKLEFWTRGE TNQISGAVTA GLIAQAVGGS MGRGVPSHAA ESYTSGPVVK RVRLASYPTT ISQDMADLQD EASYPQFSNR RHAKIQVRGE KEVLHHFRVW SETALDMLTF IEDELKEQQQ GNVEVELLQN NRTSVTTTGQ GVVNFDDVIR DMEQDDDVHH TILRYCADVL GSLRREEFKH LRYAQEHASG TRGGARSSFR SNNGLSGFF
|
| |