Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48900 |
Symbol | |
ID | 7194975 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011687 |
Strand | - |
Start bp | 634348 |
End bp | 635918 |
Gene Length | 1571 bp |
Protein Length | 456 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183522 |
Protein GI | 219126558 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTTCTTGCGT TGCACAGGCA AACCTCGAAT TGACGATCAA GCACCAACCA AACGGGGTAC ATCCTTCATA CCGTCTCACT TGGGTAGCAC GAAGACTGTC TCTGCGACTG CACTTGTCTG ATTTCGTGCT TTCTATCAAG AATTCACTGT CCAGTTTGCA AGCAATATGC TACATCAAGC TGCGCAACGC GTCCAACGCG TCGTGATCCG AACCGCTGTC GGAACGACCG TGGTTGTCGG TACTGTTGAA TGGGCTACCC ATTTTCCCTC GCAAGGGCGA TCGTCTCAAT TCTATCACGA CTTAGTCGAT CACATTGTGA CTCCAACCAT GCGACGAATT TTGGATCCAG AAACGGCTCA TCACGCGGGA ATATTTTTCG CCGAGCAAGG ATTGTCACCG AGATTTCGAC CGTCGGCGTT GGAACAACGC TGCGTTGTCT CAAATTCCGT TTTTGGAAAA ACGTTTCCCA ATCCAATTGG ATTAGCGGCG GGTTTTGACA AAGATGGCGA GGTGGTGCAG GAAATGCTGG ATCTGGGCTT TGGATTCGTG GAAATTGGAA CCGTCACTCC TCAAGCCCAG CCGGGCAATC CAAAACCTCG CATGTTTCGA CTCGTCGAAG ACTTGGGCAT TATCAATCGG TACGGATTCA ATTCGCAGGG AGCAAGTACG GTTGAGAAGA ATTTGAAATC GTTTCGGTCT CCACAACCTA TCGACTCCGC CAAGCTTTCT TGGCCACGTT ACATTTGGAA CTTTTTATAC CCGCCGCGGC ATCAATCCGG TCTCGTCGGT GTAAATATTG GCAAAAACAA AAATTCCCTG GATGCAATTG GAGACTACGT TTGCAACATT CGGCAGCTTT CAAGCTTGGC GGACTATATG GTCTTGAATA TCTCCAGCCC CAATACTGCC GGATTGCGAG ACCTACAGCA GGCAAATTCG TTGCGTGTGC TGCTTACGAC TTGCCTTACC ACGCGCAATG AGATGACGCA TCGTGTTCCA CTCTTGGTAA AGCTGGCACC CGATCTGACT GACGAAGAAT TGGATCAAAT CGCGAATGTA TGTTTAGAGG TCGGCATAGA CGGTATCATC ATAACGAACA CAACTAATCA CCGTCCTGCC GATTTGCTCT CGAAGCATCG CGGTGAGATT GGTGGATTGT CGGGAAAACC CGTGAAGGAT CAAAGCACTG AATGCATTCG TCGACTTTTT CGTCTAACGG ACGGTAAAAT CCCCATTATC GGAGTGGGGG GCGTAGGGAG TGGCCACGAT GCATACGAAA AGCTCAAGGC TGGAGCAAGT TTGGTCCAAG TCTACAGTAT GATGGTGTAT CAAGGCCCGG GAGTCATTAG TAGAATTCGA CATGATCTTG CCACCCTCAT GCTGGAAAAC GGTCAGCGTT CGATTGTAGA CGTGATTGGG GCTGACCACG AGGACATCTT CTGGCGTAAA CGCGAAGAGC GAATTGCCCA AAAGCGCAGA CGAGACACCC GCATCTCGAT TGAAACACTA CCAGTAAAGA AAACGGAGTT TGCTTAAAGT AAAAGAAAAC GTAAGATTGT GGATTATGAA C
|
Protein sequence | MLHQAAQRVQ RVVIRTAVGT TVVVGTVEWA THFPSQGRSS QFYHDLVDHI VTPTMRRILD PETAHHAGIF FAEQGLSPRF RPSALEQRCV VSNSVFGKTF PNPIGLAAGF DKDGEVVQEM LDLGFGFVEI GTVTPQAQPG NPKPRMFRLV EDLGIINRYG FNSQGASTVE KNLKSFRSPQ PIDSAKLSWP RYIWNFLYPP RHQSGLVGVN IGKNKNSLDA IGDYVCNIRQ LSSLADYMVL NISSPNTAGL RDLQQANSLR VLLTTCLTTR NEMTHRVPLL VKLAPDLTDE ELDQIANVCL EVGIDGIIIT NTTNHRPADL LSKHRGEIGG LSGKPVKDQS TECIRRLFRL TDGKIPIIGV GGVGSGHDAY EKLKAGASLV QVYSMMVYQG PGVISRIRHD LATLMLENGQ RSIVDVIGAD HEDIFWRKRE ERIAQKRRRD TRISIETLPV KKTEFA
|
| |