Gene Information |
Locus tag | PHATRDRAFT_16674 |
Symbol | |
ID | 7199045 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011696 |
Strand | + |
Start bp | 199208 |
End bp | 200335 |
Gene Length | 1128 bp |
Protein Length | 376 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185148 |
Protein GI | 219129968 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAGATA CAGCGCCGCT GCGGTACGTG GAGTTTTTCG CCGGCGTCGG AGGCTGGACA ATGGCCTTGC AGGAAGCCAT CCAAATCGTC TATCCGTCAG ATCCACCCGA GCTGTTCTGC AGTGCTGCTC TCGATCACTC AGACCTTTGC ATTGAAGTTT TTGAGCACAA TCATAGTCTA GTCATCCAGA AAGCGGTACG GATTGAGAAA CTCACAATGA ACCAAATATT CGAATATCGA GCCGATATTT GGATGATGAG TCCGCCTTGC CAACCACATA CACGACAGCA TTCTAATCAA GACCAAGAAT TGGAAGATCC TCGCTCCCGT AGCTTTTTGC ATCTGTGCGA TCTCTTGCTT GAACTCCCGT CTGAAAACTT GCCCAAACTT ATTTTTCTGG AAAATGTTGT TGGTTTTGAA AGTTCGCAGA GCTGTCGAAA ATGGAACACA ATCTTGCAAA GTCGGCAGTA CATCATTAAG CACTTTCATT TGAACCCAAC ACAAGTTGGT GTTCCCAACG ATCGTCCCAG GTACTTTTGC TTGGCCGTTC GCTCCACCGA GATTCATGAC TCTAACGACA ATGATCTACA ATTTCACGTA CACGCAAAAA CGAAAATGGC TGACAGTGAT CTACGTCCAA TTACGCCAGA TACAAATTTG CCCAATCTGA ATATCAAAGG TTTGCGCGAC TCTACTGTTA AAGTTTCTTC TGTGGCCGAA TTTTTGGATA AGGATTTGAC CGAGCATCAA AAGACCTCTT TGCGCATACC GCAAAGTATT CTACAGCGCA ACGCTGCTTG GTGCTTTGAC ATTGTGACAC CGGAGAGCCT ACGTAGTGCT TGCTTTACAA GCAGCTATGG AAAGTTTGTC AAAGGTACAG GAAGTGTCCT TTATACGGGA CCTTATCGGG ACAGAATTCG TTTGACCAAC CCCGAGGACC GGAAGTTTGA CGATGCCTGG GACCAGGGAC TCGACTTGCC CAAGCATCTA CGATACTTTT CTGGATCTGA ACTGGCACGA ATCTTCGGTT TTCCTTCCAC CTTTTCATTT CCGGAAACGA TAACAAGGAA GCAACAATGG AAGCTCATTG GAAATTCTTT AAATGTTCGA GTGGCGGCAA AACTTGTT
|
Protein sequence | MPDTAPLRYV EFFAGVGGWT MALQEAIQIV YPSDPPELFC SAALDHSDLC IEVFEHNHSL VIQKAVRIEK LTMNQIFEYR ADIWMMSPPC QPHTRQHSNQ DQELEDPRSR SFLHLCDLLL ELPSENLPKL IFLENVVGFE SSQSCRKWNT ILQSRQYIIK HFHLNPTQVG VPNDRPRYFC LAVRSTEIHD SNDNDLQFHV HAKTKMADSD LRPITPDTNL PNLNIKGLRD STVKVSSVAE FLDKDLTEHQ KTSLRIPQSI LQRNAAWCFD IVTPESLRSA CFTSSYGKFV KGTGSVLYTG PYRDRIRLTN PEDRKFDDAW DQGLDLPKHL RYFSGSELAR IFGFPSTFSF PETITRKQQW KLIGNSLNVR VAAKLV
|
| |
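The length fields in this record can be cross-checked against the coordinates. The sketch below is illustrative only: it assumes Start bp and End bp are 1-based and inclusive, which is consistent with the listed Gene Length. The helper names (`span_length`, `clean_sequence`, `gc_percent`) are hypothetical and are not part of the source database.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def clean_sequence(raw: str) -> str:
    """Remove the whitespace that groups bases into blocks of ten."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Values copied from the record above.
start_bp, end_bp = 199208, 200335
gene_length_bp = 1128
protein_length_aa = 376

# Coordinates and Gene Length agree under the inclusive-coordinate assumption.
assert span_length(start_bp, end_bp) == gene_length_bp
# Gene Length is exactly three bases per listed amino acid.
assert gene_length_bp == 3 * protein_length_aa

# Only the first three blocks of the gene sequence, for illustration;
# paste the full Gene sequence field to compare against the GC content field.
first_blocks = clean_sequence("ATGCCAGATA CAGCGCCGCT GCGGTACGTG")
print(len(first_blocks), round(gc_percent(first_blocks), 1))
```

Running `gc_percent` on the full cleaned Gene sequence field gives a value to compare with the GC content field, which the record reports rounded to a whole percent.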