Gene Information |
Locus tag | PHATRDRAFT_45012 |
Symbol | |
ID | 7199679 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | - |
Start bp | 994469 |
End bp | 995992 |
Gene Length | 1524 bp |
Protein Length | 437 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179104 |
Protein GI | 219116618 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.380882 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCGAATTGCC CTCGTTCCAT CCGGATACCC CCGAAACGCA CAACGGCAAA GACCCGTCAC AATTCCCCCC ACCACAACGC CAGTGAGCAA GTATCGGCGT CGGGATGGCA CGGGAACAAC AATCGTCTCG ACTCCTCGTC GCGGGGTCAC GACTGTTGCA GTTTGCAACG AGTGGTGCCG CGGGAGGTCT TTTCGTCGGT CAAGAAGATA TCGACGCGAA CCGGACGGTG GTTCCGATCA CGGAAACCAA CGCCGACACC AACGAACCGG CAGACCGGGA CAACCTCGAT CGGGTCAAGG CGTTCTTTTT GATTCTCTTG TCCGGGGTCG GGATCGTTCT AGGGCTCATC GTCATCACCT ACCTCGTTAT GATTCTTCTA GACAAGTGTA CGCGCCGTGG CGATGCGGAA GAAGAACGTG ATCACGGGGT GGTCTCGCGG AAAGCCGGTT TGTGGGGTCT GAAGCAATCC GAACGCCAAG CAATACTGGA GTACATTTTT CGAAAACACA AGACGGTCTT TGAGTACAGT CAAGAATTGG TACCAGCTGG GAACAATTCC GGAGTCGACA CGGGGGAAGC GATACCCGTA CTCGATGCGG ATGTTTCGAC AACGAGCGAC AGTGGAAGAG ATCGTTTCGG GAATGAGACA ACTTCAAAAG ATAGTCCCGC CGTCAGTGAA AAAGATCAGG GTATCATTTC CGAGCCATCG GTCGAGTCGT CGCGCTCCAT ACCCGGACGC GCACCGGTCG ACGAGCACCG GGACGAAAAC GACACTGCAA CGTTCCCGGA AGACGATTCG GACGAAGATC AGGCCAGCCA CGACAATCTC TTCTCGAGCG CCACCACGGA ATCCAGTGAC GACCTCGCGT TGACCGAGCA CGACGACAAC GACCACGACC GCGTGTGTTG TATCTGCTTG GCTCCTTACG AAGCTGGCAG TACCATGTTA ACAGCGCGAA CGTGTCCACA CCAGTTCCAC TACGATTGTT GCATGGAGTG GCTCGTTGCC TTTCACGACC ATTGCCCGTA TTGCCGGGTG GAAATGATGA CACCGAACCA GATGCGCAAA GCCGCACGCA AGGTACTCGG TCAAGCTCGT GTCACCGAAC TGGGCATGTG GCAGCAGTAC CAGCAATCAA GGAATGATCA TAACACGGAT CCCACGCAGG TACGCATTGG ACGAGAACTC CAAGCCGAAG TGGAGTTGAC CGTGCAAACG GAAGGTAGCA ACGATCAAGG TGCAATTTTG GACGTCTCGG TTGCCGAACA GCACAGAGAT ATCGAATCAG GGACAGTAGT TGATATGGAA AATTTTACCG ATAGTGGGAC AGCTGGTCAA GACCACCCGG CTCATGGAGA TTCCAGCGCG AACGGTGACT GCACGCGGAC AACAGTTGTA AGTCACAATC ACGAATAATC GAAAGCTGTC TTCACAGTTA GTATTGAAAG AACGCTGGAT AGTTTTGTGT TCATTTGCAG TCAGTGTAAA ATAATGATAG CGTCTTTTTG TTTTCTTTAT GGTT
Protein sequence | MAREQQSSRL LVAGSRLLQF ATSGAAGGLF VGQEDIDANR TVVPITETNA DTNEPADRDN LDRVKAFFLI LLSGVGIVLG LIVITYLVMI LLDKCTRRGD AEEERDHGVV SRKAGLWGLK QSERQAILEY IFRKHKTVFE YSQELVPAGN NSGVDTGEAI PVLDADVSTT SDSGRDRFGN ETTSKDSPAV SEKDQGIISE PSVESSRSIP GRAPVDEHRD ENDTATFPED DSDEDQASHD NLFSSATTES SDDLALTEHD DNDHDRVCCI CLAPYEAGST MLTARTCPHQ FHYDCCMEWL VAFHDHCPYC RVEMMTPNQM RKAARKVLGQ ARVTELGMWQ QYQQSRNDHN TDPTQVRIGR ELQAEVELTV QTEGSNDQGA ILDVSVAEQH RDIESGTVVD MENFTDSGTA GQDHPAHGDS SANGDCTRTT VVSHNHE
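
The fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates (which is consistent with the Gene Length reported above) and that the sequences are stored in space-separated blocks of ten residues. The function names `clean_sequence`, `span_length`, and `gc_percent` are hypothetical helpers for illustration and are not part of the source database.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks that group residues into blocks."""
    return "".join(raw.split()).upper()


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # Coordinates taken from the Gene Information table.
    print(span_length(994469, 995992))  # 1524, matching the Gene Length field
    # First two blocks of the gene sequence, used only as sample input.
    print(gc_percent("CCGAATTGCC CTCGTTCCAT"))
```

Running `gc_percent` on the full Gene sequence value gives a figure that can be compared with the GC content field, keeping in mind that the table reports a rounded percentage.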