Gene Information |
Locus tag | PHATRDRAFT_39304 |
Symbol | |
ID | 7195037 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011687 |
Strand | - |
Start bp | 170641 |
End bp | 171825 |
Gene Length | 1185 bp |
Protein Length | 394 aa |
Translation table | |
GC content | 62% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183433 |
Protein GI | 219126372 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.635663 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGGGGAA CCGACAAGGC GGGAGCCAGC ATTCTCAAGA ACCCGAACGA CCTCGATACG AAACGATTCG ACGTGCCCGT ACAAATTGAT GACGTCGCGT CTCGCTTTGT CGACAACAAC AACAATCACA ACAACCACAT TACGATCGAT CACGTCGTGT GCGGTCCCAC GGACACGGCT TGGATTCTCA CGGACGGGCG TGTGTGGGTG CATGGCGAGA ACCAGTCGGG ACATCTCGGT GTGGGACACC GCAACGTGGT GCCCCAACCC GTCCCGCTGG TCTGGCCCGA TCCGGACACC GCCCCACGGA TCGCGTCCGT CGCACTCGGC GCCTCTGCCG CGGCCTGGAT CGACACGGAC GGTGACTTGT ACACCACCGG GTACGGCGGG TCGGCCTTTG CCGGTCGCGG AGCGCTCGGA CACGGACCGG ATCACGATGG ACGGTACGTC CTACAACCCA CCCTGGTCAC CTCACTCGTC GAAGACGGAG TGTATGCCCG ACAAGCACAA GTGGGCGAAG CCCACTTGAC CGTACTCACC ACCGAAGGCG AAGTCCTTAC CGCCGGCGCC GGCAGTTACG GACGGCTCGG GAATTTCGAA ACCATCGATC AACTCTTTCT GGAACCGGTC GAAATACTCA CCTCGGGAGT CACGCAAATC GCCGGAGGCA AGTCCTTTAC ACTCGCCCTG CACGAGGACG GAGTCCTCTA CGGCTGGGGC CGCAACCACA AGGGACAACT CGGGACCGGC CTGGGCCTCG CCGTGGACAT GTACGCCATG CAGGCCGTGC CGGAACCCAT CGAAGGGGAC GAACTCCGCC ACCGCGTCGT CACGCAAGTC GCCGCCGGAC ACTCCCACGC CACCTGTCTC ACCGAAAGTG GCGAAGTCTT TTACTGGGGA GATTCGGTCT ACCTTGAACC CACCCGCGTC GACGCACTCC TCGGCGTACG GATCGTCGAA CTCGCCTGTG GAGAACACTA CACGCTCTGT CGCGACGAAC ACGGGAAACT GTACAGTTTC GGCAAGGGCA AAACCGGAGT CTTGGGACAA GGGGGTGCCG TCAAACAAGC GAATCAAGCC GCCCACCTGG AAGCCTTGGA CGGTGTCTCC GTCACGGCAA TCTCCGCCGG GTGGAAACAC GCGGCCTGTT TGGCCACCAA CAGTACCCCC ACCACAATAC AGTGA
|
Protein sequence | MWGTDKAGAS ILKNPNDLDT KRFDVPVQID DVASRFVDNN NNHNNHITID HVVCGPTDTA WILTDGRVWV HGENQSGHLG VGHRNVVPQP VPLVWPDPDT APRIASVALG ASAAAWIDTD GDLYTTGYGG SAFAGRGALG HGPDHDGRYV LQPTLVTSLV EDGVYARQAQ VGEAHLTVLT TEGEVLTAGA GSYGRLGNFE TIDQLFLEPV EILTSGVTQI AGGKSFTLAL HEDGVLYGWG RNHKGQLGTG LGLAVDMYAM QAVPEPIEGD ELRHRVVTQV AAGHSHATCL TESGEVFYWG DSVYLEPTRV DALLGVRIVE LACGEHYTLC RDEHGKLYSF GKGKTGVLGQ GGAVKQANQA AHLEALDGVS VTAISAGWKH AACLATNSTP TTIQ
|
| |
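The Start bp, End bp, Gene Length, and Protein Length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the unspliced CDS ends in a stop codon (the gene sequence above ends in TGA). The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a complete, unspliced CDS.

    Assumes the final codon is a stop codon, which is not translated.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record: Start bp 170641, End bp 171825.
length = gene_length_bp(170641, 171825)
print(length, protein_length_aa(length))  # prints: 1185 394
```

Both results agree with the Gene Length (1185 bp) and Protein Length (394 aa) values listed in the record.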