Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_18877 |
Symbol | |
ID | 7198026 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | + |
Start bp | 251901 |
End bp | 253359 |
Gene Length | 1459 bp |
Protein Length | 455 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178193 |
Protein GI | 219114795 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCCAGT CGAGAGAGAA TAAAGACATT TCCGATTTGA CCTACAAGCT AGAAACAATG GGTTGCCAAA TGAATATGGC TGATTCCGAA CGAATCGAGG GTCAATTACA AGGTCTCGGT ATTCGACCTT TAGATCCAGA CGTAGACAAG AACAAACAGC CGGATGTCGT CATATTGAAT ACCTGCTCCA TTCGAGATCA CGCCGAGCAA AAAGTTTATT CATACATTGG ACCCCACGCC AAACGCAAAC GCGACGGGGA GGACGTGACG ATTATTGTAG CTGGCTGCGT CGCCCAGCAA GAAGGGGAAG CGCTGTTACG ACGTGTACCG GAAGTTGATC TCGTCATGGG ACCGCAGTAC GCTAATCGGA TTGGCGATTT ATTAGAAGAT GTCAGCAATG GCAACCAAGT TGTGGCCACC GAAGCCAGCC ATATCATGGA GGATTCCACA AAACCACGAC GACAATCAAC GGTCGCCGCG TGGGTGAACG TCATTTACGG CTGCAACGAG CGATGTACCT TTTGCATTGT ACCTACCACG CGTGGAGTAG AACAATCTAG GCCTGCTGAG AGTATTGTAA GAGAGGTTAC TGAACTTGTG GAACAAGGGT TCAAGGAAAT CACGCTATTG GGTCAGAATA TTGACGCCTA CGGCCGTGAC ATGATCCCGA AGCGAAAATT TTCGGATTTG ATCCGGATTG TTGGTGAAAT ACCAGGATTG GACCGCTTGC GATTTGTAAC GTCTCATCCT CGTTACATGT CGCTGGGCGT CGTGGACTCC GTTGCCGAGA CACCGGCAGC TTGTGAATGC TTTCATATTC CTTTTCAAAG CGGATCTAAC GAGATACTCG CCGCTATGGG TAGAGGACAT ACCCGAGAAA AGTACCTGCA CATTGTGGAT CGTATCCGAT CGGTAAGTTA ATGTCGACGG TCTTTTCCAA TCGGTACTCT CTCGTACTGT CGTTGCGTTG ACTCACACAT TTCTTCGTTT ACTAAATCCA CAGAGAATAC CGGATGCAGC GATCACCGCC GACGTGATCG TGGGCTTCCC TGGGGAAACG GAGGAGCAGT TTGAAGACAC CTTGTCCTTA ATGCGCGAGG TGGTTTTTGA TTCCGTCAAT ACAGCCGCGT ACTCTCCCCG CCCCAATACG CCGGCAGCCG TTTGGGACGA CCAAGTTGAC GACGCCGTCA AACAGAATCG TCTGCAACGG ATCAATGCAC TCAATCTAGA ACACGCCGCT CAACGTCGGG CCCGCATGAA GGGGCGGACG GTCGAAATAT TGGTGGAGGA ACGCAACGTA CGCGTGCCCA CGCAAGTAAT GGGTCGTACG CGGCACGGGT ATATTGTCTA TTGCGACGGT GAGATTGATG AGCTTCGTGG AAAGCTAGTC AACGTCGAGA TTGACACCTG CGAGCAATAC TATCTTGCCG GAAAGCCAGT TGCCCAAGAT GGACACTGA
|
Protein sequence | MGQSRENKDI SDLTYKLETM GCQMNMADSE RIEGQLQGLG IRPLDPDVDK NKQPDVVILN TCSIRDHAEQ KVYSYIGPHA KRKRDGEDVT IIVAGCVAQQ EGEALLRRVP EVDLVMGPQY ANRIGDLLED VSNGNQVVAT EASHIMEDST KPRRQSTVAA WVNVIYGCNE RCTFCIVPTT RGVEQSRPAE SIVREVTELV EQGFKEITLL GQNIDAYGRD MIPKRKFSDL IRIVGEIPGL DRLRFVTSHP RYMSLGVVDS VAETPAACEC FHIPFQSGSN EILAAMGRGH TREKYLHIVD RIRSRIPDAA ITADVIVGFP GETEEQFEDT LSLMREVVFD SVNTAAYSPR PNTPAAVWDD QVDDAVKQNR LQRINALNLE HAAQRRARMK GRTVEILVEE RNVRVPTQVM GRTRHGYIVY CDGEIDELRG KLVNVEIDTC EQYYLAGKPV AQDGH
|
| |