Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49251 |
Symbol | |
ID | 7195546 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011689 |
Strand | - |
Start bp | 376984 |
End bp | 378294 |
Gene Length | 1311 bp |
Protein Length | 407 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183981 |
Protein GI | 219127519 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.392148 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATCCCT GTAGTGTATT CAATGTGCTT CTCATGATGA ATTCGCTGAC AGCGGCGACG GGCATATCGG TGAAGAAACA GTCGCCTCGT TCAAGCACGA TTCCATCATC TTTACCAGTC TCCTACAAAC ACGATTCGGC TCTTTCGCCA CATCGACGGG TCCAAAGTCG CGAACCTGCC GGGGCGGAAT GTCTCTTTTC TCGAGCTTCG TGGATGCGCC ATCGTTTGCT GCATCACTCC ATAACACGAC CGCAGCGTTG GCTCCGACGG TAAGTCTGCC TCCGACCACA CTGGCAAAAG CGATATCAAT GATGGTGGTG TACCTCAGAC TCCCAATGAA CTGCAATTCC GTCCAGTATT TTGGTATTGG AAAAGATCGA CGATAAGGTA CAGGCCCCGT CTCACAATGA AACGAATGGA GAGAATTCAC TTAGTTTTCA AACTGTCTCA AAGGATCTTT CGCTACCCTG GTGGGCATAC AATCTCAAAG ACGTAAAAGA AACCCAGCGA CTTGCCAGCA TGAAACTCCT GTTACTAGCT GATTTGGAAG CCATTGAAAG TCGCACTGTC CAACTTCGCG GTCAAACAGT CAACGTTACG GAGGCTTTTC CTGATGTATA CAGTGATTTA CGTATGTTGC GTTTTTTACG GAAAGAAAAA GATCAGGACC CCTTGTCTGC AGCTGAACAC TACCGCACCT TTTTACAGTG GCGAGTAACC AATCGCATCG ACGAACGTAT TCGAGTATCC GTTGACCAGC GACCCTTTGA GCACTTGTCT GACGTTGTCG CGGAGCATTT TCCTTGTCGA TTTCATTTGG TGGATGATCT TCCCGATCGA TCGAAACTGC TACCGATTGT ACTGCATGTT GGTGACTGGA AGACATCCGA GATTTCCAAT GTGATTTGTA GCAAGAAAGA GCTATCCCTG GAAGAATTCC TGGACCATTG GATTTATCTT TTTGAATCCT TGCATCGGCA GTTGTATAAA GAAAGCATGA AACACGAAAG CATGATATAC GTGGATGAGA TCTGCGATCT ATCAGGGATG AATCGTCACC AATTTTCTCC CAGCTTCGTC CGCAAAATCA TGAAGCCTTG GTTGTCTTTG ACTCAACAAT ATTATCCGGA AACTACCAAG CAGATCCAGC TGTTGAAGCC TCCCCGTCTT TTGTCTATGG TGTGGAATAC CGTTGCTTGC ATGATTTCCC CCGGTACGGT CGCTAAAGTC CAGCTTGTGT CCAAGTACGA TGGGACGGTG GACGACTTTG TCCGCGAGAT TTACTTCGAC AAACCACACC AGAAAACCTG A
|
Protein sequence | MHPCSVFNVL LMMNSLTAAT GISVKKQSPR SSTIPSSLPV SYKHDSALSP HRRVQSREPA GAECLFSRAS WMRHRLLHHS ITRPQRWLRR ILVLEKIDDK VQAPSHNETN GENSLSFQTV SKDLSLPWWA YNLKDVKETQ RLASMKLLLL ADLEAIESRT VQLRGQTVNV TEAFPDVYSD LRMLRFLRKE KDQDPLSAAE HYRTFLQWRV TNRIDERIRV SVDQRPFEHL SDVVAEHFPC RFHLVDDLPD RSKLLPIVLH VGDWKTSEIS NVICSKKELS LEEFLDHWIY LFESLHRQLY KESMKHESMI YVDEICDLSG MNRHQFSPSF VRKIMKPWLS LTQQYYPETT KQIQLLKPPR LLSMVWNTVA CMISPGTVAK VQLVSKYDGT VDDFVREIYF DKPHQKT
|
| |