Gene Information |
Locus tag | PHATRDRAFT_47591 |
Symbol | |
ID | 7202646 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | - |
Start bp | 189754 |
End bp | 190911 |
Gene Length | 1158 bp |
Protein Length | 327 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182022 |
Protein GI | 219123418 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000312282 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGCGGA AGCTGTCGGG AACCGCCGCC TTCTCCTCTC TTTCTATGGG AACCCAACAG ACCGTGCGTC CCAAACAACG TAGAATCTCG CGACGTTTTT TGATGAAATC CGTCGCTATC GGTGTGTTTT CAATAGCTGG CCTTTTTTCA AAAGAGTCGC GCTACTGGTT TCGACATTCC GACGGTGACG AAATTGGGAA AGAAAACCCG TCCAGGACTG TTAGAAGAGT TCTCCAACTT GACAGGCCGC GCTCTGTACG ACACCATCGC GGCGCTCCCT TCCGATACGC TGCCCCTGGG GGATAAGCCG CTTCCCAGGT TGTTCCTAGA ATCTCAAACG CCGTCCAATA ATCGTACTTT GGTAATCTTG ACTGGTGATT TGCGATGCGG CGAAAAGGCT TGGAGCACGT TGTACGAAAA TGTCTTGGAC TTGAACCACG CGGACCTGGC GCTATTCGTG CAGGTTCCCA GTCAGCTTGA GTACAAGAAC GCGTCTCTGT TCGCACGGGC CCGCCACATT GAGTGGATTC CTCGGTACGA CGACTGGGCA GACGCTATTG ATCTAATTGA CGGACCAGCT TGGCGCGAGA CCATCCTCCA ACTCTACCCT GCCGAAACGC ACTACTCAAT TCTAGGGGGG TCGAGGGGTA CAGAGCTAGC GCGGCTATTG TGCACATGTT TCGATATTTT GTGGCGGAAC GAATTAAACG CAATGGCTGG GAGCAGCAGT ACGATCGATT CATTGTAACT CGGACGGATC AATTCTATAA GTGCCCTATT GACTTGTCCA AGTTAGACCC CGAGAGTCTC TGGTTGGCGG AAGGTGAGGA CTTTGGCGGC TACAACGACC GGTTTTACGT TGCGCCATCC AGCTTGATAA GGAAAACCTT GGAGGTGATG CCGACCTTTG TGCGCAAACC CTACATTTTT CAAAACGTCA TACCCGGCCG GAGTATGAAT TCCGAAAAGG TACTGTATCT GCTATGGAGA GAGTTGGGTC TGGTTCCGTC TGTCAAGCGA TTCGTACGGA CCTTTTTCAC TTGTTCCATG CCCATGGATT CGGCCCGGTG GTCCAAAGGT ATACGCATGG TGGACGAGGG TGTGCACATC AAGTACGAGC GGGAGTACGC CAACGCCAGC GGCGTTTCGT GTACATGA
|
Protein sequence | MLRKLSGTAA FSSLSMGTQQ TLAFFQKSRA TGFDIPTVTK LGKKTRPGLL EEFSNLTGRA LYDTIAALPS DTLPLGDKPL PRLFLESQTP SNNRTLVILT GDLRCGEKAW STLYENVLDL NHADLALFVQ VPSQLEYKNA SLFARARHIE WIPRGVEGYR ASAAIVHMFR YFVAERIKRN GWEQQYDRFI VTRTDQFYKC PIDLSKLDPE SLWLAEGEDF GGYNDRFYVA PSSLIRKTLE VMPTFVRKPY IFQNVIPGRS MNSEKVLYLL WRELGLVPSV KRFVRTFFTC SMPMDSARWS KGIRMVDEGV HIKYEREYAN ASGVSCT