Gene Information |
Locus tag | PHATRDRAFT_48302 |
Symbol | |
ID | 7203782 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011685 |
Strand | + |
Start bp | 69287 |
End bp | 71095 |
Gene Length | 1809 bp |
Protein Length | 549 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182763 |
Protein GI | 219124969 |
COG category | |
COG ID | |
TIGRFAM ID | |
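The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state its coordinate convention) and that the protein length excludes the stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above; the names are illustrative only.
start_bp = 69287
end_bp = 71095
protein_length_aa = 549

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Coding bases implied by the protein: three per residue plus one stop codon.
coding_bp = (protein_length_aa + 1) * 3

# The remainder would be non-coding sequence flanking the reading frame.
noncoding_bp = gene_length_bp - coding_bp

print(gene_length_bp, coding_bp, noncoding_bp)
```

Under these assumptions the computed gene length matches the listed 1809 bp, and the listed gene is longer than the reading frame implied by the 549 aa protein.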
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | GCGTTGTACA TTTCACAGGT AGACTTACTT TATCATTACT GCTAGTTGAC ACTCCAATTG CATGCGCGCC GTTGTTGGGA CCTCCAGTGA CATCGCGTAC CAATCCTATT TGCAACGGCA GCCGTCTCCC ATGTCGGTCA AGGCGATGCG TTCCCACGTA CCGACGAGAC GCTCAAGGCA GCTATGGATA CTGTTCCTGC TCGGTCACGC TTATAGCCTT CGTGCGTTGA CGCCCCACAG TAAAGCCAAC ACGCACAGCA AGAAACCCAT GTTCCAACAC GCCTTGGCTA TTCTCACCAT GCCCAACAAC AGTGTGGATC GCATTATCAA CGAAGCTATT CTGGAAAAAG CCTTGCCCTC GGCACACAAA CTTTCCGTTG TTTTGCGTTG TCAAGGAACG AACAAGCGCA GTGGAACTTC GACGTCGCCC TCGTTGGCAA CGTTGCGTCG CTACGTTGGC GAAGTATATT CACAAATGTG GGATCTCGCC ATGCACAGTC CACAACTCCT GGCGCAGCAG CAGTCGCAAC AACAACAATC ACTCGACGAG AGTTCCCGCA ACGCCGCCTC TCCCACAGAC TTCTTTGTCC TCCCCGACGT TGTGGTCTAC CCCCAAAATT TGCCCAACGC GGCACCGGAA AGCTGGATTC ACATACAGAA AGATCTCGAC CTCGTCTGCA GTGTGGATTC CATGGCTGGA TGGATTTCCA CACAAGCAAC GGGACGGGGA GAACGCTACC AACGCATGCA TGGGGATGGG CTGGGTGGCC TCGACGAGCA CGTGCAAGCC ATGAACAGTG AACGGGCCTT TCGAAATCTT GCACCCGTGC AAATACTGCA CGTAAATCCC TCGGACTGTA TGACCAACGC CGTCGTGGAC CCGAACGTGG TCTTTTTGGA CGATGACGAA GAGACCGAAC GGTTACCGAA GGTACAACAA CAACAACATC GAAAAGGACA ATCAAAAAGT ATGGACCAGG GCGTGACGTG CAACGGCGGC GCTGATGACG ACGAAGAATG TGATTTGATT CTCGGTGGAG CTCGCATAAC GCAAGGACGA TTATTCGATT CCGTTGCGGT CGGCGGCACG TTCGATGGCT TGCATTTTGG CCACCGCAAA CTGTTAACCC TCGCCATGTC GTCGGTACAT CCCGTCACGG GCTTGCTACT CGTTGGAGTC ACGGTCGATG ACATGCTCCG ACGCAAACGC TTCGCGGAAT ATATCCCTTC GCTGCAAGCC CGTATGGAAG GCGTCCAGGA CTTCTTGCAC CGTCTCGCTC CGGGCATGAA GAACAACATC CGTATCGTCC CGATTCGAGA CGCCTTTGGT CCACCAGGAC AACCGGGTTG GCATTTTGAC GCCCTTGTAT TGAGTCATGA AACTCTCGAG ACGGGGTACG CGCTCAACGA GCACCGCATC GAACAGGGCA TGCATCCACT GACATTGTTG TGTACCCGAC GAACCGAAGC ACACGGCATG AGTAGCACGG CCTTACGACG CCGTCGCTCG TTGCAAACGA GCGCAACGGC GTCCAGCGCT CCCACACGAA ACGCGGCGGT CCTAAGACAG CAGCAGCAGC AGCAACCAAC AAGGGATCAA AACAACACTT CAGCAGCAGC CGGATCCAGC AACGGAGCGA AGTCTCATCC CCATTCCACG TCCGTCAACG GACACGACTC TGCGGGGAGA AGGTCGGCCA ATCCGCTTTA ACGCAGCCAC GGCACATTGA ACCGGTTTGT ACCATTTTTT CTTTTTACAC TACGTATTTG TTAATTCTAC CTGAGTTACA TAAGGATACA 
CTTCACTCT |
Protein sequence | MRAVVGTSSD IAYQSYLQRQ PSPMSVKAMR SHVPTRRSRQ LWILFLLGHA YSLRALTPHS KANTHSKKPM FQHALAILTM PNNSVDRIIN EAILEKALPS AHKLSVVLRC QGTNKRSGTS TSPSLATLRR YVGEVYSQMW DLAMHSPQLL AQQQSQQQQS LDESSRNAAS PTDFFVLPDV VVYPQNLPNA APESWIHIQK DLDLVCSVDS MAGWISTQAT GRGERYQRMH GDGLGGLDEH VQAMNSERAF RNLAPVQILH VNPSDCMTNA VVDPNVVFLD DDEETERLPK VQQQQHRKGQ SKSMDQGVTC NGGADDDEEC DLILGGARIT QGRLFDSVAV GGTFDGLHFG HRKLLTLAMS SVHPVTGLLL VGVTVDDMLR RKRFAEYIPS LQARMEGVQD FLHRLAPGMK NNIRIVPIRD AFGPPGQPGW HFDALVLSHE TLETGYALNE HRIEQGMHPL TLLCTRRTEA HGMSSTALRR RRSLQTSATA SSAPTRNAAV LRQQQQQQPT RDQNNTSAAA GSSNGAKSHP HSTSVNGHDS AGRRSANPL
|