Gene Information |
Locus tag | PHATRDRAFT_41506 |
Symbol | |
ID | 7199373 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011699 |
Strand | + |
Start bp | 23023 |
End bp | 24381 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185473 |
Protein GI | 219130650 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.000024093 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTACTG ACGGTGCTAA TGCGTTTCGT GGTGCGCTCG GGTGGATCGG ATGGTCTGTT CCTGCGGCGA ACGCGTTTAC GAACGAAGGT TTTGATGCGA TGGAATCCCT TGGCTTGGTT ACCTGTGACC GTCTTAAGGA TATCTGCAAG ATCATTCGTC GTGGTACCGA TGGCGTGGCC GCAGTGCCAG CTGCTGGTGG AAATGCTGCG GTGGCGGCGG TGCCTGGCAT CCCTGGGATA GCGATCCCCA TGATGTGGGA GTACAAGCTA AGCGGAATGC ATCTCTGGGT GTCTGAGCGT CTCCGACAGG GGACTCCGGT TGTTGCGGCG GACTTTACTG CGGCCATTGG AAACCTGTAC ACCAGGAAAG TGCGTAAATT GGAAGAAGCG AAGGATGACG AGGATGTCCA GGTCAAGCCC CCGGCTCCGT TCTCGAAAGA AACGAAGTGG ATTTCGTTCT TCAAGTTGCT GGTCAATTAT TTGAGCTCCG TGACGGGTGT CAACAAAGTG CCATTGGATT ATGTTGTCCG GAAGGATGAC AACATTGCTG CCCCCAATGC CGAGTTTGAG ACGGAGCACA AGAAGTTGGT GTTGTTGACT CCCCATTTGG GGACGGCTTT CGACAAAGAC AACGGCAAAG TTTGGATCCA GGTGAAGCAA TTGACTGTGA ACGGTCCGGC CTGGACCTAC GTTGCTCCTT TCGAGAAGAA ACGCGACGGT CATGGAGCGG TCAAGGCTTT GAACAGCCAC TATGAAGGAG ATGCGGTGAT GTCAAAATCG AAGGCGGCTG CCTTTAATGT GCTTGAGCAC ACCACCTACA CTGGAGAACG TCGAAATTTC GGTATGGAAC GGTACACGAA CGCCTTGTCC ACGGCATTCC AGACCTTGGA CAAGTACGGA GAGACCTTGA CGGAGTCAAG AAATGTGGAT GTGTTCCTCC GCAACAATCA CTGTACGGAT CCCAAGATGC TCTCAGGAAT TGCGGTAATT CAGGGAGACG CGGATTGGAT GTCCAATTTT GCCAAGGTGG CCGACCATTT GGCCTTGTTT ACTAACACCG ATAACTCTCA AAAGACAGGT TGTTTGATCT CAAGTGCTCA GCGGACTAGT AACAACAAGA AGAAGCCGGG TATCCGAGCG GGCAATTATA ACCCAAATGA ATGGCATCAA CTCTCGGACA AGGCAAAGGA CGAAGTTAGA GCCAAGCGAG TGGCCGCCAA GTCCTCTCGC AATAAAAATA AGCGCTCGGC AGCAGCAATC ACTTGTTCGA GCAAGAAACC TGACAAAAAT CAGTTTGCTC TCCCGAATAA GAAGAAAAAA AGGAAGACTG TTGGTTTTGA AGGCGAAACG AGCAATTGA
|
Protein sequence | MATDGANAFR GALGWIGWSV PAANAFTNEG FDAMESLGLV TCDRLKDICK IIRRGTDGVA AVPAAGGNAA VAAVPGIPGI AIPMMWEYKL SGMHLWVSER LRQGTPVVAA DFTAAIGNLY TRKVRKLEEA KDDEDVQVKP PAPFSKETKW ISFFKLLVNY LSSVTGVNKV PLDYVVRKDD NIAAPNAEFE TEHKKLVLLT PHLGTAFDKD NGKVWIQVKQ LTVNGPAWTY VAPFEKKRDG HGAVKALNSH YEGDAVMSKS KAAAFNVLEH TTYTGERRNF GMERYTNALS TAFQTLDKYG ETLTESRNVD VFLRNNHCTD PKMLSGIAVI QGDADWMSNF AKVADHLALF TNTDNSQKTG CLISSAQRTS NNKKKPGIRA GNYNPNEWHQ LSDKAKDEVR AKRVAAKSSR NKNKRSAAAI TCSSKKPDKN QFALPNKKKK RKTVGFEGET SN
|
| |
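The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene sequence includes the terminal stop codon (the sequence shown ends in TGA). Both are assumptions rather than facts stated in the record, although they agree with the listed values. The function names are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


def strip_blocks(seq: str) -> str:
    """Remove the whitespace separating the 10-character display blocks."""
    return "".join(seq.split())


length = gene_length(23023, 24381)
print(length)                           # 1359, matches "Gene Length"
print(expected_protein_length(length))  # 452, matches "Protein Length"
```

Passing the Gene sequence and Protein sequence fields through strip_blocks and taking len() of the results would give the same kind of check against the displayed sequences themselves.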