Gene Information |
Locus tag | PHATRDRAFT_50500 |
Symbol | |
ID | 7199332 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011698 |
Strand | + |
Start bp | 227400 |
End bp | 228705 |
Gene Length | 1306 bp |
Protein Length | 358 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185403 |
Protein GI | 219130503 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 0.650002 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTACTAGCAA ACGTTCCGCG CTACCGTTGT TCGGGGTGTC TCTAGCCTGC ATTGTATTGT TGCAGCATAA ACTAGCCAGG TAATATTCCG AGCTTAGAGT AATATCAATA TGCCTTGTAC GAAACCGTTG AACATTGACA CGTCCCAACC CGTTTTGGTG ACGGGTGCAA CTGGCTACGT TGCCGGCGTC TTGATCCAGC AGCTATTGGA AGCCGGCGTG ACGGTCCACG CCACGGTCCG CGATCCTTCG CAGACGGAAC GCGTACAATA CCTGCAAGAT CTCGCGGACC AGAGCGGTTC CGGAACGATC CGATTCTTTC GGGGAGATCT CTTGCAGGAA GATAGCTTTG ACGAAGGCAT GAAAGGATGC GGAATTGTCT TTCATACGGC ATCACCTTTC CAGTTGGACT ACAAGGACGC GTACCACGAT CTCGTCCAAC CAGCCGTCCG GGGAACGCAA ATTGTTTTCC ACACGGCATC TCGGACGCCG TCGGTAAAGC GGGTTGTCTT GACCTCCTCC TGTGTCGCCA TTTACACCGA CATTTCCGAA TGTGACGCTG TCAACAACAA GTCACTCAAC GAAGAGACCT GGAATCGCAC TGCCTCTCTC GATTATCAGC CCTATTCCTT GAGCAAGACG CTGGCGGAAC AAAAAGCCTG GGAGATTGCC GGAAGTCAAA CGGCCTGGAA ATTGGTAACG ATCAATCCAT CCCTAGTCTT TGGTCCGGGA GTCAAGTACC ACGAATCGTC CACTTCCTTT TCCCTGATGA AACAGCTCGG GGACGGTTCC ATGCCGCTCT GTCCCAATAT GGGCATGGGT ATGGTGGATG TTCGAGATGT GGCGGCGGCG CACATTGCGG CCGCCTATCT CCCCGAAGCC TCCGGCCGTC ACGTTTTGTC CGGACACAAT AGCAGCCTCT TGACCATGGC GCGACTTTTG AGTCCGAAAT TCCCGGATTA CCCCGTTCCG ACCAGGGCCG TGCCAAAACC GCTGCTCTGG TTGCTGGCGC CGTATTTACC GGGAGGCATG TCGCGTCGGT ACGTTTGGAA CAATATCAAT GTGGAAGCTA GCTTCGATCA TACCAAGTCG GTGAGCCAAC TAGGTATTCA ATATCGTCCC TTGGACGAAA CAGGTGCGGA CATGTTCCAA CAGCTTGTCG ATCTTGGCGT CCTCACGAAG AAGTAGCGGA GGAATAGCTG GTCGTACGCG ATCCATTCTT TGCGGAAAAG ATTTGAAATT TTAGTGTTTG ATGGAGAGCC GTAAAAAAGA CCGTAATTTT CTAGATGTAG GTTTTACTAC CTGGTT
Protein sequence | MPCTKPLNID TSQPVLVTGA TGYVAGVLIQ QLLEAGVTVH ATVRDPSQTE RVQYLQDLAD QSGSGTIRFF RGDLLQEDSF DEGMKGCGIV FHTASPFQLD YKDAYHDLVQ PAVRGTQIVF HTASRTPSVK RVVLTSSCVA IYTDISECDA VNNKSLNEET WNRTASLDYQ PYSLSKTLAE QKAWEIAGSQ TAWKLVTINP SLVFGPGVKY HESSTSFSLM KQLGDGSMPL CPNMGMGMVD VRDVAAAHIA AAYLPEASGR HVLSGHNSSL LTMARLLSPK FPDYPVPTRA VPKPLLWLLA PYLPGGMSRR YVWNNINVEA SFDHTKSVSQ LGIQYRPLDE TGADMFQQLV DLGVLTKK
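The derived fields in this record (Gene Length, GC content, Protein Length) can be cross-checked against the coordinates and sequences listed above. The following is a minimal sketch, not the database's own method. It assumes that Start bp and End bp are 1-based inclusive coordinates, which is consistent with the reported 1306 bp, and that GC content is the plain fraction of G and C bases. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def clean_sequence(grouped: str) -> str:
    """Drop the spaces that split the sequence into blocks of 10."""
    return "".join(grouped.split()).upper()


def gc_percent(grouped: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(grouped)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = seq.count("G") + seq.count("C")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # Coordinates from the Gene Information table.
    print(gene_length(227400, 228705))  # 1306

    # First two blocks of the gene sequence, for illustration only.
    sample = "TTACTAGCAA ACGTTCCGCG"
    print(len(clean_sequence(sample)))  # 20
    print(gc_percent(sample))           # 50.0
```

Passing the full Gene sequence field to `gc_percent`, or the full Protein sequence field to `clean_sequence` followed by `len`, gives values to compare with the GC content and Protein Length rows. A coding region of 358 codons plus a stop codon spans 1077 bp, so the 1306 bp gene span appears to include flanking non-coding sequence.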