Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_40994 |
Symbol | |
ID | 7198829 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011695 |
Strand | - |
Start bp | 87122 |
End bp | 88665 |
Gene Length | 1544 bp |
Protein Length | 463 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185045 |
Protein GI | 219129752 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.236316 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATAGAG TAGCGAATCA AGGAGGCTCG CCTGACGGTA AGTGCGGCGA GAGAGGCCAA CGCCACGATA TAGCATCGTT TTTGAGGCAA AAAAACGCCG ACTAGTACTT AAACAATCAA TTCTTTACGA AGCGCGCGTT GGGATCAAGC GACCAAGAAC TCAAGGGAAA AATACAAATG ACCGATTTGT CGATTCACTC AGAGCGGCCA GCAGTGTTAA ATTGCTCGAA AGCCCAAAGA CTGAAGCGAC TCAAATCCTA AATGAGAAAG CCTCTTTTGC AACAGCAGAA AGTGATGCAA TTCACAATGA CGGGACACGC GAAAACAATG CTCCCTCCAA ACTCTTCAAA GATGGGAAAG TGAGTATCGT ACAATACAAG ACATCGAAGC AAAACCTTGA ATCATCAACT TTGGGGAAGC ATCTCGACGT AGCCTATCGG ACCTTTCCCG AGAAACTCAT GGAGTTGCTT GAGACCGAAG CCTTGAAGGA CGTGATGTGG TGGCTAACAG ATGGAGAAGG GTTCGCACTA GCACCGCAGC TGTTTTCCAG CAAAGTCTTG TCTCCATACT TTTATGGAAC AAAGTTCGAA AGTTTCACCA GGAAGCTGAA TCGATGGTAC GTATCGACCC CGACGTCAGA CGATGCACTC CTTACCTTTT CAAACCTAAA AGTTTGTCTC TCTCATTGGC CCTTTCAAAG GGGGTTCAAG CGCGTTGCAG GACAGAGAGT GCCACCAAAT GCTATTGCAT ACCACCATGT ACTATTCAAA AAGGGTAGAT TAGATGAGCT GAAGGGTATA AGGAGTGGAC GGAAAAGCCA AGCACCTGCC ATAACACGAC ATGGAAATTC AGTATATGCC GAAATGCGCC ACAATGCACA GTTCGCAGGT GTTTTGCCGC CGGTAGCTAC GATAGAAGCC GGATTACCTG TAAGGCTCCA ATTGTCATCA AATTTTTTCG GCGTTCCATG CAGTAATTTC CTGCCTCACT CTAGTGGAGC GACTTTTTCA GAGCGACAGG TGATGCGCCT TTTCAGGCTT CAACACATTC AAGAGGAAAA CACGTCCAGT TCAGATGTTC AATTGCAACG GTCACTTGAT TTGAGATCGT TCAGTAGCAG CACAGCTATA CAGCAGATGC TGGAGTCGCA GCAAAGTGGC CGTCTAGCGA TGGAGAGACA GCTGCAAATA TTCCAGGCCA ATGAAGCTGG TAGACAGCAT GCCGCCCGCC AGGCGTTGTT CCAACGTCAC GCGGCTGAGC AGGATCATTT GCAGCATTAT CACCATCAAA CACAGATGCA ACTACAGAAT TTAGTGAATA TATCATCTGA TGCTAGATCA CTTCTCTTGG CTAGCCTCGG TTACTCCGCT AGCCAGCACC AAAACATTGC TCCAGATATG AGGACGTTAC AAGGCGTTGG ACGGGATCAG ACATCCTTCC CTCTAGCTTC ACTGGAAAGT CTGCGAGAAA CTGATCCTGT GCTCTACCAA ATGATTGTAC TCAAAGAGCA GGAGCAGCTC AACCAGAGGC TGCGACGTCC TTGA
|
Protein sequence | MDRVANQGGS PDVLKQSILY EARVGIKRPR TQGKNTNDRF VDSLRAASSV KLLESPKTEA TQILNEKASF ATAESDAIHN DGTRENNAPS KLFKDGKVSI VQYKTSKQNL ESSTLGKHLD VAYRTFPEKL MELLETEALK DVMWWLTDGE GFALAPQLFS SKVLSPYFYG TKFESFTRKL NRWGFKRVAG QRVPPNAIAY HHVLFKKGRL DELKGIRSGR KSQAPAITRH GNSVYAEMRH NAQFAGVLPP VATIEAGLPV RLQLSSNFFG VPCSNFLPHS SGATFSERQV MRLFRLQHIQ EENTSSSDVQ LQRSLDLRSF SSSTAIQQML ESQQSGRLAM ERQLQIFQAN EAGRQHAARQ ALFQRHAAEQ DHLQHYHHQT QMQLQNLVNI SSDARSLLLA SLGYSASQHQ NIAPDMRTLQ GVGRDQTSFP LASLESLRET DPVLYQMIVL KEQEQLNQRL RRP
|
| |