Gene Information |
Locus tag | PHATRDRAFT_44967 |
Symbol | |
ID | 7199495 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | + |
Start bp | 836184 |
End bp | 838030 |
Gene Length | 1847 bp |
Protein Length | 512 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178855 |
Protein GI | 219116120 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATTCTGCAAC GAGGCTTGTC GCTTCCTTGC ACAATCAGCG CAACAGCATA TCTCTAATTA TTTTGACGTG AAGACAACAA AAAGATGAAG GTGGAAACGG AAGAACATCA CGTGGAGCAG GAGAAAGGAG AGGAGACACC CGCGTTTTCG ACTTCCAAAA AAGTTCGATT TGATGCCGAG GTAAAACCTG GACTTTCGTC ATCAACATCG AGCAACTCGA AGCCAATCAC CCCGCAACGA CAAGTCGATT CGCGTCGTAT CAAATTTGGT TCAGGGAAAG GCCATCCGTG TCGAGACCTA TTGACGTTAC TCAGTATTGT AGGTATTTTG CTGGTCGTCA CGATGGATTT CTCATCGGAC ACGCCCGAAT TTGACCCGGA AGAGCGCACT CCGAAGGAAA AGGCGGCATC CCGACTAAAT AAAGTCAAGG AATCTTTAGA ACGGAGTTTG CAGGACCACA TTGAAGCTAC TAGACGAATG GCAAGCTGCG ATATTTTTAT CTCGTCTAGC TCTATACCAG GGACCGGAAA CGGCCTATTT GCTGGGCGAG CATATCAGGA AGGCGAGACT GTTCTATTGG ATACGAAGGC ATTCGGACAC CCGATCTTGG GGCTACCGTC GAATGCGGCA CTATCTCAAT ACGCTTTTCT TGTCAAGCAT CATCCAACTC TAGTGAATTT AGAAGGTTCT TTGTTCCATG TGGACTTCGA CCATTTCAAA GCCGGTGGAC AAACGTTCCA ATTCAAGGCA ACACAGCAAA TTAAAGCCGG CGAGGAGCTC TTTGTATCCT TTGAAAATCA TCCACAAAGC TCAAGTTTAT TTCCTTCGAA GGAGACACTC CCGTTTTTGC CTTACATCCC AACTTATAAC GATTACAACA CAATCGACTC GATCCTCGAC GATGTTAAAT CGACGGCCCG TCGCATGAGC ATTTCGCACA GTCGGCATCG TCGGAACATG CTGATCGACA GCAGCCACAT TCTCAAACTA GCCCGCGGAA TCGTTGGCCG CTTACACGAA TCGTTGGCCG CTCTCATCCC GTCGACTGGA GACGCTTGGG GAAAGCGTCC AGACAATACT CCTTCCGTCT GGACTGCTCT GGAAAATCGG ACTCTGGTGT CGCTGCAACT GAGTGGATTC TGCTTGACTG ATATGAATCA AGAAAAAGAT GGTACCCTAT TCGTCACTAG AAACGTCTCG AAAGGAGAAA TTGTTACCGT AGCACCTCTG CACGTTGCCA CTCATGCAGC ACTGAAAGCA GCAACGGAAA AGTCTGCCGC GGAAGAGGCT TGGTGCCAAA CTTCCGCAAG TGACCGTTGT TTTGGAAAAG CTGGCTCTTC ACTAGTTTTA TGTCCCATCA CGAATGCAGC TTTCATATCG ATCAGCGTAG ACGTTGAGAA CGTTGAGTTC CAGTGGAGCA TGAATCAAGT GCCTTCCCAG TCCGCTGATC AGGCCATATC AAGTCCTGCA GGTACCCTTT CCTGGAATAT ATTGGCGCTA CGAGATCTAC ATGCAGGTGA AAAGGTAAGA AAGATAGATG TCGTTTCAAG CGAATCTCTT CAGAGTATGC AAAGCCCTCA CGTCTTTGCT CTCTAGTTGG TCGTGAAAGG CTCCAAACCA AGTGGCTTCG ATTTTCGAGT TTTGGAAAAG ACGGATTTAT TCCCATGGTA TACCGCAGAA ACCGCAAATG CGTAGATTAT CCGAAAATTA CAATTTGCTT CCTCCGTTTG TGTATATTGC TCATTTGAGA AACGATTTTC TGTTGTTGAG CTACTAGAGT CGGTTGACTC GGGAGCCAGC 
GATCGTTAGC AATTTTCACT TTTTAAACAC CATGATTCTC AAAGAGG
|
Protein sequence | MKVETEEHHV EQEKGEETPA FSTSKKVRFD AEVKPGLSSS TSSNSKPITP QRQVDSRRIK FGSGKGHPCR DLLTLLSIVG ILLVVTMDFS SDTPEFDPEE RTPKEKAASR LNKVKESLER SLQDHIEATR RMASCDIFIS SSSIPGTGNG LFAGRAYQEG ETVLLDTKAF GHPILGLPSN AALSQYAFLV KHHPTLVNLE GSLFHVDFDH FKAGGQTFQF KATQQIKAGE ELFVSFENHP QSSSLFPSKE TLPFLPYIPT YNDYNTIDSI LDDVKSTARR MSISHSRHRR NMLIDSSHIL KLARGIVGRL HESLAALIPS TGDAWGKRPD NTPSVWTALE NRTLVSLQLS GFCLTDMNQE KDGTLFVTRN VSKGEIVTVA PLHVATHAAL KAATEKSAAE EAWCQTSASD RCFGKAGSSL VLCPITNAAF ISISVDVENV EFQWSMNQVP SQSADQAISS PAGTLSWNIL ALRDLHAGEK LVVKGSKPSG FDFRVLEKTD LFPWYTAETA NA
|
| |
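The derived fields in the record above (Gene Length, GC content) can be cross-checked against the coordinates and the sequence. The following is a minimal sketch, not the database's own method. It assumes the Start bp and End bp values are 1-based and inclusive, which is consistent with 838030 - 836184 + 1 = 1847 matching the listed gene length. The function names `gene_length`, `clean_sequence`, and `gc_percent` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def clean_sequence(grouped: str) -> str:
    """Drop the whitespace that splits the sequence into 10-letter groups."""
    return "".join(grouped.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # Coordinates taken from the Start bp / End bp fields of the record.
    print(gene_length(836184, 838030))  # 1847, matching the Gene Length field

    # Only the first two groups of the gene sequence, as a short illustration;
    # paste the full "Gene sequence" field to compare with the GC content field.
    first_groups = "ATTCTGCAAC GAGGCTTGTC"
    print(gc_percent(first_groups))
```

Passing the complete Gene sequence field to `gc_percent` should give a value that can be compared with the listed 47%, bearing in mind that the record may round the figure.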