Gene Information |
Locus tag | PHATRDRAFT_48343 |
Symbol | |
ID | 7203804 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011685 |
Strand | + |
Start bp | 168403 |
End bp | 170080 |
Gene Length | 1678 bp |
Protein Length | 528 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182786 |
Protein GI | 219125017 |
COG category | |
COG ID | |
TIGRFAM ID | |
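The Start bp, End bp, and Gene Length fields above agree with each other if the coordinates are read as 1-based and inclusive: 170080 - 168403 + 1 = 1678. The following is a minimal sketch of that check, not the database's own method. The function names are hypothetical, and the assumption that a stop codon is counted on top of the 528 residues is mine, not stated in the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Base pairs needed to encode a protein (3 bp per codon).

    Assumption: one stop codon is added when include_stop is True.
    """
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values copied from the record above.
length = gene_length(168403, 170080)
coding = coding_length(528)
non_coding = length - coding  # bp in the gene span not accounted for by codons
print(length, coding, non_coding)
```

Under these assumptions the gene span is longer than the coding length, which fits the "Is gene spliced | Yes" field, although the record itself does not list intron positions.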
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.577972 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGAAACA GAAAACGCCC CTTGCCGTGG TCTGAGGCTA CCCAAACTGC CTTGAGTCCA TGGTGCCGCT CGAACATAGT CGGCAGCGCG ACGACTTCGG CTTTTTCCGT TGGACTAGCG GTGGGCGTCG CGCTCGGATT GGCGCTGGGT CAAGTATTAT TACAAGGAAA TCTTTCCATT TCTCAATGCT CTTCCTATCC CGTAGAAAGC TCAGCCAGCT CTTCGTTTGC TACGAACATT CCACCGTCGA TGGAACGGAT CGTCGCCGGA ATGTCGAGAA TGAGTAGAGA CAGTTTCTTG GGAACGTTCG ATGTCGGTAT GGGTAATTAC GGTTCGACAG ACGGAAACAG CCAGGTCTTG CTCTTGCATA CCTCCCAAGC ATCCCTCCCC CGTCAAACGC TAAACGGAAA AGTCGCCCCT CTGCTTTCTG TTAAAGACGC CACGGCAAAG TGCAGTATCG TCAAAGTAAT CATGACAAAC GCTAGTCCGG GAAAGGAATC TCTCGGAGAT TGCGTCGCTA TTGTTGGAAG TTGGGAGTCC AATCATGTAC ACAAGTTTGC CCCCCATAAA GGTACGAATC ACAACACGGA TGGGAGCACC CAACCGCGTA CTTGGCAGTA CACCAGCGGC CGCCGGTTCC CACCGCCTTC CGTAAACACC ACTCGCACCA GTCTGTGGGA ACTGCAAACA TACCTGGATG CCTATCCGGC TGCCGTCGAA CGGTTAAGGC CGTTGGCACA GGCAGCGGCA CTCGGGCGGG ACGGACGTGT CGCACCCTTG CTCGTTATGG TAACCAATTT CGGACAAGCA CAGCTGCTGG TTAACTTTGT GTGTTCCGCC AGGGCACGCG GGTTGGATAT TTCACGGCTG CTACTCTTTG CGACCGACCG AGAAACCTCA AAACTAGCCG AGAGTCTAGG TATTCCGGTC TTTCTGGACG AAGCAGTACG TGGATTGAGC GGATCAGCTA GTTTGGCTGC TTGACAAAAG CACGTGTTTC TCATTTCTTT TTACATCATT CTTTATCTCA ACGTAGATTT TCGGAGCGAT TCCTTCCGGC GCAGCCAAAG GCTACGAAGA TGCCAACTAT GGCAGAATCA TGATGTGCAA AGTGTATGTT GCGCACCTAA TCAGCGTACT TGGCTACGAT TTCCTCTTTC AAGATGTTGA TATTGTTTGG TACCGGAATC CCCCTCTAGA CAAATTTCGG AACAGCAATT ACGATATGAT TTTTCAGCAC GACGGACATT ATTTGCAGGA GCGCTTCCAG CCGATGATGG CCAACTCTGG CTTTTATTTT GTGCGTGCGA ATGCCCGCAC CAAATACTTC TTTGCGCTCT TCATTCGCAT GGGAGATTTA GTGCTCCAAC AGCAATCACA CCAGGCCGCG TTGAGCACGT TGCTGAACGA ACAGATGAGC TTACGAGGCC TACGTGTCAA GGTATTGTTG GAGGATGAAC TGCTGTATTT GTCGGGCTAT CACATGGAAA AAGAACCGGA GAAGTGGTCC CGGGCATTGC AAGCGTCTCC CAAACCGTAT TTGCTTCACG CCAATTGGTT GGATGGCAAT CAGAAGCGGC CAATGTTGAA TGAAACACGG AACTGGTTCC TTACTACCAT ATGCTCCGAT CGATTGTCTC AAGGTGCCAC GGACGCTTCC GAGTGTTGCA CACCATGA
|
Protein sequence | MRNRKRPLPW SEATQTALSP WCRSNIVGSA TTSAFSVGLA VGVALGLALG QVLLQGNLSI SQCSSYPVES SASSSFATNI PPSMERIVAG MSRMSRDSFL GTFDVGMGNY GSTDGNSQVL LLHTSQASLP RQTLNGKVAP LLSVKDATAK CSIVKVIMTN ASPGKESLGD CVAIVGSWES NHVHKFAPHK GTNHNTDGST QPRTWQYTSG RRFPPPSVNT TRTSLWELQT YLDAYPAAVE RLRPLAQAAA LGRDGRVAPL LVMVTNFGQA QLLVNFVCSA RARGLDISRL LLFATDRETS KLAESLGIPV FLDEAIFGAI PSGAAKGYED ANYGRIMMCK VYVAHLISVL GYDFLFQDVD IVWYRNPPLD KFRNSNYDMI FQHDGHYLQE RFQPMMANSG FYFVRANART KYFFALFIRM GDLVLQQQSH QAALSTLLNE QMSLRGLRVK VLLEDELLYL SGYHMEKEPE KWSRALQASP KPYLLHANWL DGNQKRPMLN ETRNWFLTTI CSDRLSQGAT DASECCTP
|
| |
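The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. The helper names are hypothetical, and how the source database computes its 51% figure is not stated in the record, so treat this as an illustration only.

```python
def join_blocks(grouped: str) -> str:
    """Remove the whitespace between ten-character blocks and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an ungapped nucleotide string."""
    if not seq:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two blocks of the gene sequence above, used as a small example.
example = join_blocks("ATGCGAAACA GAAAACGCCC")
print(len(example), gc_fraction(example))
```

Applying `join_blocks` to the full Gene sequence field and comparing `len(...)` with the Gene Length field is a quick way to confirm that the record was copied without truncation.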