Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_37131 |
Symbol | |
ID | 7202117 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | - |
Start bp | 231085 |
End bp | 232951 |
Gene Length | 1867 bp |
Protein Length | 577 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181332 |
Protein GI | 219121977 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTCCCG GCATTATCCT TCTGCGGGAA GGAACCGATA CATCCCAGGT AAGAAGTTTG TCGGTTTCGC TGTCGGATTC ACAATACGTC ATATCACAGC ACACCATCCC GTACTCGTGG ACTTTTCAAC CTACTCACAT TTCCTCTGCA TTTACATTGC TTGAATATAT ATATCTAACA GGGAACACCA CAGCTCATTT CGAACATTAA TGCCTGCCAG GCAGTGGCGG ATACGGTGCG GACGACGTTG GGCCCTTCGG GGCGGGACAA GCTCATCGCG ACGGGTCGTC ACGTGACCAT CAGCAACGAC GGCGCCACGA TTATGAAACT CCTCGAGATT GAACATCCCG CCGCCAAGAC ACTCGTGGAC ATTTCCATGA GTCAGGACGC CGAAGTCGGC GACGGTACCA CCAGCGTCGT CCTCCTCGCC GCCGAAATAC TCGCCAAAAT GAAACCCTTC GTCGAAGAAG GCGTCCATCC GCAAATCCTC CAACGCAATA TACGCAACGC GGGGAAAATG GCCGTGGAGA AAGTACAGGA ACTCGCCGTA CCGTTTCAGG GAGACGAGCT GGAGGATATG CTGCTCAAAA CGGCACGTAC CGCTCTCAAT TCCAAGCTAA TTGCCAACCA TAAAGATCTC TTTGCACCAA TGGTAGTGGA AGCGGTACAA GCATTGCATC AAGGAGGCTC TCTGGACGAT CTATCAAGCT TGGTCGCTAT CAAACAAATA TCCGGTGGGG ATGTCCGGCA ATCCTTTCTC GTCAACGGCG TGGCTTTCAA AAAAACCTTT TCCTACGCTG GTTTTGAACA AATGACCAAA CAATTCACTA ATCCAGGGAT TCTATTGCTC AACGTCGAAT TGGAACTCAA GTCCGAAAAA GAAAACGCCG AGGTACGCAT CACAGACCCA TCCCAGTATC AGAGCATCGT CGACGCCGAA TGGAAGGTAA TCTACGACAA GTTGGACGCC TGTGTAGATT CCGGTGCCCA GATCGTCCTC AGTAAATTGC CCATTGGTGA TTTAGCGACG CAATACTTTG CCGATCGGGG ACTCTTTTGT GCGGGCCGTG TGACAGATGG AGACCTGAAG CGTGTGGCCA AGGCCACGGG TGGTAGCGTC CAAACCAGCA CTCACGGTAT CACCAAGGAC ATGTTGGGGA CGTGTGGGGT CTTTGAGGAG CGCCAGGTTG GTGACGAGCG CTTCAACGTC TTTACAGACT GCCCCCAAAA GCTAACGTCC ACCATTGTTT TGCGCGGAGG AACCGAACAA TTCATTGCCG AGTCCGAACG GAGTGTGCAC GACGCCTTGA TGGTCGTCAA GCGATCGCTC CAGTCGGGAT CGGTCGTAGC CGGTGGCGGT GCCGTCGAAA TGGAAGTCTC GCGTTGTCTG CGCGAGCACG CTCTGACCAT TGAAGGAAAA GGACAGCTTA TCATTACAGC CTACGCCAAG GCGTTGGAAG TCATTCCTCG TCAATTGTGC GAGAATGCGG GGTACGACTC AACCGATATT CTGGCTGCAT TGCGAAGAAA ACATGCCGTC GACGCGGACG GAAAGTGGTA CGGAGTCGAT GTCATTAACG GTCATATTTG CGATACTTTT GATTTGGGCG TATGGGAACC GAGCGACAAC AAGGTAAATT CGTTCGACGC TGCCACGGAA GCAGCGTGTG TGATTCTGTC CATTGACGAA ACTGTCATGG CGCCCAAGTC ACAGGACCCC AACGCTCACC ATACGGGTCA AATGGACCAG GGTAATAAAC CAATGAGTAA TATGATGGGA GGCGCCATGC AGGCCGCCCA AGGAGGCGCT CGGTCGGGTC AACTTGGGCC CGGAGTCAGC TACATGAAAG GCCGGGGAGG CGGTTGA
|
Protein sequence | MRPGIILLRE GTDTSQGTPQ LISNINACQA VADTVRTTLG PSGRDKLIAT GRHVTISNDG ATIMKLLEIE HPAAKTLVDI SMSQDAEVGD GTTSVVLLAA EILAKMKPFV EEGVHPQILQ RNIRNAGKMA VEKVQELAVP FQGDELEDML LKTARTALNS KLIANHKDLF APMVVEAVQA LHQGGSLDDL SSLVAIKQIS GGDVRQSFLV NGVAFKKTFS YAGFEQMTKQ FTNPGILLLN VELELKSEKE NAEVRITDPS QYQSIVDAEW KVIYDKLDAC VDSGAQIVLS KLPIGDLATQ YFADRGLFCA GRVTDGDLKR VAKATGGSVQ TSTHGITKDM LGTCGVFEER QVGDERFNVF TDCPQKLTST IVLRGGTEQF IAESERSVHD ALMVVKRSLQ SGSVVAGGGA VEMEVSRCLR EHALTIEGKG QLIITAYAKA LEVIPRQLCE NAGYDSTDIL AALRRKHAVD ADGKWYGVDV INGHICDTFD LGVWEPSDNK VNSFDAATEA ACVILSIDET VMAPKSQDPN AHHTGQMDQG NKPMSNMMGG AMQAAQGGAR SGQLGPGVSY MKGRGGG
|
| |