Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49978 |
Symbol | |
ID | 7198561 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011693 |
Strand | + |
Start bp | 486539 |
End bp | 488639 |
Gene Length | 2101 bp |
Protein Length | 526 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184715 |
Protein GI | 219129058 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAACATTTGA CTGGATTTAC ATCTCAAATT ATAATTCGGA TTTCTTCATC TTCCCGAACC GTTTGCCGAG AGGAAGATTG GTGTTCGCAG CTGCTCAGAA AGTCAAGCAG CTGATTGCTC CGTCTTGTGT CCGGAGTGAT TGACTGCGAG TCTTAATCCG CTATACACGT TCAGGCAAGA GCTCTTGAGC GCAATTTCTC GGTCCCGTTA CATTGTGTTT TCAGCATGAA CGATCTACGC TGGCTGCAAG ACACGGCAGG AGAGGCAAAT CAGCAAGAGA ACGAGAGCAC TGACACTTCT GCTGAGTCGA GCAAAAGCAA TCAACTCGTC GGAGTCGTGT TATCAAATGT TTTTCTGTTC TTTCTAATTT TTGGACTGTC CGCGACTGTC GATGTCAAGA ATATGAAGCG ACAACTCACC AATAGATTCG CTATTGGCTG CGGTGTTGCA ATGCAGTTTA TTGTCATGCC ACTGCTAGGG TTTGTCGCTG TCGTTTCCCT TCGAAACCAA GGTCTTTCCG AAGCTATGGG AGTTGCGTTG TTGGTTGTTA CATCATCTCC TGGAGGATCC TACAGCAACT GGTGGTGTTC GACTTTTAAT GCGGATCTGG CTTTGAGCGT TGCCATGACA ACTGTTTCAA GCATACTCAG TATATGCTTG TTGGTACGTT CTTTTCCGAC TGGTTTTGTG GGGACCACGT AATTGCTACT CTGTTGTAAA AATTACAAGA TCCACCTTCT TACCCTAAAA CTGGTCTTCT AATGTTCCAT TTTTAGCCCC TCAATCTCTT TCTATACACC TATCTGGCCT TTGGTATCAC GGATAAGGAC CAAGAGTCTG TAGTTGAAGC TTTGGATTTT GGAACTCTCT TCATAACACT TGGAATTGTA CTCGGCGCCA TTCTATCTGG TCTCGCAGCT GGCTATCGTT GGGACAACGC CACCTTTCAC GTCTATGCCA ATCGGTTTGG TACTATTTCG GGCATGTTAC TCATCTTGTT CTCGGTTTTC TTTTCTTCGG GGGCCGATGG GGCTGAGTCT AATTTTTGGA GTCAACCTTG GGCCTTTTAT TGGGAGTTGC CTTTCCTTGT TTGCTCGGTA TCGCCCTCGC CAACATTATT GCTCGATCCG TGCGTTTAAG TCCACCGGAG ACGGTTGCCA TTTCGATCGA ATGCTGCTAT CAAAATACAG GCATTGCAAC ATCAGTGGCT ATAACAATGT TTGATAATGT CGAAGAGAGG GCTCAGGCAG TTGCAGTCCC GTTATTTTAC GGTATTATCG AAGCTGTGGT AATCGGCATT TACTGCATTT GGGCCTGGAA AGTTGGCTGG ACGAAGGCCC CCAAGGATGA GAATTTATGC CTTGTCATTG CGAGGACGTA CGAAATCGAT GAAATTGCTG CAAATGACGA ACCTCAGGAT GAAGAATTCA ACGGCAAAGA ACAACTTGTA GAAGCCTCTA GCCCCATGGG TGCTCAAAAG GATAGTTGTT GCATGGAAGG ATCAGGAGAA AGGGTGATCG AGCTGCATCA AGAGAACACT GAGAGGGACC GCCTTTACAA TGAAGGTTCA GGCTTTTGGG CGAGAATATT TCCACCTATT CTGCTTCGAA AGCTCTCGTC TCTGTTGATG AATGGAGTTG AAGTTGACGA GAACTTGGAA GAGCAAGGAG ACGGCGTTGA TATCAAAGTC GAAGGAAATT TGTTGGCTCG AAGTCGATTA GGAACAGCTG AAACATCACT CTCATCGTCA GTATCCAGTC CGCCACATCG CTCTAGAACG GGCTCAAGCA TGTCCATTGA GCAAGGTTGC CTAAACACGG ACGACCCCTC AATACCATGT TTAACCGTTC CCGAAGACTC TGAACACGCC TTACCCGACC TACTACCCAT GATCTCCTCG TCAACTGAGC ACTATCTGGC TTCAGAAGAC CTTCCTTATG CAAACAGAGC CGCTAAAGAG GAATAGATGT TGTCGCGATA GCAACTTGTC ACCGAAACGG ACTCCCGGAC AAAACTACCT GAAGGTAGAC CATACTACCT AATGATACTA GAAACCATTC TAAGCAGTGA CCATAAAAAT AAAGCATCAA CAAACACGAT C
|
Protein sequence | MNDLRWLQDT AGEANQQENE STDTSAESSK SNQLVGVVLS NVFLFFLIFG LSATVDVKNM KRQLTNRFAI GCGVAMQFIV MPLLGFVAVV SLRNQGLSEA MGVALLVVTS SPGGSYSNWW CSTFNADLAL SVAMTTVSSI LSICLLPLNL FLYTYLAFGI TDKDQESVVE ALDFGTLFIT LGIVLGAILS GLAAGYRWDN ATFHVYANRF GTISGMLLIL FSSTLGLLLG VAFPCLLGIA LANIIARSVR LSPPETVAIS IECCYQNTGI ATSVAITMFD NVEERAQAVA VPLFYGIIEA VVIGIYCIWA WKVGWTKAPK DENLCLVIAR TYEIDEIAAN DEPQDEEFNG KEQLVEASSP MGAQKDSCCM EGSGERVIEL HQENTERDRL YNEGSGFWAR IFPPILLRKL SSLLMNGVEV DENLEEQGDG VDIKVEGNLL ARSRLGTAET SLSSSVSSPP HRSRTGSSMS IEQGCLNTDD PSIPCLTVPE DSEHALPDLL PMISSSTEHY LASEDLPYAN RAAKEE
|
| |