Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_40744 |
Symbol | |
ID | 7198616 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011693 |
Strand | - |
Start bp | 282788 |
End bp | 284616 |
Gene Length | 1829 bp |
Protein Length | 583 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184770 |
Protein GI | 219129173 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGCGTA CTCGATATAC ATTGATAGCA TGGGCTGTCT CAACTCTGCG AGTCTCGCAG TCCTTGGCTC CACCGGACCC CGGTTTTGTG GGTGGAGGTC GAAGTTGGCA AGACGTCCAC GAGTTTCGAG CCATGCACAA TATTACGTTC AAATACGAAC CGTTACATCT ACAAAACGAA CATTGTCGCT ATTTGACTGA AGCCGAATGC CAACACGACG ATGAAGCTTA CTGGCAATCA AAATTCAGAC CCCAATCCTC TAAGGAACGA CGACTGAACC CTTCAATTGG TACTTTCCGC GCTTTGGTTA TTTTAGTACG ATTCACAGAC CACGCCAGCC GACAACTTCC CAGCCCTGCT TATTTCGATG AACTCTTTAA CGGAGCTAAG GGCTCGGTCA ATGAAGTGGG ATCTGTACGA GAGTACATGC GATTCAATAG CATGGGAAAG TATAATGTTC AGTTCGATGT ACTGGACTGG GAAAATGCGG AGAATACCGA ATCGTTCTAT GCCGAAGGGA AATCAGGCCG AGTAGGAAAT GTTCGCATTC AAGATATGTA CGGTTCGGTG TTGGACAAGT TGGACAGGGC AGGAAAAATC AATTGGTTTG ACTATGATAT TGGCGGTGAC CCTGAAAATC CTGAGTGGGG CGACGGACTG CTTGATCATA TAGTTGTTGT ACACTCGGGT TACGGAGCAG AGCAGTAAGT ATAGAGAATG GTGCGGCTTG ACACCTCAAT TCCCAACAAT TAGGCTCACG CAGCCTTCTT TACTCCATTT GTTTTGTAAG CGGTGACAAA CAGTGCCTTC CAGGTAGCTA CCTCGATCGC ATATGGTCAC AAGGATCGGC AAGTAGTAAC GGAGGATGGC GATCATTGGA TGGTAATTTG GAAATTGGTG GGCATACGAT CGCGTCAGCT TTTGCGAATC CGCGCTGCGA CAGGAACAAT GACTTTCAGC TTCTTATCGA ACCGAACACA ATGGGAGTCT TCACCCATGA ATATATGCAC GGCTTTCGAA TGATCGACCT ATACGACAAC GACGGAGACA GCGCGCCAGT TAGGCTTGGA GGGGTGGGGC ATTTCGACAT CATGTGCAAT GCCTATGGAT GGTTTCGCTC TGGCACAATA CCGGGCTATG CCAGCCCGTA CAGCAAAATG ATCGCTCAAT GGCTCTCACC TATCGAAATC ACTATGGATG GAGTGTACGC TGTCCAACCA GCAGAAATTT CCAGTCAAAT TTACATGATC AGTACACCAT ATCCAGCGGG CGAGTATTTG CTAATTGAAA ACCGGCAGCC GCTGAAATGG GATAAAGACT GGCCTGGACG AGGCATCGTA ATATATCATA TAGACGAGTT GGCTCCGCGA CAAACTGCGC GAGGATACCC AGGAGGACCT GGATGGCCAA CCGATCACTA TCAAGTCGCG GTTGTCCAAG CGGACGGCAA CTTTGACCTC GAAAAAGGTG AAAATGAAGG AGACGAGGGC GATTTCTGGA CGCGCGGCAT GACATTAGGA GCCGACACGA ACTCGCAGCC AAATACGGCA GCATACCAGA GCGGAAATCT CCGGTCAACT GGGATCTCTA TCACTATCTT GTCCGATCCT GGTTTCATCA TGAACTTTCA AGTCGAAGGG TTGGGCGGAA TGCGAGCTCC TGGGCAATTC TGGGACGACG ATGAGTCACC ACTGGCGAAC AGCGCCCCCG ATTCTATTCT ACCCGTGTCG ACCGATCCTG GCGGTGGGAC GGGCAAAACG CTGGCCTGGA TTTTCTCAAT GATTGGGGGA CTATCGCTGG TTGTCGGTCT TATCGCAATA CTACTATAG
|
Protein sequence | MVRTRYTLIA WAVSTLRVSQ SLAPPDPGFV GGGRSWQDVH EFRAMHNITF KYEPLHLQNE HCRYLTEAEC QHDDEAYWQS KFRPQSSKER RLNPSIGTFR ALVILVRFTD HASRQLPSPA YFDELFNGAK GSVNEVGSVR EYMRFNSMGK YNVQFDVLDW ENAENTESFY AEGKSGRVGN VRIQDMYGSV LDKLDRAGKI NWFDYDIGGD PENPEWGDGL LDHIVVVHSG YGAEQLTQPS LLHLFCSYLD RIWSQGSASS NGGWRSLDGN LEIGGHTIAS AFANPRCDRN NDFQLLIEPN TMGVFTHEYM HGFRMIDLYD NDGDSAPVRL GGVGHFDIMC NAYGWFRSGT IPGYASPYSK MIAQWLSPIE ITMDGVYAVQ PAEISSQIYM ISTPYPAGEY LLIENRQPLK WDKDWPGRGI VIYHIDELAP RQTARGYPGG PGWPTDHYQV AVVQADGNFD LEKGENEGDE GDFWTRGMTL GADTNSQPNT AAYQSGNLRS TGISITILSD PGFIMNFQVE GLGGMRAPGQ FWDDDESPLA NSAPDSILPV STDPGGGTGK TLAWIFSMIG GLSLVVGLIA ILL
|
| |