Gene Information |
Locus tag | PHATR_50770 |
Symbol | |
ID | 7203857 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | + |
Start bp | 903287 |
End bp | 905335 |
Gene Length | 2049 bp |
Protein Length | 549 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186151 |
Protein GI | 219113135 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAATGTAG AATATAGAAT CGCGAACGGG ATTTGTATGC GATTTCGTTT CGAACGTTCC ATACCATTGT CTGGCATGGA CCCGCTGCCA GACAGTATGA GGTGTCTCGT ACTCCTCATG ACTGTGAGAA CACCTATCAC ACGCGCTTCT TGTCTGATTC ACTGTCAAGC CAAGACGATC TTTCGACCAT GAAGATTTCG ACGACGTTCC TCTGATGCAA ATCAGCTCAC TTGCTGTTAC ACCACAGTCT TTTTACTGTC ATTTCACCCT TTCTCCTGCC GTCATTGTTA CAGCATGGCG GAGACACACG AAGTGAAAGT CCAAGTATTT GACAATGTCT TAAAAGAAGA TTACTTGAAG AACTTTACGC CCATCAAGAA CAAAAAGAAG AAAAAGAAAA ACAAGGAAAA TCCCAACAAA CTTGTGGGGG CCATCGACCA AGGCACTTCT TCTACTCGGT TTGTTGTATT TACCACAAAA GGCCGGATTG CGGCATCTTC GCAAATGGAA CACACGCAGA TATTTCCGGA ATCAAACGCC GGATGGCACG AACACGACCC GGTCGAGATT TTCCGCAACA CGGCAACCTG CATTCAAGCA ACGCTGCAGG CGCTCGAACG GAAATCCTAC GCGGTATTTC TTTCCGCTGT AGGCATTACC AATCAGCGGG AAACGACTAT TGCGTGGAAT CGAGTCACCG GGGTACCTTA CTACAACGCC ATTGTCTGGG ACGATACAAG GACCACGGGT ATTGCCAAAG CGATCGCTCA GGGTAACTCG AATCGCCTGC GAGATCAGAC AGGCTTACCG TTGGCAAGTT ACTTTGCAGG TACAAAAGTC AAGTGGTTAT TGGACAACGT CGAAGCCTTA AGAAAGGACT TGGAAGACGA TGCGAAGTCT AGCGAAGTTT GTTTCGGAAC TATCGATTCC TGGTTGGTTT ATCAACTTAC TGGCACCAAA TCTAATCACG ACGGTGCAAT TAACTCGGGA GGAGTCTTTG TGACGGACGC TACCAATGCA TCCCGTTGGT TGTTTATGGA CTTGGCAGCC CGTGTGTGGG ATCAGACTTT GGTTAACGCC GTTTGTGCGC CGCATAGAGT TCCACTTTCG GCATTGCCGG AAATCTGCCC CAGTAGTCAC GTTTACGCAA CCTGTAAAGG AATCGAAGTA GGTGTTCCCG GTCTCGACAA AGTTCCCTTG GCAGCCATCT TGGGAGATCA ACAGGCCGCT CTCTTTGGGC AGACTGCCTT TGCTCCAGGA GAAGCCAAGA ATACCTATGG TACCGGTTTG TTCCTCATGA TGAATACCGG GACTAAAATC GTTCCGTCTA AGCATGGTCT CTTGACCACT GTCGCTTACC AAATTGGGCA AAATGCCCCC GTTCAATACG CTCTTGAAGG CAGCGTCTCC CATTCGGGTA GCACTATTCA ATGGCTCCGC GATCAGCTTC AAATCATAAA AGATGCGCCA GAATCTGAAA CCATGGCGAG AACGTGTGAC TCCAACCAAG GCTTGTATTT TGTACCGGCA TTTTCGGGAC TCTTCGCCCC GCATTGGCGT TCAGACGCTC GCGCGTGCAT TGTGGGAATG ACGGCTTCCC ACCACAGGGG GCACGTGTGT CGGGCCGCCC TCGAAGCTGC AGCGTACCAA ACTCACGAAG TCTTCGCCGC GATCGAAGCA GATTCTAACG TGACGTTGCG GACACTCAAT GTGGATGGGG GCGGAACACA CAATCAATTA TTGATGCAGT TCCAAGCGGA TATTATTGGT GTACCAGTGG TCAAACCTGC CGTTATGGAA 
ACCACATCAA TGGGTGCTGC GTTTGCCGCC GGTTTGGCAG TGGGAATCTG GCAGGATCAA CAGGAAATCA AAGAATTATG GACTGCAGCT CAAACATTCA ATCCAAAAAT GGATGTTGAA GAACGAGACT CTTCCCTCGC GGGCTGGAGA AAGGCAGTTT CCAAAAGCCT CGATTGGGTA GGTGAAGACC AGGACGCAAA GAAAGAAGAT ATCTGGTTGC TATGTCCGAT GAATCGGAAC ATATTTTAA
|
Protein sequence | MNENPNKLVG AIDQGTSSTR FVVFTTKGRI AASSQMEHTQ IFPESNAGWH EHDPVEIFRN TATCIQATLQ ALERKSYAVF LSAVGITNQR ETTIAWNRVT GVPYYNAIVW DDTRTTGIAK AIAQGNSNRL RDQTGLPLAS YFAGTKVKWL LDNVEALRKD LEDDAKSSEV CFGTIDSWLV YQLTGTKSNH DGAINSGGVF VTDATNASRW LFMDLAARVW DQTLVNAVCA PHRVPLSALP EICPSSHVYA TCKGIEVGVP GLDKVPLAAI LGDQQAALFG QTAFAPGEAK NTYGTGLFLM MNTGTKIVPS KHGLLTTVAY QIGQNAPVQY ALEGSVSHSG STIQWLRDQL QIIKDAPESE TMARTCDSNQ GLYFVPAFSG LFAPHWRSDA RACIVGMTAS HHRGHVCRAA LEAAAYQTHE VFAAIEADSN VTLRTLNVDG GGTHNQLLMQ FQADIIGVPV VKPAVMETTS MGAAFAAGLA VGIWQDQQEI KELWTAAQTF NPKMDVEERD SSLAGWRKAV SKSLDWVGED QDAKKEDIWL LCPMNRNIF
|
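The length and composition fields in this record can be cross-checked against the coordinates and the sequences. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (which agrees with the listed Gene Length) and that the sequences are shown in space-separated blocks of ten. The function names `span_length`, `clean_sequence`, and `gc_percent` are hypothetical helpers, not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def clean_sequence(blocked: str) -> str:
    """Strip the spaces and line breaks used to display a sequence in blocks."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Coordinates taken from the Start bp and End bp fields of this record.
print(span_length(903287, 905335))  # 2049, matching the Gene Length field
```

To check the GC content or Protein Length fields, pass the Gene sequence or Protein sequence text through `clean_sequence` first, then call `gc_percent` or `len` on the result. The 2049 bp span is longer than the 1650 bp (3 × (549 + 1)) needed to encode 549 residues plus a stop codon, which is consistent with the record marking the gene as spliced.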