Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49733 |
Symbol | |
ID | 7198428 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011692 |
Strand | + |
Start bp | 93625 |
End bp | 96041 |
Gene Length | 2417 bp |
Protein Length | 310 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184485 |
Protein GI | 219128576 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.597636 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GCAATAGTCG GAGGAATCCG AAACGACGCC GTCTTGGATA CGCCGAACAT TGCTTTCCTG TCTCCGTAGT ACACGCGAGC GGCCGGACTG GCCAACACTT CCACAGTCTG CGCTTGTCTC CGTCCGTATA CCGCCATCCA CGCCATACAA AAGAGAACTT CCTCCCCGTT CTTCCCGGTG CTCCAGTCTA AACCGCCTTC CATACCCACC GCCAAGTGCG GATGCCGGCC GTTGGCTTGG CGAAACTCGT TGTGAGCCGC CTGGGCCCGG TTCTGGGCAC CCCGCAGAGT TTCGTCGTCA CCAAACGGTT GATCCGGCAC GCCGCTAGCG ACGGTAAAGC TTTGAATGTC CAAGACGACG GTTGTATGCG GCTTGTCCTG GATCGCCTGT CGCAAAGCGC GTTCCACTGC CAGGACTTTG GACGGATTCG ACGACCCCAC TGCCACGCGC AGTATCGTCG TTGTTGACGA CGCCACGACC GTCGTGTCCG TGCCGTTCAT GGGACGGGAC ACGAATATGC ACCAGTGGAG AGTTACGGGT TGGTATTGCG GAGTGTCTTG CCTGCGGAAA CCGAAAAAAT CCCCGCTTAC CCTCCCAATG TGTTCCGACC GTTGACCTGG ATCTTCATTG GCCGCACTGC CGTCAGTCGG TGGGGGACCA GAGTTGGGAC AGACATCCTG TCGGAGTAGA GCCGCGATAT TTGGTATAGG AATGATACAA TATAACGGAG AATTCACTTA TCAGTCAGAC CGGAGGGTGA CGGGATTGCG ACATCGAGCC AATCCAGTAC AACAGCCCCC CATGTCGTTG TATCCCTGCA TGCGTACACA TCCATAAGAA CTTTACCATC GCATTCGTTC CGTTTCAAAA TTCTGTCCGA CTTGCCTTTT GCAAATCAAG CGTTCTTGCT ACGGTAGAAA GTGGTTTGCG CATCAGGACT AATGATGTGT GACCGAATTG ACGACTGTGA GGGAAACAAG GCTGGACCTG TTGTTCACAG TCAATCTTTT TAGAATGATG ATCTACTTAT GCAGACATCG ATGAAGGCAT TTTCACTTTT CAGTCTCTTG TGAGGTTGAC GTCGGATGTC CGTTGGCATG ACCCAGATAC CGAATGGTCG AGCTTGCTCG GAGCGCCATG CTGTGATCAA GGGTAAGGGG TTGTCTTTTT GGGCGCGTTT CTTTCGATGT CGGTACTTCC GGAAAGAAAT CTTTTGGAAT CCAGTCAAAA AAAGATACGA AACTTGATGG CATCCATTAC CGGTCGCACA CTCTCTTTAC ATTGGAAGTA AAAGCAGGCT CTTGGACGCA CCCGAGTCGA AAGACATACT CTTTCTTATC TCGTTACAAA GAACAACGCT GCATTGCTCG AAGCCCAACG GAGCTAGTAA TGACCTTGGC TAGCTTCATC TCGTCGGCGA ATCCTTCCGT CGCCGTTGCG GGTATCCGCC ACTCGGTTAC CTCCGTGGTT CCCCACGTGT TCCCCTCGGT GCTTCCAAAG ACTGCGTTGG TCGTGTCCAA GTCGTCAGCT GCGGTGGTGA ATGTCTTGCG GGCTGGTGCT GCGTTGACGG CGACGAAGGA CTACGGCGAT GTTGCCATTC AGTACTTTAC ATCGATACGG GTACCGGCAG CTCTGGTTGC CGGAAGTTCC TTGGCGGCTC TGTTTGCCCT CGTGGACGAG GCTAAAGTTG AGAGTATGGG CGAAGAAACA TCCCTGGAAC GCAAGCTGGT ACTGCTCTAT CACGTTTTGG TACTTTCTTC CTTTCTGCTT TCCATCAACG TCATTATTGT ATCGACGGCC ATGAGTAACG TGTTCCTACT AGGGGTCAAC AACCCGATGG CCACGTCAAC ATACGCGTTG CTCAAACGAG AATACGATTA CGAATTTGCC TTGACTAGCT GGTCTTTTAT GGTGCGTAAG ACGAGCCTGT GGTACGGCTG GCGACTACAT AGGCCTTTGC CCTTTTGACT GACCTTGCGT TTCGTTTCGT CATCGCGACC GACTAGACGA GCGTCTTTTC CTTTTTGAGT GGAGTCGCCG TCCGGGGTCT GATTCAATTC GAAATGTTCA CCAAACGAAG AAAGCGACAC GCTCTCTTGG CTATTTGCTC GGTCTCTTCC ATGGTGTTGC ATCTCATGGG ATTCGTGAAT CGGCGTTTAC CGCACTGGCC AAATATGTTT CGTATGACCG TGGATGTCAC GACCATGTTC GTACAGCGTT CCATACAAGG AGGAGCACCG TGTGAGCTTG CTTCGGTAAT ACTCATGTTG GGAGGCGTTG TGGCAGGCAT CTCGCTCTTT GTTGTACCGA CCCAGTACAC AGGGACGGCT CAGGATTACA AAACAATCAA CGACGAAGAA ACTCCGAAAA AGAAGAGTGC GATCCAAAAA TCACAGAGTA AACCAGCAAA AGCTTGA
|
Protein sequence | MTLASFISSA NPSVAVAGIR HSVTSVVPHV FPSVLPKTAL VVSKSSAAVV NVLRAGAALT ATKDYGDVAI QYFTSIRVPA ALVAGSSLAA LFALVDEAKV ESMGEETSLE RKLVLLYHVL VLSSFLLSIN VIIVSTAMSN VFLLGVNNPM ATSTYALLKR EYDYEFALTS WSFMTSVFSF LSGVAVRGLI QFEMFTKRRK RHALLAICSV SSMVLHLMGF VNRRLPHWPN MFRMTVDVTT MFVQRSIQGG APCELASVIL MLGGVVAGIS LFVVPTQYTG TAQDYKTIND EETPKKKSAI QKSQSKPAKA
|
| |