Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_43981 |
Symbol | |
ID | 7204195 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | + |
Start bp | 610446 |
End bp | 612519 |
Gene Length | 2074 bp |
Protein Length | 418 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186092 |
Protein GI | 219113017 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.321285 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACACAAA GACAGCGCAC CGATCGCCTG CTTTCTTCAG CGATGACTTC CCACCACATC AACAGCAACT TCTCGCTGGC TATCTTGGCC AGTGTGGGCG CTGGTGCAGC GCTCACCATT GCCGGCCAAT ACTGGCTTGC TCGTCGAAAA CCGGATGGCG AGACGGATCC TTCCATTGGG CACAGCCTTG TCTGGTCCAG CATCGGCGGG GCTATGAACG CGGCCATGAT GTATACTGGC GATCGTCTGA AACTTTACGA AACTCTTCGA GAAATGTGTG CAAAGCCTGC CTCTTCGGTG ACGGCTATTG AGTTGGCCGA AGCCACGGTA TGTTTACCAA AGCAACTCAC AGTCAATATT GTCAGCAAAG AGAGCTTACA CTAATACTTC ACTTCGCAGG GGCTAAATCA ACGATGGTTG CGAGAGTGGC TAGCGCAACA AGCGGCGATG GGTGTCCTCA AGCTTTTATC TGGAACGGAA AACGACGATG CGGCACTTAG ATACCGATTA CCGAAAGCAA CGGCTGAAGT TCTGGCCGAC CCGGATTCTA GAGAATACGA CATTGCCATG ATTCAAGCCG TACCGTCCCT TGTAAATCGC GCCAAAACGA TGTTACCGGA AGCCTTCGCA ACAGGAATGG GACGGCCTTA TGACGAAGCA GACGTGGCCG AAGCCATTGA CCGGAATCAT CGGAAGCACG TTCGCGACGT GTTCATCCCG CTCGTCCTAA GGCCCGCTCT CGGGGGAAGT ATAGCGCAGC ATTTGGAGGA TGGCTGCGAT GTGGCGGATC TGGGATGCGG TGCAGGAGTC ATGCTCATTT TACTAGCCAA ATCATTTCCT AAATCGAGCT TTCACGGGTT TGAAGTCTCT CAAGTAGCCT TGGAAAAGGC CGCTTTTCAC GTTGCGGCGG CTCGCGTGTC TAATGTCTTT CTGCATAACG CCACCGAGCC TGGCGAATCT TTGCGCGATC AACCAAGTCA ATTTGATCTT GTAGTGGTCT TTGATGTTCT GCACGACTCT CCCTTTCCTG ATGATTTGAT CCAGCAAGTC AAAACTGCTT TGAAGCCCTC TGGTGCCTGG TTGCTGGCGG ACATACCGAG CGCTCCCACG ACACGAGAAA ATCTCGTACA AATGCCCACC GCGTCAACGT ATTTTGCCTT TTCCACCTGC CTTTGTATGA GCTGCTCATT ATCCGAGGAA GGCGGCGCTG GTCTGGGTAC TTTAGGATTT TCCGTCCCTG TAGCTGAGAA AATGCTCCGA GAAGGCGGTT TTAAATTCGT CAAGGTCTTG CTAGAGAAAG ACAATGCACG GTGGTTCCTT GTGCATTGAG TGACCCAGTA TATGTGCTTC AATACAGTGA ACAAGACGTA GCAGTACAGC GGTAACACCG GGGGTACATA ACGTATCGCT GAAGAAGGCG TTCATGGATG TGGCGTGGAC GAAGGACTAA TGAAGAATTG CGCCTTCATA TCCAACGAAT AACGGAAAGA CTATATCTAA TTTTACAGTT TTCGGTTGGT CCCTACCTTG CTGTGTCACA GGTATCCAAC AGCGGTTGGA GTTTAATCAT CCGAACCCTT TCAGCGCCTC TATTGTGATC GCCTTGATGG AAGGATTTTC TATTTTTTTC TTCTCCTCCA CTGAAACGAC GCGTTCGCGA TCCGCCATTT TCGTATTGGG ACCACCTGGA TATCCCGCAT GATGCATGAG ATACAAGCGC ATCGAAATGG TAGCTACCGT CGAGCCGATG ACGCCTACTC CTAGCAAGAG ACGTTTTGCC TGAGAGGTTT TGACGTCACG GTATAATGTA ACGCAAATAT GCGCCACGGG TGCAGCGAGG ATGGGGCCAA GATAGACCCA CTGCCGTTCC CTCACTCGAG CCATGACAAT TTCCTGGTCA TGACTTTTGC CCGTTGACAT TGCTGCAAAT ACTGTTGAAA GCGCAGAGAC GCTAAAAGTT GATTCATTTG TTTGCACTAC AGCTCAATGG CGTGATATGA GCGTGGGATA GGCAAATGAT GCTATTCGGT CCTATCTAAT GGAAAATGAA ATTTTAAATA GCCTGTTTTC GGAC
|
Protein sequence | MTQRQRTDRL LSSAMTSHHI NSNFSLAILA SVGAGAALTI AGQYWLARRK PDGETDPSIG HSLVWSSIGG AMNAAMMYTG DRLKLYETLR EMCAKPASSV TAIELAEATG LNQRWLREWL AQQAAMGVLK LLSGTENDDA ALRYRLPKAT AEVLADPDSR EYDIAMIQAV PSLVNRAKTM LPEAFATGMG RPYDEADVAE AIDRNHRKHV RDVFIPLVLR PALGGSIAQH LEDGCDVADL GCGAGVMLIL LAKSFPKSSF HGFEVSQVAL EKAAFHVAAA RVSNVFLHNA TEPGESLRDQ PSQFDLVVVF DVLHDSPFPD DLIQQVKTAL KPSGAWLLAD IPSAPTTREN LVQMPTASTY FAFSTCLCMS CSLSEEGGAG LGTLGFSVPV AEKMLREGGF KFVKVLLEKD NARWFLVH
|
| |