Gene Information |
Locus tag | PHATRDRAFT_48689 |
Symbol | |
ID | 7194919 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011686 |
Strand | + |
Start bp | 611804 |
End bp | 613483 |
Gene Length | 1680 bp |
Protein Length | 559 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183127 |
Protein GI | 219125731 |
COG category | |
COG ID | |
TIGRFAM ID | |
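The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and assumes that Start bp and End bp are 1-based and inclusive, and that the unspliced CDS length includes the stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above.
length_bp = gene_length(611804, 613483)
print(length_bp)                  # 1680
print(protein_length(length_bp))  # 559
```

Under these assumptions the computed values agree with the Gene Length (1680 bp) and Protein Length (559 aa) fields.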
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000119699 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCTGGC CAACGGTACG CAACGTCCAG AAACTCATCC TCCTTACTAT TCAAGTAGTG AGCAGTTTTG TTCCGAGACC GACCGCTCGC TCCAGATTGC AGTGGACGAC TCGAGCGACT GCCTCCTGCA GACTACTGTC GTCTACCGAA GGACCAAACA GAGATGACCT ACCAACTCCC ATATACCACC ACATAGATCT ATACAACGAA AACTCGTATA CCGACGACGT TGAGACAGTC TTGGGGCGTG TCGATGGGTT TATGAGACTG CTTGAGCAGC AGGAAGCCTG GCCACTTCTA CACGAGAGTA AAATCCGTGA TAGAATTCAT AGCCAGCATA CACGAGATTG CGAGGCAGCG TCTCTTAACT TGGCGCCCGA ATCGCCAGTC CTGCGCGACC AGCAATGCAG TACTCGGTAC GAATGGATTT TTTCTGGCAC CAAATCCTAC CCGTTAGCGG CGCAATCGTT AGAGCCAATA TTGAATACAA CGGCTATTTC TACAATTCGC GAAGCCGCAG AAGCGCATTG GGAAAATCCG GCTTTCTGTA CATCCCGGTT CACCTATCAG CGCCCAGGAA ACTATGAAGC CCACGTAACG GACTTGGGCG AGCGAGTTCG CTCGATTGTC AACGAAACAT TGACAACTAG AATTTACCCA CTAATTCGCG ACGTGTTTTG GAATGACCCA ACCTGTTCGT TGGCTCCAGT TGATCAACTT TGTGTGTACG ACGCTCTCTA CATACGCTAT AATTCCACAC AAGCCAAGCT ACTGGATCAC ATTGGTGCCG GGCAACCTTT GCACCGTGAC TTGGGACTCA TTAGTATCAA TATTCGACTG AACAACGATT TTGAAGGTGG TGGCACCTTT TTTGAAAATC AGCTCCTCGA CCGAAAGGAG TCCGATCTCG AACCAGGAAT AACACCTCTG ACACCTCTCG AGCGCGGCCA CGTCCTATTG CACAAATCAT CGGAACGACA TGCTGGTGCC AGCACCGTAG ATGGAGTGCG AGACATTCTC GTCTTTTTTG TATCAGGAGT CTCCTGCGAG ACAGGCTCCG GCAGTGGCTC GACAGTACCG ATGCCAATTC AATCCGCTAT TGCGAAACAG TCTCGTGGAT ACTGCGACGA CTGCTATTCG AATAATCCAC TACGGGCTAT CTTTTGTCGC ATCGCACATC AACGCTACGC GGTTCGGGTT GCGCCTTCGG ATGGGGAAGC TTGGCAGTAC TTGGGTACTG CCTTCATGGA GTACGATACC TACCTGGCAG TAATGAAGGC TTCCGCCGGG CTTCGAGGGG CCGTACTCCA GACGGCGACA CGATGTCTAC AGCTCGCCAC GAGACTGACG CCGTGCGATT CGCGTGTCTG GAACAACCTG GCTCTCACGC TGAACCGGCA ATGGAAATTG AAAAGCGACG CGAGTTTGCT TAATACCACG GAAATGGCTT TTGCGACGGC TCACCAACTT CTGAAAATGT CTCGAGAAAA GTGCGATGTT GAGGGCGAAT TGGACAATGT GAATGTAAAC TATGGACTAT TTCTTTCCAA TCAAGACAGG TTTATGGAGG CGGGGTGTAT TTTGGAAGGC ACTGCACTGA AGAAGATCAT AGATCAAGAT TGCGGAAAAG CGGTTGAAGA TGCCTATGAA TTGTGGAAGT TTTGTAGACA ACAGCAATAA
|
Protein sequence | MSWPTVRNVQ KLILLTIQVV SSFVPRPTAR SRLQWTTRAT ASCRLLSSTE GPNRDDLPTP IYHHIDLYNE NSYTDDVETV LGRVDGFMRL LEQQEAWPLL HESKIRDRIH SQHTRDCEAA SLNLAPESPV LRDQQCSTRY EWIFSGTKSY PLAAQSLEPI LNTTAISTIR EAAEAHWENP AFCTSRFTYQ RPGNYEAHVT DLGERVRSIV NETLTTRIYP LIRDVFWNDP TCSLAPVDQL CVYDALYIRY NSTQAKLLDH IGAGQPLHRD LGLISINIRL NNDFEGGGTF FENQLLDRKE SDLEPGITPL TPLERGHVLL HKSSERHAGA STVDGVRDIL VFFVSGVSCE TGSGSGSTVP MPIQSAIAKQ SRGYCDDCYS NNPLRAIFCR IAHQRYAVRV APSDGEAWQY LGTAFMEYDT YLAVMKASAG LRGAVLQTAT RCLQLATRLT PCDSRVWNNL ALTLNRQWKL KSDASLLNTT EMAFATAHQL LKMSREKCDV EGELDNVNVN YGLFLSNQDR FMEAGCILEG TALKKIIDQD CGKAVEDAYE LWKFCRQQQ
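The sequences above are shown in space-separated blocks of ten characters. The following minimal sketch shows one way to strip that spacing, compute GC content, and translate codons. It assumes the standard genetic code, since the Translation table field of this record is empty; the helper names are hypothetical. Only the first three blocks of the gene sequence are used, so the GC value printed here is for that fragment and not for the whole gene.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
# Assumption: the record's Translation table field is blank, so table 1 is a guess.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(blocked: str) -> str:
    """Remove the block spacing and normalize to upper case."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record above.
fragment = clean_sequence("ATGAGCTGGC CAACGGTACG CAACGTCCAG")
print(len(fragment))         # 30
print(gc_percent(fragment))  # 60.0
print(translate(fragment))   # MSWPTVRNVQ
```

The translated fragment matches the first block of the protein sequence listed above (MSWPTVRNVQ).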