Gene Information |
Locus tag | PHATRDRAFT_27375 |
Symbol | |
ID | 7201095 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | - |
Start bp | 461099 |
End bp | 463118 |
Gene Length | 2020 bp |
Protein Length | 537 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180241 |
Protein GI | 219118949 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTTCTTCCAA ACCAGGCCAG GGATTGTAGT TCACGGCGGT TCGTTCCCGT TGACAGTGAA GTACCTTACC ATGGTCAAGA AGCAAAAAGA GAAACAGTAC CAGTCGGCGA GGGAGCATCG ATGGACCTCA CAAGTTCGCT CCGGTTCGGC ACTACAAGCG ACCGCGTCGG CTGTCCAACG ACGACTGCCC TTTTCGCACT GCGCGTTAGG CTTGACACCG TACGAACATC CCGTGTGTAC ACGACAGGGT ATTGTATTTG AAAATTCTTC CTTACTACCG TTTCTAATGA AGCACAAGAC GGACCCAGTG ACTGGAGAAC CAGCAACATC GCGAGATATT GTCACCCTGC ATATGGACAA GGACGAGGAA GGGCGTTGGC AATGCCCGGT TTTGACAAAG CCTTTTTATG ATCACACCAA AATTGTAGCT ATTCTGCAGC CAGGAGGGAA CGAAGCCAAC GTCTATTCTT ACGAAGCATA CCGCGAGCTG AATCTCAAAG CAAAAAACTT TGAAGATCTC ATTTCGGGAC AGAAATTCAG TAGCAAAACT GATGTCATCA TTCTCAACGA CCCGAGCGAC GAAGCTTTCA ATCAGCGCAA GGACATCAAT CGGTTCTATC ATATCAAGCA TTCACGTGAG CTGGAAAAGG ATAAGGGAAA CACCGGGACG GTCCGGCATA GTTTGACGGC CACGAGAGTG ATGGAACAGC TGAACAAGAA TAAAGCTATA TCGGAAACAG CAGCTCAAAA GAAGCGAGTA GCTCCTGCCA CCTCATCGGG AGACAGCAAA CGCCCAAGAA TACTGGCAAA AGATGTGACA GGGGTCCAAT ACACAACTGG CAAGGGCGCG GCCTCGTTCA CGTCAACCGC GTTTGAAGTC TCCCAGATAA ACCAAGACCG TGATGCGACC GAAGAAGAAA TTCGAGAAGC TCAATTTCGC GTGATGCGTA AAATGAAAAA GAAGGGATAC GTCCGATTAC GAACAACGTT GGGGGATCTA ACACTCGAAC TGCATTGTGA GATGGTTCCA CGAACGTGTG CCAACTTTTT GGGACTGTGC GAACAGAAGA TGTATGACGG AACAGAATTT CACCGCCTTA TCCCCAACTT CATGATCCAA GGGGGCAAGT GCAAGGATGG GCCGGACGAA AGCGTCTGGG GTGGAACACT GGCTGACGAA TTTGACGAGC GATTGAAGCA TACGGGTGGG GGTGTTGTCT CAATGGCAAA TGCGGGACCA AATACTGGAA CGCGACAGTT TTTCATTACC TACAAGAGTT GTAATCACTT GGACCGAAAA CACAGCGTGT TTGCGACAGT GATCGATGGC ATGGAAGTCC TTAAATTGAC GGAAAAGGTC GGAAGAGACA AAAAGGAAAG GCCTTTAGAA AAGATTATGA TCTTCGGAAC TGACGTGTTA GCGAACCCAT GCAAAGAAGC CACAGAAAAG GAAGAAGAGC GGATCCGACT ACTCGTGGAG GCCCGAACAC CCAACCATGA AAATAGTCTT TTCGAGAAGC CGAAGGAGAC GAGGGGGCAG CCATCGCAGG AAGTTGGCCG GTATTTACGG GACAAAGTGA AGTCGTTCAA GCCGGCGACG GTGGAAAACG CCGGAACCAC GATTCCGTCT CGGCTGCCAC CTCCACCAAA GTCAACCTCT TTTGGAAACT TTGGCGGATG GTGAGCGAAA CGATTTTGCG ACAGTACATT GGAACATTCT TAGTCGTATA ACGTCAACTT TAGGGTGTCA CAACTCCGGT GCAGCGAAGG CTTCGTTGCT ATTCGAAGAT TTTGGCTATT 
CTCGACTACA GAAGAAGCAG GTCTCAATGA GGTATAGCAA CAATTTGATA GATACAGATT TGGCGCTTAG TGGCGCCGGT AGAGAAGGTA AACTGTGTGT ATCAACGCAT TGTACTCGTC TTTTAACGTA GCTCCACCTG GACTAGTTGG CACGGCTTCA CCGACATGAC GCTTTTCTGC TCCGCCTGGT AATAGTTTAT ATTGAAAAAG TAGCTTTCAA
|
Protein sequence | MVKKQKEKQY QSAREHRWTS QVRSGSALQA TASAVQRRLP FSHCALGLTP YEHPVCTRQG IVFENSSLLP FLMKHKTDPV TGEPATSRDI VTLHMDKDEE GRWQCPVLTK PFYDHTKIVA ILQPGGNEAN VYSYEAYREL NLKAKNFEDL ISGQKFSSKT DVIILNDPSD EAFNQRKDIN RFYHIKHSRE LEKDKGNTGT VRHSLTATRV MEQLNKNKAI SETAAQKKRV APATSSGDSK RPRILAKDVT GVQYTTGKGA ASFTSTAFEV SQINQDRDAT EEEIREAQFR VMRKMKKKGY VRLRTTLGDL TLELHCEMVP RTCANFLGLC EQKMYDGTEF HRLIPNFMIQ GGKCKDGPDE SVWGGTLADE FDERLKHTGG GVVSMANAGP NTGTRQFFIT YKSCNHLDRK HSVFATVIDG MEVLKLTEKV GRDKKERPLE KIMIFGTDVL ANPCKEATEK EEERIRLLVE ARTPNHENSL FEKPKETRGQ PSQEVGRYLR DKVKSFKPAT VENAGTTIPS RLPPPPKSTS FGNFGGW
|
| |