Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50623 |
Symbol | |
ID | 7199481 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011701 |
Strand | + |
Start bp | 4624 |
End bp | 5845 |
Gene Length | 1222 bp |
Protein Length | 382 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185591 |
Protein GI | 219130901 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000102021 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AAGTTGGTTC TCCCTCTCAT AGTGGAACAA AGACGACGAA AATGAGCTCA AAAACCAAAA TGCGACGCAA GAAGCAATCC ATATCCTTAG GTAGCGTTAT TTCTGCTGCA GCTGTCGCGT ATGGAACATA CAAAGTAGCG GATTGGGCGT GGAATCGTTA TGTCACAAAA CGGAAGAAAA ATGATTATCA AGTCAACGCC GCCATTGCTA CTTCTTTCAT GAACTTCTTA TGCTCGCAAA CAAGTGTGGG CGCCCACGCG GAAGATGGAG TCGCTTCTCA TATTGATCAC ATCCCCGGCC CAAACCGTCG CTTACGAATG CGTCGCCAAC GCATGACTCG TTGCAGGCAG GAAGCAGCGC AGGCGCTTCG AGGTTTCTCA CCGGCACTCC GGTCGATTGT AGAGTTGCAT ACAAATACGG CGCAGGCAAC CCGGCTGCTC AAGCAACTTC GGGCGAATCG AACCACAGAA AAGCATGCTA CTTCTCGACG TTCTGAAGAA CAGGCGCTAT GGAAGGAAAT TCAACGGAAG ACGATGACCC GTATGTTGAC AACTGCCTAT GCCCATACGA TTTTATTTCT TGTCCTTACC ACGCAAGTAA ATCTATTGGG AGGACGATTA TTCGAGGAAT CTTTGCAGAA TACTTCCTTG TCTTCAAACG TCTCGATGAG TAACGACAGT GTCGCCTCCG ATCGAATGGT GTCTTATCAA GAGTCCCATC GTTTTGTCCT CCAGCATACA TATGATTATT TTCTGAACAA GGGTGTTCAC TCTCTGTTGT CAACAGTCGA GCAGGCTGTC GATTCTGTTT TGGGAGGATG GAACGTCTTC GATAAAGCAT GCCTACACAT TTCACGAGAA CAGTTTGACT GTGCGCTCGT GAAAATCCGA GGCTTGATAG AAGGTGGCCT GAGGACAGAT GTGAGCAGGA CTTCTGGAAG GTCATCAAGA CGCGAAAGCA TCCTTCGTTT TCTTATGCCC TCCTCAATCT TGGAGCATTC CATTCAAGAC GACCTAGCGA GATCCATTCT CGACGAAACT TGGGATCTTG TAGAAAGCCC TGTGTTTTCG GATGCTCAAC AGGAGTGTTT AAATGCCACT TTTGCATCTA TGCGGGATCG TTTTTGGGGC AAGATATTTG ATGACAACGG ACTTTCTGGG ACAAAACCAT GGGCGCATGT CTTGACCCAA CTAAGAACGA CGTCCAACAG TTTTTTCGTT GA
|
Protein sequence | MSSKTKMRRK KQSISLGSVI SAAAVAYGTY KVADWAWNRY VTKRKKNDYQ VNAAIATSFM NFLCSQTSVG AHAEDGVASH IDHIPGPNRR LRMRRQRMTR CRQEAAQALR GFSPALRSIV ELHTNTAQAT RLLKQLRANR TTEKHATSRR SEEQALWKEI QRKTMTRMLT TAYAHTILFL VLTTQVNLLG GRLFEESLQN TSLSSNVSMS NDSVASDRMV SYQESHRFVL QHTYDYFLNK GVHSLLSTVE QAVDSVLGGW NVFDKACLHI SREQFDCALV KIRGLIEGGL RTDVSRTSGR SSRRESILRF LMPSSILEHS IQDDLARSIL DETWDLVESP VFSDAQQECL NATFASMRDR FWGKIFDDNG LSGTKPWAHF FR
|
| |