Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_46335 |
Symbol | |
ID | 7201393 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011677 |
Strand | + |
Start bp | 962524 |
End bp | 964084 |
Gene Length | 1561 bp |
Protein Length | 490 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180549 |
Protein GI | 219119585 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00257866 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGAACCGGTG GTAAGCCAGA GTCGGACTCT TTTGAATTTC TCAAAGGATC GTCAATGAGA TTCGGAATAA CTGTGAACAC TTCTCGAAAT GTCTTCTTGG ACTCTCCAAA CGCGTCGTAG GTCTCACAGG ATCCTACCAC TGCACACCGA ATCTACGGAT GCAAAAAGAA CATCAACCGC ACCACGGCAC CATGATTCCA GAACATCTCA GTCTTTGCGT AGTTTCCCCA ACACCGATGG CAGAAACAAC TCGCTCACTA GTTTTGATCG GTTGTCGCCA GAGGACGATT CTTTGAAGCG ATCCAAAAAG GCAAGGTCCA AAAGGATTTC CCGTTATCTT GTGGGTTGTT TCTCTCTCCT CGTGATACTC GGTAAATTCA CTACCAACTT CCGAGGCCAG TCGTCGACTC AAGATGTGCA CTTCACCAGT ACCCATGGCT CTGTCAAAGC TTCCCCAGAT GCGGTTTTTT CAGTCAACGG AACGTCATCG TACAAATTGG ATACGTTGCC ATTGATACCA GAGGATCATC ACCGCCATAA AGTTCCCAAT ATCATTATAT TTACGCACTA TCGAGACTTT CTCCACGAAG ATCTTCCTTA CTCCATGCCA AATGTTTCAG CAGCACACCA TCAACGCTCA CATAACCTGA CAGAAGATCA AATTGAGTTA CTGGCCTTGC AGGCAAACGT CCGAGCATCG GCCAGCTTAC ACCCAAAAGC ACAAGTAAGA TTTCTTACCG ACAACGACTG TATCGAGAGC TTGTCTCACC TCCTTGACAG CGCCGAAGAC AATGCTGACT TGGTAGCGTA CTTTCGGCGG GAATCTATTG GCATGTTCAA AGCTGATTTG TGTCGGGGAG CTGCGCTCTA TGAAACCGGA GGTCTCTACT TTGATGTAGA TTTGGGCGTC CGCCAGAACG TCTTTGAAGT TCTGAAGGAG ACCACACGTT TCGCAACCGT TTTGGTTCAT TCTGCTTCTA ATCACAAGGG ATCCTTCTTT CAGGCCTTCA TCGCATCGAC CCCGCAACAT TTTGTATTGA AACGCTACAT TGAACTGTTC TTGTCCTATG CACGACACGA ACTAGATATC GATGGTCCGC TAGGCGTGAT TCTACTACAA CGCGCGTACA ATGAAATGGT GGAGGAGCGA CCGGACATTG TCGATTCGAC AGAATTGTGG CAGGAAGTCA TGTACAGACC AGGATTTCAA ACCAACATCC TGAGTGATGT GCCGCCCCCA ACCTGGGGCT ATCGGCGAGC CTGTAAATTC ATCGTACTTG CCAACAAGGC CCTGCCGTTG CGGGTACCTT TCTACAGTCG TATCGCTGGT TCACGCATGT GTCCATTCGG ATTTGATTCA CGGTACAAGA CAAAGCCGAA GAAGAAAAAG ACTGGGAAGA AAGCTAAAAC CACAACATCC GAAAGCGACG ACAATCAGCA CCACTTCGAA AGAATCAATC GCAAATTCGC AGCGGGCAGC GAAAAACAAT TTGATGAGCT ATGGATGAAT CAAAGCAACT TTAGGACCAA GAACAGTGCT TTTGTCGAAG AGGATGAGTA A
|
Protein sequence | MSSWTLQTRR RSHRILPLHT ESTDAKRTST APRHHDSRTS QSLRSFPNTD GRNNSLTSFD RLSPEDDSLK RSKKARSKRI SRYLVGCFSL LVILGKFTTN FRGQSSTQDV HFTSTHGSVK ASPDAVFSVN GTSSYKLDTL PLIPEDHHRH KVPNIIIFTH YRDFLHEDLP YSMPNVSAAH HQRSHNLTED QIELLALQAN VRASASLHPK AQVRFLTDND CIESLSHLLD SAEDNADLVA YFRRESIGMF KADLCRGAAL YETGGLYFDV DLGVRQNVFE VLKETTRFAT VLVHSASNHK GSFFQAFIAS TPQHFVLKRY IELFLSYARH ELDIDGPLGV ILLQRAYNEM VEERPDIVDS TELWQEVMYR PGFQTNILSD VPPPTWGYRR ACKFIVLANK ALPLRVPFYS RIAGSRMCPF GFDSRYKTKP KKKKTGKKAK TTTSESDDNQ HHFERINRKF AAGSEKQFDE LWMNQSNFRT KNSAFVEEDE
|
| |