Gene Information |
Locus tag | PHATRDRAFT_43745 |
Symbol | |
ID | 7197030 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | + |
Start bp | 1356847 |
End bp | 1358696 |
Gene Length | 1850 bp |
Protein Length | 585 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177814 |
Protein GI | 219112125 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
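The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the coding sequence ends in one stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1356847
end_bp = 1358696
protein_length_aa = 585

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: one codon per amino acid plus a single stop codon.
coding_bp = (protein_length_aa + 1) * 3

# Bases in the gene span that do not encode the protein
# (for a spliced gene this would include intron sequence).
noncoding_bp = gene_length_bp - coding_bp

print(f"gene length: {gene_length_bp} bp")
print(f"coding bases (with stop codon): {coding_bp} bp")
print(f"non-coding bases within the span: {noncoding_bp} bp")
```

Under these assumptions the computed gene length agrees with the Gene Length field. The span is longer than the coding bases alone, which fits the "Is gene spliced: Yes" entry.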
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0989203 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CACAGTGTGG CTTCCTTGAC TATCCCAAAA AATCCGATGT TCAATAAGTC ATAACGTATG TTACACAGCA AGGAATCAAA CATACTACTC ATGTCCCGTT TGGTTTTTCG AAAAGCGTCT CTAAACAGCG CCAGCAGTCG AGCCAGAGCC GTCCTCCGCC ATCCGTGTAA TAGCGCTCCA GAGAATGGTT CGTTCCTTGC GCCGTCGCTA GCGAACACTC ACCATTCGCA CAGCCTCCAC GTAGTGAGCA ATTTTTGCAC GAGTCTGGTC CGACGAGTCT TCTCGACGAC GACATCATCA ACGGACGAAG ATGTTACCGA TGAGATAATC TCGGACGATG AAGATTACTT CATTGGGTCG AGCACAAAGC CGGAACCATG GAAAACACTG CTTCAAATTC CTGCCCGCGC TGCATGGCGC GTGGCATCGC CGCATAGTGA TACACCTTTG GATAAAGCTC GTCAGTCTGT TTTGGATGAA GGTTCTCGAT CTAATAAGCA ATTGCACAAG TCCTACTGCG ATAAAGTAGT AGACGTCCAT AGAGAGTTGT TGGGTCGCCG AGAACGAGAA CGACTCCGCA TTCTACGATC TCGAGAGTAT AAGCCGCGAG AAATCAAGAA GGACGCGGCA ATTCAACCCG TCTTGTACGG CCCTGATGAA ACCTTGGCTG CTTTTAAATT TCGTCTACTG TCGAACTATG CGATTGCGTT TCGCGTCTTG GATGAGGCCA GTAGTCTGTT GGGGCCCAAC AAATGGAGAC CGAAGAGAGT TATCGATTTC GGAATTGGGT GTGGAAGTGC CGCGGCAGCA GCGATGGAAG TTTGGGACGA TATTGAATGG GTCCACGGAA TCGACTCCTC GCAAGCAATG CGAGAGGGCG CTCAGCTTTT CTTGGAAGAT TATATTAAAC ATCAAGGAAG AGAGAACGGC CCAGTGCGCG TTACCTTGTC GGGGCATCTT TCGGTGGAAG TGGCACCGCC ATCCTTTGAT CTCGCTCTAT TTTCCTATAC AGCCATGGAA CTATCACATA GCGCGGGGAT CCTGGCCGCA GCCGGGTCAT TGTGGGAAAA GCTTCTTCCT GGAGGTATTT TAGTCATGAT TGAACCGGGT ACACCCGACG GATTCAGTTC TGTTCGAATA GTACGTAATA TGTTGCTTGA ATGCTGTCCT CCTAATCAAT CGCAAGCGGG TGGCGACGAA TGCCATGTTA TTGCACCTTG TACACACAAC GGGCCTTGTC CCATGGAACG ATACCAGGAG CTTGTCGATG AACGACGAAC ACAGCAAGAT GTTCCAGAGC CATCCGTCGA TCCCGTTTCC CCAGGCCGCA AGGGTAAGGA CAAATCTGGC GAGCTTGAGG GACGCGAGGA CGTTGATGAA AACGACGGTA TTCGTACCGG ATTTTGCAGT TTTGTTCAAA CAATGCCTGG GGCGTCGTGG AATAGCAAAG GTGAAAAGTT CTCGTACCTT GTGGCACAGA AGAGACTTAC AGGAGAATCT TTGGACGAGC CCCACCCTTT TGCTGACGAC GACTTGCTCG CTTTGCTAGA GCGCACACAC AGATCGCCGA ATGACGTTCA AACTTTCCAG GCTGCCATCG ATTTGGAAGA GAGATACATA GATTCTGAGG ACGACACTTT AGGACTCGAG CTCCTTCGAG GCGACAGAGC CCGGGCGTCG TTCGGAAGAA TCGTGAATGC ACCCAAAAAG AAGAAGGGCC ACGTGTATAT TGAAACATGC ACTGCGCCTG GGCGATTAGA TCGGCACAAA GTGCGCAAAA GCTTGTCAAA GATCGTACCT 
GGTATATATG CAGCGGCAAG AAAAAGTCGG TGGGGCGGAT TCTTTCCTGA
|
Protein sequence | MLHSKESNIL LMSRLVFRKA SLNSASSRAR AVLRHPCNSA PENGSFLAPS LANTHHSHSL HVVSNFCTSL VRRVFSTTTS STDEDVTDEI ISDDEDYFIG SSTKPEPWKT LLQIPARAAW RVASPHSDTP LDKARQSVLD EGSRSNKQLH KSYCDKVVDV HRELLGRRER ERLRILRSRE YKPREIKKDA AIQPVLYGPD ETLAAFKFRL LSNYAIAFRV LDEASSLLGP NKWRPKRVID FGIGCGSAAA AAMEVWDDIE WVHGIDSSQA MREGAQLFLE DYIKHQGREN GPVRVTLSGH LSVEVAPPSF DLALFSYTAM ELSHSAGILA AAGSLWEKLL PGGILVMIEP GTPDGFSSVR IVRNMLLECC PPNQSQAGGD ECHVIAPCTH NGPCPMERYQ ELVDERRTQQ DVPEPSVDPV SPGRKGKDKS GELEGREDVD ENDGIRTGFC SFVQTMPGAS WNSKGEKFSY LVAQKRLTGE SLDEPHPFAD DDLLALLERT HRSPNDVQTF QAAIDLEERY IDSEDDTLGL ELLRGDRARA SFGRIVNAPK KKKGHVSAQS AQKLVKDRTW YICSGKKKSV GRILS
|
| |
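The sequences above are printed in space-separated blocks of ten characters. Here is a minimal sketch of how such a field might be cleaned and its GC fraction computed. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sample string is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(text: str) -> str:
    """Remove the whitespace that separates the ten-character blocks."""
    return "".join(text.split())


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string."""
    dna = dna.upper()
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# Illustrative input: the first two blocks of the gene sequence field.
sample = "CACAGTGTGG CTTCCTTGAC"
cleaned = clean_sequence(sample)
print(len(cleaned), f"{gc_fraction(cleaned):.2f}")
```

Applied to the complete gene sequence field, the same two calls could be used to compare the cleaned length and GC percentage with the Gene Length and GC content entries.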